Lines Matching refs:shift

8 ; Verify the cost of vector shift left instructions.
17 ; SSE2: Found an estimated cost of 4 for instruction: %shift
18 ; SSE41: Found an estimated cost of 4 for instruction: %shift
19 ; AVX: Found an estimated cost of 4 for instruction: %shift
20 ; AVX2: Found an estimated cost of 1 for instruction: %shift
21 ; XOPAVX: Found an estimated cost of 1 for instruction: %shift
22 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
23 %shift = shl <2 x i64> %a, %b
24 ret <2 x i64> %shift
29 ; SSE2: Found an estimated cost of 8 for instruction: %shift
30 ; SSE41: Found an estimated cost of 8 for instruction: %shift
31 ; AVX: Found an estimated cost of 8 for instruction: %shift
32 ; AVX2: Found an estimated cost of 1 for instruction: %shift
33 ; XOPAVX: Found an estimated cost of 2 for instruction: %shift
34 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
35 %shift = shl <4 x i64> %a, %b
36 ret <4 x i64> %shift
41 ; SSE2: Found an estimated cost of 10 for instruction: %shift
42 ; SSE41: Found an estimated cost of 10 for instruction: %shift
43 ; AVX: Found an estimated cost of 10 for instruction: %shift
44 ; AVX2: Found an estimated cost of 1 for instruction: %shift
45 ; XOPAVX: Found an estimated cost of 1 for instruction: %shift
46 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
47 %shift = shl <4 x i32> %a, %b
48 ret <4 x i32> %shift
53 ; SSE2: Found an estimated cost of 20 for instruction: %shift
54 ; SSE41: Found an estimated cost of 20 for instruction: %shift
55 ; AVX: Found an estimated cost of 20 for instruction: %shift
56 ; AVX2: Found an estimated cost of 1 for instruction: %shift
57 ; XOPAVX: Found an estimated cost of 2 for instruction: %shift
58 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
59 %shift = shl <8 x i32> %a, %b
60 ret <8 x i32> %shift
65 ; SSE2: Found an estimated cost of 32 for instruction: %shift
66 ; SSE41: Found an estimated cost of 32 for instruction: %shift
67 ; AVX: Found an estimated cost of 32 for instruction: %shift
68 ; AVX2: Found an estimated cost of 32 for instruction: %shift
69 ; XOP: Found an estimated cost of 1 for instruction: %shift
70 %shift = shl <8 x i16> %a, %b
71 ret <8 x i16> %shift
76 ; SSE2: Found an estimated cost of 64 for instruction: %shift
77 ; SSE41: Found an estimated cost of 64 for instruction: %shift
78 ; AVX: Found an estimated cost of 64 for instruction: %shift
79 ; AVX2: Found an estimated cost of 10 for instruction: %shift
80 ; XOP: Found an estimated cost of 2 for instruction: %shift
81 %shift = shl <16 x i16> %a, %b
82 ret <16 x i16> %shift
87 ; SSE2: Found an estimated cost of 26 for instruction: %shift
88 ; SSE41: Found an estimated cost of 26 for instruction: %shift
89 ; AVX: Found an estimated cost of 26 for instruction: %shift
90 ; AVX2: Found an estimated cost of 26 for instruction: %shift
91 ; XOP: Found an estimated cost of 1 for instruction: %shift
92 %shift = shl <16 x i8> %a, %b
93 ret <16 x i8> %shift
98 ; SSE2: Found an estimated cost of 52 for instruction: %shift
99 ; SSE41: Found an estimated cost of 52 for instruction: %shift
100 ; AVX: Found an estimated cost of 52 for instruction: %shift
101 ; AVX2: Found an estimated cost of 11 for instruction: %shift
102 ; XOP: Found an estimated cost of 2 for instruction: %shift
103 %shift = shl <32 x i8> %a, %b
104 ret <32 x i8> %shift
113 ; SSE2: Found an estimated cost of 4 for instruction: %shift
114 ; SSE41: Found an estimated cost of 4 for instruction: %shift
115 ; AVX: Found an estimated cost of 4 for instruction: %shift
116 ; AVX2: Found an estimated cost of 1 for instruction: %shift
117 ; XOPAVX: Found an estimated cost of 1 for instruction: %shift
118 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
120 %shift = shl <2 x i64> %a, %splat
121 ret <2 x i64> %shift
126 ; SSE2: Found an estimated cost of 8 for instruction: %shift
127 ; SSE41: Found an estimated cost of 8 for instruction: %shift
128 ; AVX: Found an estimated cost of 8 for instruction: %shift
129 ; AVX2: Found an estimated cost of 1 for instruction: %shift
130 ; XOPAVX: Found an estimated cost of 2 for instruction: %shift
131 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
133 %shift = shl <4 x i64> %a, %splat
134 ret <4 x i64> %shift
139 ; SSE2: Found an estimated cost of 10 for instruction: %shift
140 ; SSE41: Found an estimated cost of 10 for instruction: %shift
141 ; AVX: Found an estimated cost of 10 for instruction: %shift
142 ; AVX2: Found an estimated cost of 1 for instruction: %shift
143 ; XOPAVX: Found an estimated cost of 1 for instruction: %shift
144 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
146 %shift = shl <4 x i32> %a, %splat
147 ret <4 x i32> %shift
152 ; SSE2: Found an estimated cost of 20 for instruction: %shift
153 ; SSE41: Found an estimated cost of 20 for instruction: %shift
154 ; AVX: Found an estimated cost of 20 for instruction: %shift
155 ; AVX2: Found an estimated cost of 1 for instruction: %shift
156 ; XOPAVX: Found an estimated cost of 2 for instruction: %shift
157 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
159 %shift = shl <8 x i32> %a, %splat
160 ret <8 x i32> %shift
165 ; SSE2: Found an estimated cost of 32 for instruction: %shift
166 ; SSE41: Found an estimated cost of 32 for instruction: %shift
167 ; AVX: Found an estimated cost of 32 for instruction: %shift
168 ; AVX2: Found an estimated cost of 32 for instruction: %shift
169 ; XOP: Found an estimated cost of 1 for instruction: %shift
171 %shift = shl <8 x i16> %a, %splat
172 ret <8 x i16> %shift
177 ; SSE2: Found an estimated cost of 64 for instruction: %shift
178 ; SSE41: Found an estimated cost of 64 for instruction: %shift
179 ; AVX: Found an estimated cost of 64 for instruction: %shift
180 ; AVX2: Found an estimated cost of 10 for instruction: %shift
181 ; XOP: Found an estimated cost of 2 for instruction: %shift
183 %shift = shl <16 x i16> %a, %splat
184 ret <16 x i16> %shift
189 ; SSE2: Found an estimated cost of 26 for instruction: %shift
190 ; SSE41: Found an estimated cost of 26 for instruction: %shift
191 ; AVX: Found an estimated cost of 26 for instruction: %shift
192 ; AVX2: Found an estimated cost of 26 for instruction: %shift
193 ; XOP: Found an estimated cost of 1 for instruction: %shift
195 %shift = shl <16 x i8> %a, %splat
196 ret <16 x i8> %shift
201 ; SSE2: Found an estimated cost of 52 for instruction: %shift
202 ; SSE41: Found an estimated cost of 52 for instruction: %shift
203 ; AVX: Found an estimated cost of 52 for instruction: %shift
204 ; AVX2: Found an estimated cost of 11 for instruction: %shift
205 ; XOP: Found an estimated cost of 2 for instruction: %shift
207 %shift = shl <32 x i8> %a, %splat
208 ret <32 x i8> %shift
217 ; SSE2: Found an estimated cost of 4 for instruction: %shift
218 ; SSE41: Found an estimated cost of 4 for instruction: %shift
219 ; AVX: Found an estimated cost of 4 for instruction: %shift
220 ; AVX2: Found an estimated cost of 1 for instruction: %shift
221 ; XOPAVX: Found an estimated cost of 1 for instruction: %shift
222 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
223 %shift = shl <2 x i64> %a, <i64 1, i64 7>
224 ret <2 x i64> %shift
229 ; SSE2: Found an estimated cost of 8 for instruction: %shift
230 ; SSE41: Found an estimated cost of 8 for instruction: %shift
231 ; AVX: Found an estimated cost of 8 for instruction: %shift
232 ; AVX2: Found an estimated cost of 1 for instruction: %shift
233 ; XOPAVX: Found an estimated cost of 2 for instruction: %shift
234 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
235 %shift = shl <4 x i64> %a, <i64 1, i64 7, i64 15, i64 31>
236 ret <4 x i64> %shift
241 ; SSE2: Found an estimated cost of 6 for instruction: %shift
242 ; SSE41: Found an estimated cost of 1 for instruction: %shift
243 ; AVX: Found an estimated cost of 1 for instruction: %shift
244 ; AVX2: Found an estimated cost of 1 for instruction: %shift
245 ; XOPAVX: Found an estimated cost of 1 for instruction: %shift
246 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
247 %shift = shl <4 x i32> %a, <i32 4, i32 5, i32 6, i32 7>
248 ret <4 x i32> %shift
253 ; SSE2: Found an estimated cost of 12 for instruction: %shift
254 ; SSE41: Found an estimated cost of 2 for instruction: %shift
255 ; AVX: Found an estimated cost of 4 for instruction: %shift
256 ; AVX2: Found an estimated cost of 1 for instruction: %shift
257 ; XOPAVX: Found an estimated cost of 2 for instruction: %shift
258 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
259 %shift = shl <8 x i32> %a, <i32 4, i32 5, i32 6, i32 7, i32 0, i32 1, i32 2, i32 3>
260 ret <8 x i32> %shift
265 ; SSE2: Found an estimated cost of 1 for instruction: %shift
266 ; SSE41: Found an estimated cost of 1 for instruction: %shift
267 ; AVX: Found an estimated cost of 1 for instruction: %shift
268 ; AVX2: Found an estimated cost of 1 for instruction: %shift
269 ; XOP: Found an estimated cost of 1 for instruction: %shift
270 %shift = shl <8 x i16> %a, <i16 0, i16 1, i16 2, i16 3, i16 4, i16 5, i16 6, i16 7>
271 ret <8 x i16> %shift
276 ; SSE2: Found an estimated cost of 2 for instruction: %shift
277 ; SSE41: Found an estimated cost of 2 for instruction: %shift
278 ; AVX: Found an estimated cost of 4 for instruction: %shift
279 ; AVX2: Found an estimated cost of 1 for instruction: %shift
280 ; XOPAVX: Found an estimated cost of 2 for instruction: %shift
281 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
282 …%shift = shl <16 x i16> %a, <i16 0, i16 1, i16 2, i16 3, i16 4, i16 5, i16 6, i16 7, i16 0, i16 1,…
283 ret <16 x i16> %shift
288 ; SSE2: Found an estimated cost of 26 for instruction: %shift
289 ; SSE41: Found an estimated cost of 26 for instruction: %shift
290 ; AVX: Found an estimated cost of 26 for instruction: %shift
291 ; AVX2: Found an estimated cost of 26 for instruction: %shift
292 ; XOP: Found an estimated cost of 1 for instruction: %shift
293 …%shift = shl <16 x i8> %a, <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 7, i8 6, i8 5, i8 4…
294 ret <16 x i8> %shift
299 ; SSE2: Found an estimated cost of 52 for instruction: %shift
300 ; SSE41: Found an estimated cost of 52 for instruction: %shift
301 ; AVX: Found an estimated cost of 52 for instruction: %shift
302 ; AVX2: Found an estimated cost of 11 for instruction: %shift
303 ; XOP: Found an estimated cost of 2 for instruction: %shift
304 …%shift = shl <32 x i8> %a, <i8 0, i8 1, i8 2, i8 3, i8 4, i8 5, i8 6, i8 7, i8 7, i8 6, i8 5, i8 4…
305 ret <32 x i8> %shift
314 ; SSE2: Found an estimated cost of 1 for instruction: %shift
315 ; SSE41: Found an estimated cost of 1 for instruction: %shift
316 ; AVX: Found an estimated cost of 1 for instruction: %shift
317 ; AVX2: Found an estimated cost of 1 for instruction: %shift
318 ; XOPAVX: Found an estimated cost of 1 for instruction: %shift
319 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
320 %shift = shl <2 x i64> %a, <i64 7, i64 7>
321 ret <2 x i64> %shift
326 ; SSE2: Found an estimated cost of 2 for instruction: %shift
327 ; SSE41: Found an estimated cost of 2 for instruction: %shift
328 ; AVX: Found an estimated cost of 2 for instruction: %shift
329 ; AVX2: Found an estimated cost of 1 for instruction: %shift
330 ; XOPAVX: Found an estimated cost of 2 for instruction: %shift
331 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
332 %shift = shl <4 x i64> %a, <i64 7, i64 7, i64 7, i64 7>
333 ret <4 x i64> %shift
338 ; SSE2: Found an estimated cost of 1 for instruction: %shift
339 ; SSE41: Found an estimated cost of 1 for instruction: %shift
340 ; AVX: Found an estimated cost of 1 for instruction: %shift
341 ; AVX2: Found an estimated cost of 1 for instruction: %shift
342 ; XOPAVX: Found an estimated cost of 1 for instruction: %shift
343 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
344 %shift = shl <4 x i32> %a, <i32 5, i32 5, i32 5, i32 5>
345 ret <4 x i32> %shift
350 ; SSE2: Found an estimated cost of 2 for instruction: %shift
351 ; SSE41: Found an estimated cost of 2 for instruction: %shift
352 ; AVX: Found an estimated cost of 2 for instruction: %shift
353 ; AVX2: Found an estimated cost of 1 for instruction: %shift
354 ; XOPAVX: Found an estimated cost of 2 for instruction: %shift
355 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
356 %shift = shl <8 x i32> %a, <i32 5, i32 5, i32 5, i32 5, i32 5, i32 5, i32 5, i32 5>
357 ret <8 x i32> %shift
362 ; SSE2: Found an estimated cost of 1 for instruction: %shift
363 ; SSE41: Found an estimated cost of 1 for instruction: %shift
364 ; AVX: Found an estimated cost of 1 for instruction: %shift
365 ; AVX2: Found an estimated cost of 1 for instruction: %shift
366 ; XOP: Found an estimated cost of 1 for instruction: %shift
367 %shift = shl <8 x i16> %a, <i16 3, i16 3, i16 3, i16 3, i16 3, i16 3, i16 3, i16 3>
368 ret <8 x i16> %shift
373 ; SSE2: Found an estimated cost of 2 for instruction: %shift
374 ; SSE41: Found an estimated cost of 2 for instruction: %shift
375 ; AVX: Found an estimated cost of 2 for instruction: %shift
376 ; AVX2: Found an estimated cost of 1 for instruction: %shift
377 ; XOPAVX: Found an estimated cost of 2 for instruction: %shift
378 ; XOPAVX2: Found an estimated cost of 1 for instruction: %shift
379 …%shift = shl <16 x i16> %a, <i16 3, i16 3, i16 3, i16 3, i16 3, i16 3, i16 3, i16 3, i16 3, i16 3,…
380 ret <16 x i16> %shift
385 ; SSE2: Found an estimated cost of 1 for instruction: %shift
386 ; SSE41: Found an estimated cost of 1 for instruction: %shift
387 ; AVX: Found an estimated cost of 1 for instruction: %shift
388 ; AVX2: Found an estimated cost of 1 for instruction: %shift
389 ; XOP: Found an estimated cost of 1 for instruction: %shift
390 …%shift = shl <16 x i8> %a, <i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3…
391 ret <16 x i8> %shift
396 ; SSE2: Found an estimated cost of 2 for instruction: %shift
397 ; SSE41: Found an estimated cost of 2 for instruction: %shift
398 ; AVX: Found an estimated cost of 2 for instruction: %shift
399 ; AVX2: Found an estimated cost of 11 for instruction: %shift
400 ; XOP: Found an estimated cost of 2 for instruction: %shift
401 …%shift = shl <32 x i8> %a, <i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3, i8 3…
402 ret <32 x i8> %shift
457 ; On AVX2 we are able to lower the following shift into a single
459 ; In all other cases, this shift is scalarized as the target does not support
474 ; v16i16 and v8i32 shift left by non-uniform constant are lowered into
482 ; With SSE2 and SSE4.1, the vector shift cost for 'test6' is twice
500 ; With SSE2 and SSE4.1, the vector shift cost for 'test7' is twice
517 ; On AVX2 we are able to lower the following shift into a single
519 ; In all other cases, this shift is scalarized as the target does not support
565 ; On AVX2 we are able to lower the following shift into a sequence of
567 ; In all other cases, this shift is scalarized as we don't have vpsllv