New IR, or NIR, is an IR for Mesa intended to sit below GLSL IR and Mesa IR.
Its design inherits from the various IRs that Mesa has used in the past and
from Direct3D assembly, and it also includes a few new ideas. It is a flat,
typeless IR (flat in that it uses instructions instead of expressions),
similar to TGSI and Mesa IR. It also supports SSA, although it doesn't require
it.

Variables
=========

NIR includes support for source-level GLSL variables through a structure mostly
copied from GLSL IR. These will be used for linking and conversion from GLSL IR
(and later, from an AST), but for the most part, they will be lowered to
registers (see below) and loads/stores.

Registers
=========

Registers are light-weight: each one is a structure that contains only its
size, its index for liveness analysis, and an optional name for debugging. In
addition, registers can be local to a function or global to the entire shader;
the latter will be used in ARB_shader_subroutine for passing parameters and
getting return values from subroutines. Registers can also be arrays, in which
case they can be accessed indirectly. Each ALU instruction (add, subtract, etc.)
works directly with registers or SSA values (see below).
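
As a rough illustration of how small such a record can be, here is a minimal
sketch in C. The type toy_register, its fields, and the helper function are
hypothetical names made up for this example; they are not the declarations
found in nir.h.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical register record; all names are illustrative only. */
typedef struct {
   unsigned num_components;  /* size: number of vector components */
   unsigned num_array_elems; /* 0 means "not an array" */
   unsigned index;           /* dense index, e.g. for liveness bitsets */
   const char *name;         /* optional debug name, may be NULL */
   bool is_global;           /* shader-wide instead of function-local */
} toy_register;

/* Only array registers may be accessed through an indirect offset. */
static bool toy_register_allows_indirect(const toy_register *reg)
{
   assert(reg != NULL);
   return reg->num_array_elems > 0;
}
```

Keeping the array length in the same record is one possible design; it lets a
single check decide whether indirect access is legal.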

SSA
=========

Everywhere a register can be loaded/stored, an SSA value can be used instead.
The only exception is that arrays/indirect addressing are not supported with
SSA. Although research has been done on extensions of SSA to arrays, it's
usually for the purpose of parallelization (which we're not interested in), and
it adds overhead in the form of extra copies or extra arrays (which is much
more expensive than introducing copies between non-array registers). Each SSA
use points directly to its corresponding definition, which in turn points to
the instruction it is part of. This creates an implicit use-def chain and
avoids the need for an external structure for each SSA register.
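
The implicit use-def chain described above can be sketched with three small
structures. This is only an illustration under assumed names (toy_ssa_def,
toy_src, toy_instr, toy_src_producer); it is not the layout NIR itself uses.

```c
#include <assert.h>
#include <stddef.h>

struct toy_instr;

/* An SSA definition remembers the instruction that produces it. */
typedef struct {
   struct toy_instr *parent_instr;
   unsigned index;
} toy_ssa_def;

/* A use (source) points straight at the definition it reads. */
typedef struct {
   toy_ssa_def *ssa;
} toy_src;

typedef struct toy_instr {
   const char *op;
   toy_ssa_def def;  /* value produced by this instruction */
   toy_src srcs[2];  /* values consumed by this instruction */
   unsigned num_srcs;
} toy_instr;

/* Follow use -> definition -> instruction without any side table. */
static toy_instr *toy_src_producer(const toy_src *src)
{
   assert(src != NULL && src->ssa != NULL);
   return src->ssa->parent_instr;
}
```

Because the definition is embedded in the instruction and each use holds a
pointer to it, finding the producer of an operand is two pointer loads.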

Functions
=========

Support for function calls is mostly similar to GLSL IR. Each shader contains a
list of functions, and each function has a list of overloads. Each overload
contains a list of parameters, and may contain an implementation which specifies
the variables that correspond to the parameters and return value. Inlining a
function, assuming it has a single return point, is as simple as copying its
instructions, registers, and local variables into the target function and then
inserting copies to and from the new parameters as appropriate. After functions
are inlined and any non-subroutine functions are deleted, parameters and return
variables will be converted to global variables and then global registers. We
don't do this lowering earlier (i.e. the fortranizer idea) for a few reasons:

- If we want to do optimizations before link time, we need to have the function
signature available at link time.

- If we do any inlining before link time, then we might wind up with the
inlined function and the non-inlined function using the same global
variables/registers, which would preclude optimization.
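
The inlining recipe described earlier in this section (copy the callee's
instructions and registers, then insert copies to and from the parameters) can
be sketched as follows. This is a toy model with one parameter, one return
value, and a flat instruction array; every name in it (toy_func, toy_emit,
toy_inline_call, and so on) is an assumption made for the example.

```c
#include <assert.h>
#include <stddef.h>

enum toy_op { TOY_MOV, TOY_ADD };

typedef struct {
   enum toy_op op;
   unsigned dest, src0, src1; /* register numbers; src1 is unused for MOV */
} toy_instr;

typedef struct {
   toy_instr instrs[16];
   size_t num_instrs;
   unsigned num_regs;
   unsigned param_reg;  /* register holding the single parameter */
   unsigned return_reg; /* register holding the return value */
} toy_func;

static void toy_emit(toy_func *f, enum toy_op op,
                     unsigned dest, unsigned src0, unsigned src1)
{
   assert(f->num_instrs < 16);
   f->instrs[f->num_instrs++] = (toy_instr){ op, dest, src0, src1 };
}

/* Inline a single-return callee so that result_reg = callee(arg_reg). */
static void toy_inline_call(toy_func *caller, const toy_func *callee,
                            unsigned arg_reg, unsigned result_reg)
{
   /* Callee registers get fresh numbers above the caller's. */
   unsigned base = caller->num_regs;
   caller->num_regs += callee->num_regs;

   /* Copy the argument into the renumbered parameter register. */
   toy_emit(caller, TOY_MOV, base + callee->param_reg, arg_reg, 0);

   /* Copy the body, renumbering every register it mentions. */
   for (size_t i = 0; i < callee->num_instrs; i++) {
      const toy_instr *in = &callee->instrs[i];
      unsigned src1 = in->op == TOY_MOV ? 0 : base + in->src1;
      toy_emit(caller, in->op, base + in->dest, base + in->src0, src1);
   }

   /* Copy the return value out to the caller's result register. */
   toy_emit(caller, TOY_MOV, result_reg, base + callee->return_reg, 0);
}
```

For a callee whose body is a single add, the caller ends up with three
instructions: a copy in, the renumbered add, and a copy out.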

Intrinsics
=========

Any operation (other than function calls and textures) which touches a variable
or is not referentially transparent is represented by an intrinsic. Intrinsics
are similar to the idea of a "builtin function," i.e. a function declaration
whose implementation is provided by the backend, except they are more powerful
in the following ways:

- They can also load and store registers when appropriate, which limits the
number of variables needed in later stages of the IR while obviating the need
for a separate load/store variable instruction.

- Intrinsics can be marked as side-effect free, which permits them to be
treated like any other instruction when it comes to optimizations. This allows
load intrinsics to be represented as intrinsics while still being optimized
away by dead code elimination, common subexpression elimination, etc.
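
To make the second point concrete, the sketch below shows how a
side-effect-free flag could let a dead code elimination pass drop unused load
intrinsics while keeping stores. The flag TOY_CAN_ELIMINATE, the toy_intrinsic
record, and the explicit use counter are assumptions for this example, not
NIR's actual representation.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define TOY_CAN_ELIMINATE (1u << 0) /* no side effects: removable if unused */

typedef struct {
   const char *name;
   unsigned flags;
   unsigned num_uses; /* how many instructions read the result */
} toy_intrinsic;

/* Dead means: flagged side-effect free and nobody reads the result. */
static bool toy_is_dead(const toy_intrinsic *in)
{
   assert(in != NULL);
   return (in->flags & TOY_CAN_ELIMINATE) != 0 && in->num_uses == 0;
}

/* Compact the array in place, dropping dead intrinsics.
 * Returns the number of intrinsics kept. */
static size_t toy_dce(toy_intrinsic *list, size_t count)
{
   size_t kept = 0;
   for (size_t i = 0; i < count; i++) {
      if (!toy_is_dead(&list[i]))
         list[kept++] = list[i];
   }
   return kept;
}
```

An unused store is never removed here because it lacks the flag, which is the
behavior the flag is meant to encode.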

Intrinsics are used for:

- Atomic operations
- Memory barriers
- Subroutine calls
- Geometry shader emitVertex and endPrimitive
- Loading and storing variables (before lowering)
- Loading and storing uniforms, shader inputs and outputs, etc. (after lowering)
- Copying variables (cases where in GLSL the destination is a structure or
array)
- The kitchen sink
- ...

Textures
=========

Unfortunately, there are far too many texture operations to represent each one
of them with an intrinsic, so there's a special texture instruction similar to
the GLSL IR one. The biggest difference is that, while the texture instruction
has a sampler dereference field used just like in GLSL IR, this gets lowered to
a texture unit index (with a possible indirect offset) while the type
information of the original sampler is kept around for backends. Also, all the
non-constant sources are stored in a single array to make it easier for
optimization passes to iterate over all the sources.
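
The benefit of keeping all non-constant sources in one array can be sketched
like this. The source kinds, the toy_tex_instr layout, and both helpers are
hypothetical and only illustrate the idea of tagged sources in a single array.

```c
#include <assert.h>
#include <stddef.h>

enum toy_tex_src_type {
   TOY_TEX_SRC_COORD,
   TOY_TEX_SRC_LOD,
   TOY_TEX_SRC_BIAS,
   TOY_TEX_SRC_OFFSET
};

typedef struct {
   enum toy_tex_src_type type; /* what role this source plays */
   unsigned value;             /* stand-in for a register or SSA value */
} toy_tex_src;

typedef struct {
   unsigned texture_index; /* lowered texture unit index */
   toy_tex_src srcs[4];
   unsigned num_srcs;
} toy_tex_instr;

/* Return the position of the source with the given type, or -1. */
static int toy_tex_src_index(const toy_tex_instr *tex,
                             enum toy_tex_src_type type)
{
   for (unsigned i = 0; i < tex->num_srcs; i++) {
      if (tex->srcs[i].type == type)
         return (int)i;
   }
   return -1;
}

/* A pass can visit every source with one loop, whatever its role. */
static void toy_tex_renumber_srcs(toy_tex_instr *tex, unsigned offset)
{
   for (unsigned i = 0; i < tex->num_srcs; i++)
      tex->srcs[i].value += offset;
}
```

With separate named fields for each operand, the second helper would need one
statement per field and would have to be updated whenever a new kind is added.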

Control Flow
=========

Like in GLSL IR, control flow consists of a tree of "control flow nodes", which
include if statements and loops, and jump instructions (break, continue, and
return). Unlike GLSL IR, though, the leaves of the tree aren't statements but
basic blocks. Each basic block also keeps track of its successors and
predecessors, and function implementations keep track of the beginning basic
block (the first basic block of the function) and the ending basic block (a fake
basic block that every return statement points to). Together, these elements
make up the control flow graph, in this case a redundant piece of information on
top of the control flow tree that will be used by almost all the optimizations.
There are helper functions to add and remove control flow nodes that also update
the control flow graph, and so usually it doesn't need to be touched by passes
that modify control flow nodes.
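
As an illustration of helpers that keep the control flow graph in sync, the
sketch below wires up the successor and predecessor edges for a single if
statement. The toy_block type, the fixed-size predecessor array, and both
functions are assumptions made for this example; they do not reproduce the
helpers in nir_control_flow.h.

```c
#include <assert.h>
#include <stddef.h>

#define TOY_MAX_PREDS 8

typedef struct toy_block {
   struct toy_block *successors[2]; /* at most two, e.g. then/else */
   struct toy_block *predecessors[TOY_MAX_PREDS];
   unsigned num_predecessors;
} toy_block;

/* Add one CFG edge, updating both the successor and predecessor side. */
static void toy_link_blocks(toy_block *pred, toy_block *succ)
{
   assert(succ->num_predecessors < TOY_MAX_PREDS);
   if (pred->successors[0] == NULL) {
      pred->successors[0] = succ;
   } else {
      assert(pred->successors[1] == NULL);
      pred->successors[1] = succ;
   }
   succ->predecessors[succ->num_predecessors++] = pred;
}

/* Edges for: before; if (...) { then_blk } else { else_blk }; after */
static void toy_build_if(toy_block *before, toy_block *then_blk,
                         toy_block *else_blk, toy_block *after)
{
   toy_link_blocks(before, then_blk);
   toy_link_blocks(before, else_blk);
   toy_link_blocks(then_blk, after);
   toy_link_blocks(else_blk, after);
}
```

A pass that calls only toy_build_if never touches the edge lists directly,
which mirrors the point that passes modifying control flow nodes usually do
not need to update the graph themselves.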