LLI(1) | User Commands | LLI(1) |
lli - manual page for lli 19
OVERVIEW: llvm interpreter & dynamic compiler
USAGE: lli [options] <input bitcode> <program arguments>...
OPTIONS:
Color Options:
--color - Use colors in output (default=autodetect)
General options:
-O <char> - Optimization level. [-O0, -O1, -O2, or -O3] (default = '-O2')
--aarch64-neon-syntax=<value> - Choose style of NEON code to emit from AArch64 backend:
--aarch64-use-aa - Enable the use of AA during codegen.
--abort-on-max-devirt-iterations-reached - Abort when the max iterations for devirtualization CGSCC repeat pass is reached
--addrsig - Emit an address-significance table
--align-loops=<uint> - Default alignment for loops
--allow-ginsert-as-artifact - Allow G_INSERT to be considered an artifact. Hack around AMDGPU test infinite loops.
--amdgpu-atomic-optimizer-strategy=<value> - Select DPP or Iterative strategy for scan
--amdgpu-bypass-slow-div - Skip 64-bit divide for dynamic 32-bit values
--amdgpu-disable-loop-alignment - Do not align and prefetch loops
--amdgpu-dpp-combine - Enable DPP combiner
--amdgpu-dump-hsa-metadata - Dump AMDGPU HSA Metadata
--amdgpu-enable-merge-m0 - Merge and hoist M0 initializations
--amdgpu-enable-power-sched - Enable scheduling to minimize mAI power bursts
--amdgpu-kernarg-preload-count=<uint> - How many kernel arguments to preload onto SGPRs
--amdgpu-promote-alloca-to-vector-limit=<uint> - Maximum byte size to consider promote alloca to vector
--amdgpu-sdwa-peephole - Enable SDWA peepholer
--amdgpu-use-aa-in-codegen - Enable the use of AA during codegen.
--amdgpu-verify-hsa-metadata - Verify AMDGPU HSA Metadata
--amdgpu-vgpr-index-mode - Use GPR indexing mode instead of movrel for vector indexing
--arm-add-build-attributes -
--arm-implicit-it=<value> - Allow conditional instructions outdside of an IT block
--asm-show-inst - Emit internal instruction representation to assembly file
--atomic-first-counter - Use atomic fetch add for first counter in a function (usually the entry counter)
--basic-block-address-map - Emit the basic block address map section
--basic-block-sections=<all | <function list (file)> | labels | none> - Emit basic blocks into separate sections
--bounds-checking-single-trap - Use one trap block per function
--bounds-checking-unique-traps - Always use one trap per check
--bpf-stack-size=<int> - Specify the BPF stack size limit
--cfg-hide-cold-paths=<number> - Hide blocks with relative frequency below the given value
--cfg-hide-deoptimize-paths -
--cfg-hide-unreachable-paths -
--check-functions-filter=<regex> - Only emit checks for arguments of functions whose names match the given regular expression
--code-model=<value> - Choose code model
--compile-threads=<uint> - Choose the number of compile threads (jit-kind=orc-lazy only)
--cost-kind=<value> - Target cost kind
--crel - Use CREL relocation format for ELF
--cs-profile-generate - Perform context sensitive PGO instrumentation
--cs-profile-path=<string> - Context sensitive profile file path
--data-sections - Emit data into separate sections
--debug-entry-values - Enable debug info for the debug entry values.
--debug-info-correlate - Use debug info to correlate profiles. (Deprecated, use -profile-correlate=debug-info)
--debugger-tune=<value> - Tune debug info for a particular debugger
--debugify-func-limit=<ulong> - Set max number of processed functions per pass.
--debugify-level=<value> - Kind of debug info to add
--debugify-quiet - Suppress verbose debugify output
--denormal-fp-math=<value> - Select which denormal numbers the code is permitted to require
--denormal-fp-math-f32=<value> - Select which denormal numbers the code is permitted to require for float
--disable-auto-upgrade-debug-info - Disable autoupgrade of debug info
--disable-i2p-p2i-opt - Disables inttoptr/ptrtoint roundtrip optimization
--disable-lazy-compilation - Disable JIT lazy compilation
--disable-promote-alloca-to-lds - Disable promote alloca to LDS
--disable-promote-alloca-to-vector - Disable promote alloca to vector
--disable-tail-calls - Never emit tail calls
--dlopen=<string> - Dynamic libraries to load before linking
--do-counter-promotion - Do counter register promotion
--dot-cfg-mssa=<file name for generated dot file> - file name for generated dot file
--dwarf-version=<int> - Dwarf version
--dwarf64 - Generate debugging info in the 64-bit DWARF format
--emit-call-site-info - Emit call site debug information, if debug information is enabled.
--emit-compact-unwind-non-canonical - Whether to try to emit Compact Unwind for non canonical entries.
--emit-dwarf-unwind=<value> - Whether to emit DWARF EH frame entries.
--emscripten-cxx-exceptions-allowed=<string> - The list of function names in which Emscripten-style exception handling is enabled (see emscripten EMSCRIPTEN_CATCHING_ALLOWED options)
--emulated-tls - Use emulated TLS model
--enable-approx-func-fp-math - Enable FP math optimizations that assume approx func
--enable-cache-manager - Use cache manager to save/load modules
--enable-cse-in-irtranslator - Should enable CSE in irtranslator
--enable-cse-in-legalizer - Should enable CSE in Legalizer
--enable-emscripten-cxx-exceptions - WebAssembly Emscripten-style exception handling
--enable-emscripten-sjlj - WebAssembly Emscripten-style setjmp/longjmp handling
--enable-gvn-hoist - Enable the GVN hoisting pass (default = off)
--enable-gvn-memdep -
--enable-gvn-sink - Enable the GVN sinking pass (default = off)
--enable-jmc-instrument - Instrument functions with a call to __CheckForDebuggerJustMyCode
--enable-jump-table-to-switch - Enable JumpTableToSwitch pass (default = off)
--enable-load-in-loop-pre -
--enable-load-pre -
--enable-loop-simplifycfg-term-folding -
--enable-name-compression - Enable name/filename string compression
--enable-no-infs-fp-math - Enable FP math optimizations that assume no +-Infs
--enable-no-nans-fp-math - Enable FP math optimizations that assume no NaNs
--enable-no-signed-zeros-fp-math - Enable FP math optimizations that assume the sign of 0 is insignificant
--enable-no-trapping-fp-math - Enable setting the FP exceptions build attribute not to use exceptions
--enable-pgo-force-function-attrs - Enable pass to set function attributes based on PGO profiles
--enable-split-backedge-in-load-pre -
--enable-split-loopiv-heuristic - Enable loop iv regalloc heuristic
--enable-tlsdesc - Enable the use of TLS Descriptors
--enable-unsafe-fp-math - Enable optimizations that may decrease FP precision
--enable-vtable-profile-use - If ThinLTO and WPD is enabled and this option is true, vtable profiles will be used by ICP pass for more efficient indirect call sequence. If false, type profiles won't be used.
--enable-vtable-value-profiling - If true, the virtual table address will be instrumented to know the types of a C++ pointer. The information is used in indirect call promotion to do selective vtable-based comparison.
--entry-function=<function> - Specify the entry function (default = 'main') of the executable
--exception-model=<value> - exception model
--expand-variadics-override=<value> - Override the behaviour of expand-variadics
--experimental-debug-variable-locations - Use experimental new value-tracking variable locations
--experimental-debuginfo-iterators - Enable communicating debuginfo positions through iterators, eliminating intrinsics. Has no effect if --preserve-input-debuginfo-format=true.
--extra-archive=<input archive> - Extra archive files to be loaded
--extra-module=<input bitcode> - Extra modules to be loaded
--extra-object=<input object> - Extra object files to be loaded
--fake-argv0=<executable> - Override the 'argv[0]' value passed into the executing program
--fatal-warnings - Treat warnings as errors
--fdpic - Use the FDPIC ABI
--filetype=<value> - Choose a file type (not all types are supported by all targets):
--float-abi=<value> - Choose float ABI type
--force-dwarf-frame-section - Always emit a debug frame section.
--force-interpreter - Force interpretation: disable JIT
--force-tail-folding-style=<value> - Force the tail folding style
--fp-contract=<value> - Enable aggressive formation of fused FP ops
--frame-pointer=<value> - Specify frame pointer elimination optimization
--fs-profile-debug-prob-diff-threshold=<uint> - Only show debug message if the branch probility is greater than this value (in percentage).
--function-sections - Emit functions into separate sections
--generate-merged-base-profiles - When generating nested context-sensitive profiles, always generate extra base profile for function with all its context profiles merged into it.
--hash-based-counter-split - Rename counter variable of a comdat function based on cfg hash
--hexagon-add-build-attributes -
--hexagon-rdf-limit=<uint> -
--hot-cold-split - Enable hot-cold splitting pass
--hwasan-percentile-cutoff-hot=<int> - Hot percentile cuttoff.
--hwasan-random-rate=<number> - Probability value in the range [0.0, 1.0] to keep instrumentation of a function.
--ignore-xcoff-visibility - Not emit the visibility attribute for asm in AIX OS or give all symbols 'unspecified' visibility in XCOFF object file
--import-all-index - Import all external functions in index.
--incremental-linker-compatible - When used with filetype=obj, emit an object file which can be used with an incremental linker
--instcombine-code-sinking - Enable code sinking
--instcombine-guard-widening-window=<uint> - How wide an instruction window to bypass looking for another guard
--instcombine-max-num-phis=<uint> - Maximum number phis to handle in intptr/ptrint folding
--instcombine-max-sink-users=<uint> - Maximum number of undroppable users for instruction sinking
--instcombine-maxarray-size=<uint> - Maximum array size considered when doing a combine
--instcombine-negator-enabled - Should we attempt to sink negations?
--instcombine-negator-max-depth=<uint> - What is the maximal lookup depth when trying to check for viability of negation sinking.
--instrprof-atomic-counter-update-all - Make all profile counter updates atomic (for testing only)
--internalize-public-api-file=<filename> - A file containing list of symbol names to preserve
--internalize-public-api-list=<list> - A list of symbol names to preserve
--iterative-counter-promotion - Allow counter promotion across the whole loop nest.
--jd=<string> - Specifies the JITDylib to be used for any subsequent -extra-module arguments.
--jit-kind=<value> - Choose underlying JIT kind.
--jit-linker=<value> - Choose the dynamic linker/loader.
--large-data-threshold=<ulong> - Choose large data threshold for x86_64 medium code model
--lint-abort-on-error - In the Lint pass, abort on errors.
--load=<pluginfilename> - Load the specified plugin
--lower-allow-check-percentile-cutoff-hot=<int> - Hot percentile cuttoff.
--lower-allow-check-random-rate=<number> - Probability value in the range [0.0, 1.0] of unconditional pseudo-random checks.
--lto-aix-system-assembler=<path> - Path to a system assembler, picked up on AIX only
--lto-embed-bitcode=<value> - Embed LLVM bitcode in object files produced by LTO
--lto-pass-remarks-filter=<regex> - Only record optimization remarks from passes whose names match the given regular expression
--lto-pass-remarks-format=<format> - The format used for serializing remarks (default: YAML)
--lto-pass-remarks-output=<filename> - Output filename for pass remarks
--march=<string> - Architecture to generate code for (see --version)
--matrix-default-layout=<value> - Sets the default matrix layout
--matrix-print-after-transpose-opt -
--mattr=<a1,+a2,-a3,...> - Target specific attributes (-mattr=help for details)
--max-counter-promotions=<int> - Max number of allowed counter promotions
--max-counter-promotions-per-loop=<uint> - Max number counter promotions per loop to avoid increasing register pressure too much
--mc-relax-all - When used with filetype=obj, relax all fixups in the emitted object file
--mcabac - tbd
--mcpu=<cpu-name> - Target a specific cpu type (-mcpu=help for details)
--meabi=<value> - Set EABI type (default depends on triple):
--merror-missing-parenthesis - Error for missing parenthesis around predicate registers
--merror-noncontigious-register - Error for register names that aren't contigious
--mhvx - Enable Hexagon Vector eXtensions
--mhvx=<value> - Enable Hexagon Vector eXtensions
--mips-compact-branches=<value> - MIPS Specific: Compact branch policy.
--mips16-constant-islands - Enable mips16 constant islands.
--mips16-hard-float - Enable mips16 hard float.
--mir-strip-debugify-only - Should mir-strip-debug only strip debug info from debugified modules by default
--misexpect-tolerance=<uint> - Prevents emitting diagnostics when profile counts are within N% of the threshold..
--mno-compound - Disable looking for compound instructions for Hexagon
--mno-fixup - Disable fixing up resolved relocations for Hexagon
--mno-ldc1-sdc1 - Expand double precision loads and stores to their single precision counterparts
--mno-pairing - Disable looking for duplex instructions for Hexagon
--mtriple=<string> - Override target triple for module
--mwarn-missing-parenthesis - Warn for missing parenthesis around predicate registers
--mwarn-noncontigious-register - Warn for register names that arent contigious
--mwarn-sign-mismatch - Warn for mismatching a signed and unsigned value
--mxcoff-roptr - When set to true, const objects with relocatable address values are put into the RO data section.
--no-deprecated-warn - Suppress all deprecated warnings
--no-discriminators - Disable generation of discriminator information.
--no-integrated-as - Disable integrated assembler
--no-process-syms - Do not resolve lli process symbols in JIT'd code
--no-type-check - Suppress type errors (Wasm)
--no-warn - Suppress all warnings
--nozero-initialized-in-bss - Don't place zero-initialized symbols into bss section
--nvptx-sched4reg - NVPTX Specific: schedule for register pressue
--object-cache-dir=<string> - Directory to store cached object files (must be user writable)
--object-size-offset-visitor-max-visit-instructions=<uint> - Maximum number of instructions for ObjectSizeOffsetVisitor to look at
--orc-runtime=<string> - Use ORC runtime from given path
--per-module-lazy - Performs lazy compilation on whole module boundaries rather than individual functions
--pgo-block-coverage - Use this option to enable basic block coverage instrumentation
--pgo-temporal-instrumentation - Use this option to enable temporal instrumentation
--pgo-view-block-coverage-graph - Create a dot file of CFGs with block coverage inference information
--poison-checking-function-local - Check that returns are non-poison (for testing)
--print-pipeline-passes - Print a '-passes' compatible string describing the pipeline (best-effort only).
--profile-correlate=<value> - Use debug info or binary file to correlate profiles.
--promote-alloca-vector-loop-user-weight=<uint> - The bonus weight of users of allocas within loop when sorting profitable allocas
--r600-ir-structurize - Use StructurizeCFG IR pass
--relocation-model=<value> - Choose relocation model
--remote-mcjit - Execute MCJIT'ed code in a separate process.
--riscv-add-build-attributes -
--riscv-use-aa - Enable the use of AA during codegen.
--runtime-counter-relocation - Enable relocating counters at runtime.
--safepoint-ir-verifier-print-only -
--sample-profile-check-record-coverage=<N> - Emit a warning if less than N% of records in the input profile are matched to the IR.
--sample-profile-check-sample-coverage=<N> - Emit a warning if less than N% of samples in the input profile are matched to the IR.
--sample-profile-max-propagate-iterations=<uint> - Maximum number of iterations to go through when propagating sample block/edge weights through the CFG.
--sampled-instr-burst-duration=<uint> - Set the profile instrumentation burst duration, which can range from 0 to one less than the value of 'sampled-instr-period'. This number of samples will be recorded for each 'sampled-instr-period' count update. Setting to 1 enables simple sampling, in which case it is recommended to set 'sampled-instr-period' to a prime number.
--sampled-instr-period=<uint> - Set the profile instrumentation sample period. For each sample period, a fixed number of consecutive samples will be recorded. The number is controlled by 'sampled-instr-burst-duration' flag. The default sample period of 65535 is optimized for generating efficient code that leverages unsigned integer wrapping in overflow.
--sampled-instrumentation - Do PGO instrumentation sampling
--save-temp-labels - Don't discard temporary labels
--separate-named-sections - Use separate unique sections for named sections
--skip-ret-exit-block - Suppress counter promotion if exit blocks contain ret.
--soft-float - Generate software floating point library calls
--split-machine-functions - Split out cold basic blocks from machine functions based on profile information
--stack-size-section - Emit a section containing stack size metadata
--stack-symbol-ordering - Order local stack symbols.
--stackrealign - Force align the stack to the minimum alignment
--strict-dwarf - use strict dwarf
--summary-file=<string> - The summary file to use for function importing.
--swift-async-fp=<value> - Determine when the Swift async frame pointer should be set
--tail-predication=<value> - MVE tail-predication pass options
--tailcallopt - Turn fastcc calls into tail calls by (potentially) changing ABI.
--thinlto-assume-merged - Assume the input has already undergone ThinLTO function importing and the other pre-optimization pipeline changes.
--thread-entry=<string> - calls the given entry-point on a new thread (jit-kind=orc-lazy only)
--thread-model=<value> - Choose threading model
--threads=<int> -
--tls-size=<uint> - Bit size of immediate TLS offsets
--type-based-intrinsic-cost - Calculate intrinsics cost based only on argument types
--unique-basic-block-section-names - Give unique names to every basic block section
--unique-section-names - Give unique names to every section
--use-ctors - Use .ctors instead of .init_array.
--use-undef - Use undef when generating programs.
--vec-extabi - Enable the AIX Extended Altivec ABI.
--verify-region-info - Verify region info (time consuming)
--vp-counters-per-site=<number> - The average number of profile counters allocated per value profiling site.
--vp-static-alloc - Do static counter allocation for value profiler
--wasm-enable-eh - WebAssembly exception handling
--wasm-enable-exnref - WebAssembly exception handling (exnref)
--wasm-enable-sjlj - WebAssembly setjmp/longjmp handling
--write-experimental-debuginfo - Write debug info in the new non-intrinsic format. Has no effect if --preserve-input-debuginfo-format=true.
--x86-align-branch-boundary=<uint> - Control how the assembler should align branches with NOP. If the boundary's size is not 0, it should be a power of 2 and no less than 32. Branches will be aligned to prevent from being across or against the boundary of specified size. The default value 0 does not align branches.
--x86-pad-max-prefix-size=<uint> - Maximum number of prefixes to use for padding
--x86-relax-relocations - Emit GOTPCRELX/REX_GOTPCRELX instead of GOTPCREL on x86-64 ELF
--x86-sse2avx - Specify that the assembler should encode SSE instructions with VEX prefix
--xcoff-traceback-table - Emit the XCOFF traceback table
--xray-function-index - Emit xray_fn_idx section
Generic Options:
--help - Display available options (--help-hidden for more)
--help-list - Display list of available options (--help-list-hidden for more)
--version - Display the version of this program
Polly Options: Configure the polly loop optimizer
--polly - Enable the polly optimizer (with -O1, -O2 or -O3)
--polly-2nd-level-tiling - Enable a 2nd level loop of loop tiling
--polly-ast-print-accesses - Print memory access functions
--polly-context=<isl parameter set> - Provide additional constraints on the context parameters
--polly-dce-precise-steps=<int> - The number of precise steps between two approximating iterations. (A value of -1 schedules another approximation stage before the actual dead code elimination.
--polly-delicm-max-ops=<int> - Maximum number of isl operations to invest for lifetime analysis; 0=no limit
--polly-detect-full-functions - Allow the detection of full functions
--polly-dump-after - Dump module after Polly transformations into a file suffixed with "-after"
--polly-dump-after-file=<string> - Dump module after Polly transformations to the given file
--polly-dump-before - Dump module before Polly transformations into a file suffixed with "-before"
--polly-dump-before-file=<string> - Dump module before Polly transformations to the given file
--polly-enable-simplify - Simplify SCoP after optimizations
--polly-ignore-func=<string> - Ignore functions that match a regex. Multiple regexes can be comma separated. Scop detection will ignore all functions that match ANY of the regexes provided.
--polly-isl-arg=<argument> - Option passed to ISL
--polly-matmul-opt - Perform optimizations of matrix multiplications based on pattern matching
--polly-on-isl-error-abort - Abort if an isl error is encountered
--polly-only-func=<string> - Only run on functions that match a regex. Multiple regexes can be comma separated. Scop detection will run on all functions that match ANY of the regexes provided.
--polly-only-region=<identifier> - Only run on certain regions (The provided identifier must appear in the name of the region's entry block
--polly-only-scop-detection - Only run scop detection, but no other optimizations
--polly-optimized-scops - Polly - Dump polyhedral description of Scops optimized with the isl scheduling optimizer and the set of post-scheduling transformations is applied on the schedule tree
--polly-parallel - Generate thread parallel code (isl codegen only)
--polly-parallel-force - Force generation of thread parallel code ignoring any cost model
--polly-pattern-matching-based-opts - Perform optimizations based on pattern matching
--polly-postopts - Apply post-rescheduling optimizations such as tiling (requires -polly-reschedule)
--polly-pragma-based-opts - Apply user-directed transformation from metadata
--polly-pragma-ignore-depcheck - Skip the dependency check for pragma-based transformations
--polly-process-unprofitable - Process scops that are unlikely to benefit from Polly optimizations.
--polly-register-tiling - Enable register tiling
--polly-report - Print information about the activities of Polly
--polly-reschedule - Optimize SCoPs using ISL
--polly-show - Highlight the code regions that will be optimized in a (CFG BBs and LLVM-IR instructions)
--polly-show-only - Highlight the code regions that will be optimized in a (CFG only BBs)
--polly-stmt-granularity=<value> - Algorithm to use for splitting basic blocks into multiple statements
--polly-tc-opt - Perform optimizations of tensor contractions based on pattern matching
--polly-tiling - Enable loop tiling
--polly-vectorizer=<value> - Select the vectorization strategy
December 2024 | lli 19 |