Mergen

mirror of https://github.com/NaC-L/Mergen.git synced 2026-05-12 09:40:34 +00:00

Author	SHA1	Message	Date
naci	babe982b65	docs: correct SCOPE loop-header generalization status (#117 ) Line 28 read 'Temporarily disabled while the team keeps required VMP 3.8.x targets on the safe high-budget path'. That is stale relative to the current code: canGeneralizeStructuredLoopHeader (lifter/core/LifterClass.hpp) gates generalization on path-solve context plus nine operational guards, and the corresponding loop_generalization_* microtests pass on main. Describe the actual gating and point readers at docs/LOOP_HANDLING.md. Co-authored-by: yusufcanislek <yusuf.canislek@meetdandy.com>	2026-04-22 23:06:47 +03:00
naci	c6e4c33627	docs: add LOOP_HANDLING.md reference for loop detection, generalization, and phi consumption (#116 ) Captures the three-phase architecture (detect/generalize/consume), the path-solve context gating table, the GeneralizedLoopControlFieldState layout, mergeValue's widenFirstBackedge contract, the full set of retrieve_generalized_loop_* helpers, and the hardcoded reference-sample addresses (kThemidaControlCursorSlot, the three resolveTargetedThemidаR9 instruction addresses with fire-counts on the reference binary). Documents known limitations at the bottom: REP SCAS, VMP 3.6 INT 2 dispatcher, the reference-sample hardcodes, unrolling/LICM, multi-way backedges. Flags that SCOPE.md's 'loop-header generalization temporarily disabled' entry appears to be stale: the code gates generalization on path-solve context (ConditionalBranch / DirectJump / resolved IndirectJump) rather than disabling it wholesale. Not changed in this PR; maintainer decision. Co-authored-by: yusufcanislek <yusuf.canislek@meetdandy.com>	2026-04-22 23:05:24 +03:00
naci	5708deef54	lifter: allow resolved indirect jumps to participate in structured loop generalization (#98 ) * docs: sync rewrite workflow guidance * docs: drop machine-local pointers and fix stale README branch link * lifter: allow resolved indirect jumps to participate in structured loop generalization When a register-indirect jmp has already been resolved to a concrete target via solvePath (ConstantInt or solver), it's no longer speculative. If the target also points backward at a visited block, treat it as a loop back-edge for generalization purposes, the same way a direct or conditional jump would be treated. Introduces currentPathSolveAllowsStructuredLoopGeneralizationForResolvedTarget() alongside the existing narrow predicate. canGeneralizeStructuredLoopHeader gains an opt-in targetResolvedConcretely parameter that routes through the widened check. getLiftedBackedgeBB uses the widened variant so back-edge reuse fires for resolved indirect jumps. resolveTargetBlock passes targetResolvedConcretely=true (its entry condition requires a concrete destination) and extends stackBypassGeneralizedLoopAddresses to include IndirectJump-context inserts. Ret-path contexts remain excluded. Tests updated: the old runLoopGeneralizationIndirectJumpBlocked splits into runLoopGeneralizationIndirectJumpBlockedWhenUnresolved (unchanged semantics) and runLoopGeneralizationIndirectJumpAllowedWhenResolved (new). runPendingGeneralizedLoopBlockedByContext becomes runPendingGeneralizedLoopByContext with an expectReuse parameter; Ret still expects no reuse, IndirectJump with a resolved target now expects reuse. --------- Co-authored-by: yusufcanislek <yusuf.canislek@meetdandy.com>	2026-04-19 05:36:45 +03:00
naci	0fbc2e9a52	Upgrade rewrite gate clang-cl to 21.1.8; re-enable calc_fib/calc_sum_array (#96 ) The windows-latest preinstalled clang-cl (currently 20.1.8 at `C:\Program Files\LLVM\bin\clang-cl.exe`) produces a lifter binary that segfaults on calc_fib before emitting any IR, causing the rewrite gate to fail. Clang 21.1.8 has been verified locally to compile the lifter into a binary that lifts both calc_fib and calc_sum_array to their expected constant returns (`ret i64 13` and `ret i64 150`). Rolling back to clang 18.x is not an option: the runner image's MSVC STL (14.44+) hard-requires clang 19.0.0 or newer via a static_assert in yvals_core.h. Clang 21 satisfies that bound and dodges the clang 20.1.8 miscompile. Upgrading via `choco upgrade llvm --version=21.1.8` keeps the existing `C:\Program Files\LLVM\bin\clang-cl.exe` path valid, so the rest of the pipeline (Resolve LLVM_DIR, Resolve clang-cl, Configure, Build) is unchanged. ## Changes - `.github/workflows/rewrite-strict-gate.yml`: add an "Upgrade clang-cl to 21.1.8" step before `Resolve LLVM_DIR` that runs `choco upgrade llvm` and pins `CMAKE_{C,CXX}_COMPILER` to the upgraded binary. - `scripts/rewrite/instruction_microtests.json`: drop the `ci_skip` entries on `calc_fib` and `calc_sum_array`. - `docs/SCOPE.md`: bump the corpus counts to 33 samples / 177 runtime semantic cases. ## Follow-up Investigating the underlying clang 20.1.8 miscompile in the lifter is still worth doing \u2014 it's almost certainly UB somewhere in the structured-loop recovery path that clang 21 happens to tolerate. Tracked separately. Co-authored-by: NaC-L <nac-l@users.noreply.github.com>	2026-04-07 18:33:05 +03:00
naci	acab499d3f	Re-skip calc_fib and calc_sum_array in CI (#95 ) PR #93 un-skipped both samples after a clean local Release build proved they lift correctly, but the windows-latest CI lane still fails on them `Lifter failed for calc_fib` (run 24077021868). The HANDOFF note that windows-latest clang-cl produces a different codegen shape than the locally pinned clang-cl turned out to be the actual root cause; the "stale build cache" theory only explained the local symptom. Restoring the `ci_skip` entries unbreaks the rewrite-strict-gate and rewrite-quick-gate workflows. Real fix tracked as a follow-up: either teach the lifter the CI codegen shape, or pin the rewrite CI lane to a toolchain that matches the local one byte-for-byte. Also reverts the `docs/SCOPE.md` corpus counts to 31 samples / 175 cases. Co-authored-by: NaC-L <nac-l@users.noreply.github.com>	2026-04-07 17:04:16 +03:00
naci	089e10ac08	Re-enable calc_fib and calc_sum_array in rewrite gate (#93 ) Both samples were originally CI-skipped because windows-latest clang-cl produced loop/array codegen shapes that tripped the lifter on CI even though local runs passed. Since then the rewrite CI lane has been pinned to the same LLVM 18.1.8 clang-cl used locally (`eb49a35`, `949acaa`, `a28a368`) and several structured loop recovery fixes have landed (`2989e5a`, `2eaa22e`), so the codegen mismatch that motivated the skips is gone. Verified locally with a clean Release build (`cmd /c scripts\dev\configure_iced.cmd` followed by `build_iced.cmd`): - `calc_fib` lifts to `ret i64 13` and passes its semantic case - `calc_sum_array` lifts to `ret i64 150` and passes its semantic case - `python test.py all` is fully green: semantic 33/33 (was 31/31), baseline, micro --check-flags, full handler suite 115/119, determinism Drops the two `ci_skip` entries from `instruction_microtests.json` and updates `docs/SCOPE.md` corpus counts to 33 samples / 177 cases. Co-authored-by: NaC-L <nac-l@users.noreply.github.com>	2026-04-07 13:38:38 +03:00
naci	5ccd498998	Implement PUNPCKLQDQ and re-enable calc_cout (#92 ) - Add lift_punpcklqdq handler in Semantics_Misc.ipp (XMM dest, low-quadword interleave from dest+src into a 128-bit result; rejects MMX/non-XMM forms via the standard not_implemented bailout) - Wire OPCODE(punpcklqdq, PUNPCKLQDQ) in x86_64_opcodes.x and add a missing trailing newline - Add manual punpcklqdq case to TestInstructions.cpp (rdrand-style XMM seed) and matching seeds in build_full_handler_seed.py - Regenerate oracle_seed_full_handlers{,_enriched}.json, oracle_seed_vectors.json, and oracle_vectors_full_handlers.json with two punpcklqdq vectors (basic interleave, low-source-zero edge case) - Drop ci_skip on calc_cout in instruction_microtests.json now that the STL PUNPCKLQDQ path lifts cleanly (4/4 semantic cases pass locally) - Keep calc_fib and calc_sum_array ci_skipped: they still trip a separate lifter dyn_cast assertion that is not related to PUNPCKLQDQ; tracked as follow-up - Update docs/SCOPE.md handler counts (115/119 covered, 4 intentional skips) and corpus counts (31 active samples / 175 cases) Co-authored-by: NaC-L <nac-l@users.noreply.github.com>	2026-04-07 12:58:44 +03:00
yusufcanislek	825b29946d	Fix CI coverage counts in docs	2026-04-04 16:57:17 +03:00
yusufcanislek	fa95a27dae	CI-skip calc_sum_array on windows-latest	2026-04-04 16:46:34 +03:00
yusufcanislek	81bc3a89da	CI-skip calc_fib on windows-latest	2026-04-04 16:34:38 +03:00
yusufcanislek	2989e5ab58	Recover structured loop lifting safely	2026-04-03 19:54:51 +03:00
yusufcanislek	8fba033cc6	Fix VMP gate and loop safety	2026-04-03 15:00:42 +03:00
yusufcanislek	460e845aed	fix: stabilize full-handler oracle fixtures	2026-04-01 06:55:56 +03:00
yusufcanislek	1020775ec0	feat: prototype minimization + canonical IR naming Two new post-optimization passes that run after the final O2 pipeline: PrototypeMinimizationPass: - Removes unused function arguments based on Argument::use_empty() - Typical reduction: 34 params -> 0-2 (e.g. @main(i64 %RCX) instead of all 16 GPRs + 16 XMMs + 2 ptrs) - Splices basic blocks into new function, remaps argument uses, erases old function - Updated check_semantic.py to parse actual IR signatures instead of hardcoded 34-param list CanonicalNamingPass: - Strips address-derived suffixes from block/value names for deterministic output - Blocks: entry, bb1, bb2, ... (sequential) - Values: semantic prefix preserved, address suffix removed (realadd-5368713230- -> realadd) - Same input now produces byte-identical IR across rebuilds Also fixed writeFunctionToFile to use stored module pointer M instead of fnc->getParent() (dangling after prototype minimization erases the old function). Review fixes: - CanonicalNamingPass: use StringMap<unsigned> instead of DenseMap<StringRef> (dangling key) - PrototypeMinimizationPass: restrict call rewriting to CallInst (not InvokeInst/CallBrInst) - PrototypeMinimizationPass: guard F->eraseFromParent() with use_empty() check - check_semantic.py: widen define regex to handle dso_local and other prefixes All 28 samples pass, 146 semantic cases, 56 golden hashes updated.	2026-03-29 11:00:07 +03:00
naci	6ee50d315e	test: add jump table regression suite (5 samples, 39 semantic cases) (#80 ) * test: add jump table regression suite (5 samples, 39 semantic cases) Add 5 new jump table test cases covering the major dispatch patterns: - jumptable_rel32.asm: RIP-relative dword offset table (lea+movsxd+add+jmp) - jumptable_shifted.asm: base-shifted range check (sub before index) - jumptable_shared_targets.asm: multiple cases sharing handlers - jumptable_computation.asm: case bodies with symbolic arithmetic - calc_jumptable_large.c: 16-case dense C switch compiled at /O2 All 5 pass lifting and semantic validation (39 new cases, 146 total). Update golden hashes (46 -> 56 files), manifest, and docs. * fix(ci): exclude C-compiled samples from golden IR hashes C-compiled samples (calc_) produce address-dependent IR because the linker places symbols at different addresses depending on toolchain version, link order, and build environment. The determinism check comment (test.py L123-125) already documented this exclusion policy but the golden hash file included them anyway, causing rewrite-quick-gate to fail on CI. Remove all 14 calc_ entries from golden_ir_hashes.json (56 -> 42). C-compiled sample correctness is still validated by semantic tests. --------- Co-authored-by: yusufcanislek <yusuf.canislek@meetdandy.com>	2026-03-29 09:46:52 +03:00
yusufcanislek	6d0157f26b	feat: call-boundary ABI framework with strict clobber + speculative inlining scaffolding Cross-ABI call contract (AbiCallContract.hpp): - AbiKind enum (x64_msvc, x86_cdecl/stdcall/fastcall, unknown) - CallModelMode: strict (default) clobbers volatile regs, compat preserves all - CallEffects: arg regs, return regs, volatile set, stack cleanup, memory effect - Pre-built descriptors for x64 MSVC and x86 calling conventions - Structured diagnostics at every call site ([call-abi] prefix) Call-site semantics (lift_call): - applyPostCallEffects: assigns RAX=result, clobbers volatile in strict mode - emittedExternalCall flag: skips Unflatten inlining when CreateCall emitted - Import thunk detection (FF 25 jmp [IAT]): auto-outlines DLL imports - shouldOutlineCall hook: extensible policy for inline/outline decisions Bug fixes: - parseArgs(nullptr) duplicated RDI (18 values for 16-type slots) — now 16 GPRs + memory ptr - Unknown calls in lift_call never assigned RAX = call result — now they do - callFunctionIR routed through applyPostCallEffects for consistency Speculative inlining (disabled by default, opt-in via maxCallInlineBudget): - Budget-limited call inlining with bail-out to CreateCall + ABI effects - Worklist trimming on bail-out restores pre-call continuation - Works mechanically but needs smarter trigger policy (see open issue) Tests: - call_abi_compat_preserves_volatile: R10 survives, RAX = result - call_abi_strict_clobbers_volatile: R10 = undef, RBX preserved, RAX = result - call_abi_default_is_strict: verifies strict is the default - All existing baseline (90+), semantic (23/23), micro (15) tests pass - VMP 3.8.1 target produces identical a+b+c deobfuscation	2026-03-26 09:53:16 +03:00
yusufcanislek	eb10474eb8	feat: commit working-tree changes required by rewrite gates Lifter improvements: - PathSolver.ipp: enhanced path memoization, switch-target diagnostics - GEPTracker.ipp: expanded value tracking, graceful bail-out paths - Semantics_Misc.ipp: clean up CPUID handler (remove dead comments, simplify constant emission) Rewrite infrastructure: - instruction_microtests.json: add jumptable manifest entries (calc_jumptable, jumptable_basic, jumptable_dense) with semantic cases - golden_ir_hashes.json: add hashes for new jumptable samples - build_samples.cmd: support C jumptable /O2 compilation pass - oracle vectors: regenerated (oracle_vectors.json trimmed to current seed set, full-handler vectors updated with new handlers) - run_microtests.cmd / run_all_handlers.cmd: script improvements - test.py: add jumptable semantic cases to coverage Dev scripts: - configure_iced/zydis.cmd, build_iced/zydis.cmd: improved toolchain detection and MERGEN_BUILD_JOBS support Review automation: - format_comment.py, invariant_guard.py, risk_map.py, shard_pr.py: minor fixes aligned with verify_plan public API rename Docs: - REWRITE_BASELINE.md: updated coverage summary and script docs - REVIEWER_RULES.md: minor formatting	2026-03-26 07:53:43 +03:00
yusufcanislek	3308ad7f65	feat: add review automation toolkit with full cutover - review_buckets.py: shared bucket/risk/check taxonomy - risk_map.py: PR risk assessment from diff - invariant_guard.py: vector schema, manifest, backend invariant checks - verify_plan.py: targeted verification planner with execution mode - shard_pr.py: refactored to use shared bucket metadata - run_review.py: orchestrator wiring all modules - format_comment.py: markdown rendering for review comments - docs/REVIEWER_RULES.md: reviewer rules with automation shortcuts - .gitignore: ignore artifacts/ and tmp_*.json Removed parallel/duplicate review scripts (verification_plan.py, invariant_checks.py, lint_vectors.py, build_repro.py, __init__.py) by full cutover to canonical modules.	2026-03-19 19:04:59 +03:00
yusufcanislek	72974c016b	docs: update rewrite baseline and failure-contract gate	2026-03-19 01:59:51 +03:00
yusufcanislek	433eb12532	Fix unknown provider error path and baseline parity docs	2026-03-08 16:29:15 +03:00
yusufcanislek	f53308d3e4	Fix Sleigh dependency fallback path and baseline doc parity note	2026-03-08 16:07:02 +03:00
yusufcanislek	8e2ada491f	Add SSE2 integer XMM lifting and oracle coverage	2026-03-07 16:14:34 +03:00
yusufcanislek	a67bcf3ee2	Add C test binaries, NASM test cases, deterministic IR hashing, SCOPE doc Test infra: - test.py: flag checks always-on for quick/all; deterministic IR hash verification via SHA-256; update-golden subcommand - run.ps1: accept both .asm and .c source files in manifest validation - build_samples.cmd: compile C files with cl.exe /Od /GS- alongside NASM - CI: rewrite-strict-gate.yml uses test.py defaults (flags always on) New test cases (10 total): - 6 NASM: nested_branch, loop_simple, bitchain, multi_arg, diamond, cmov_chain - 4 C (MSVC /Od): calc_grade (5-way branch), calc_mixed (symbolic+concrete), calc_fib (loop->const fold to 13), calc_sum_array (array->const fold to 150) Manifest: 17 samples, 40 pattern checks Golden hashes: 34 .ll files (17 optimized + 17 unoptimized) Handler microtests: 108/111 (97.3%), flags enforced Docs: - docs/SCOPE.md: supported/unsupported pattern matrix	2026-03-05 20:31:53 +03:00
yusufcanislek	567e0d7daf	Add rewrite regression automation, vectors, and documentation	2026-03-03 23:04:21 +03:00
wcscpy	549775de1d	Fix typo in build instructions	2024-11-12 12:51:51 +01:00
Chrizz	f8f7d2f54b	Windows build instructions	2024-11-04 21:09:45 +01:00
Chrizz	091dc98341	Removed llvm 10 starting to add windows build instructions	2024-11-04 20:49:14 +01:00
naci	1ce4c013a9	clean building md we dont need submodules thx to cmkr we dont need to specify llvm path if we correctly installed llvm.	2024-09-29 07:09:31 +03:00
pseuxide	43aa94c2d9	update Dockerfile for the user command to be simpler	2024-08-02 03:24:40 +09:00
G0lge	77fbd84f48	make readme readable	2024-06-11 15:26:55 +03:00
G0lge	d2f5126c4d	fix build	2024-05-22 07:33:06 +03:00
r3bb1t	99d1fbd086	Added docs/BUILDING.md Contains basic instructions for building the project in Docker	2024-03-23 16:55:21 +03:00

32 Commits