naci 167025e2c2 Merge pull request #3 from mrexodia/cross-platform
Compilation works on Linux (LLVM 15)
2024-03-22 18:04:01 +03:00
2024-03-17 14:26:31 +03:00
2024-03-22 15:42:16 +01:00
2024-03-20 02:45:32 +03:00
2023-11-13 15:46:01 +03:00
2024-03-17 14:28:13 +03:00

Project Overview:

Mergen is a tool engineered to convert Assembly code into LLVM Intermediate Representation (IR). This tool is designed for:

  • The deobfuscation or devirtualization of obfuscated binary code
  • The enhancement of the reverse engineering process, making it more efficient and effective, especially for complex software systems.

Core Objectives:

Deobfuscation

Devirtualization

Optimization

Diagram

image

Example

This is our target program

int maths(int a, int b, int c) {
    return a + b - c;
}

image

VMProtect settings, everything is turned off, we virtualize the function on ultra setting. (Tested versions 3.4.0-3.6.0)

image

image

This is how it looks after virtualizing.

image

Here, we run mergen. First argument is the name of the file and the second argument is the address of the function. Look how simple it is to run. And we can compile the output so we can explore it using our favorite decompiler.

image

; ModuleID = 'my_lifting_module'
source_filename = "my_lifting_module"

; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none)
define i64 @main(i64 %rax, i64 %rcx, i64 %rdx, i64 %rbx, i64 %0, i64 %rbp, i64 %rsi, i64 %rdi, i64 %r8, i64 %r9, i64 %r10, i64 %r11, i64 %r12, i64 %r13, i64 %r14, i64 %r15, ptr nocapture readnone %memory) local_unnamed_addr #0 {
entry:
  %stackmemory = alloca i128, i128 20971520, align 8
  %1 = trunc i64 %r8 to i32
  %2 = trunc i64 %rdx to i32
  %3 = trunc i64 %rcx to i32
  %realadd-5369234850- = add i32 %2, %3
  %not17196 = sub i32 %realadd-5369234850-, %1
  %4 = zext i32 %not17196 to i64
  ret i64 %4
}

attributes #0 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) }

After compiling:

image

image

Now you might notice the registers are a little bit off. This is because of we dont follow the calling conventions, if we were to follow the calling conventions, function signature would look like this:

define i64 @main(i64 %rcx, i64 %rdx, i64 %rdx, i64 %r8, i64 %r9 ...) 

So, we just adjust the function signature to look normally. If you have more questions about this part, I suggest you research calling conventions and ABI.

image

image

What it can/cant do

It can devirtualize some versions of some programs.

It can figure out if jcc is opaque.

It cant decide which path to follow if jcc is not opaque.

It cant optimize whole binary, only one function at a time.

Getting in touch

Join our Mergen Discord Server to trade ideas or just chatting in general.

S
Description
Languages
C++ 69.1%
C 17.1%
Python 9.9%
PowerShell 1.2%
Batchfile 1%
Other 1.6%