#include <stdio.h>
#include "asm_x64.h"
int main() {
char* str = "Hello World!";
x64 code = {
{ MOV, rax, imptr(puts) },
{ MOV, rcx, imptr(str) }, // RDI for System V
{ JMP, rax },
};
uint32_t len = 0;
uint8_t* assembled = x64as(code, sizeof(code) / sizeof(code[0]), &len);
if(!assembled) return 1;
x64exec(assembled, len)(); // Prints "Hello World!"
return 0;
}
Download
asm_x64.c
andasm_x64.h
into your project and just includeasm_x64.h
to start assembling!
- Simple and easy to use, only requiring 2 function calls to run your code.
- Supports AVX-256 and many other x86 extensions.
- Fast, assembling up to 100 million instructions per second.
- Easy and flexible syntax, allowing you as much freedom as possible with coding practices.
- Simple error handling system, returning 0 from a function if it failed and fast error retrieval with
x64error(NULL)
. - Stringification of the IR for easy debugging with
x64stringify(code, len)
.
This library is useful for any code generated dynamically from user input. This includes:
- JIT Compilers
- Emulators
- Runtime optimizations / code generation
- Testing / benchmarking software
- Writing your own assemblers!
I would highly recommend using something like examples/vec.h
to dynamically push code onto a single array throughout your application with very low latency. I show this off in examples/bf_compiler.c
!
Assembler is built in an optimized fashion where anything that can be precomputed, is precomputed. In the above screenshot, it's shown that an optimized build can assemble most instructions in about 15 nanoseconds, which goes down to 30 for unoptimized builds.
x64
is an array of x64Ins
structs. The first member of the struct is op
, or the operation, an enum defined by the asm_x64.h
header. The other 4 members are x64Operand
structs, which are just a combination of the type of operand with the value.
An example instruction mov rax, 0
would be written as:
x64 code = { MOV, rax, imm(0) };
Notice the use of rax
and imm(0)
. All x86 registers like rax
(including mm
s, ymm
s etc) are defined as macros with the type x64Operand
. Other types of macros:
imm()
,im8()
,im16
,im32()
,im64()
andimptr()
for immediate values.mem()
,m8()
,m16()
,m32()
,m64()
,m128()
,m256()
andm512()
for memory addresses.rel()
for relative offsets referencing other instructions. The use of this macro requires soft linking if used that way.- Note:
rel(0)
references the current instruction, soJMP, rel(0)
jumps back to itself infinitely! Pass in 1 to jump to the next instruction.
- Note:
Let's start off with an example of lea rax, ds:[rax + 0xffe + rdx * 2]
in chasm:
x64 code = { LEA, rax, mem($rax, 0x0ffe, $rdx, 2, $ds) };
This is a variable length macro, with each argument being optional. Each of the register arguments of the mem()
macro have to be preceeded with a $
prefix. Any 32 bit signed integer can be passed for the offset parameter, and only 1, 2, 4 and 8 are allowed in the 4th parameter, also called the "scale" parameter (ANY OTHER VALUE WILL GO TO 1, x86 limitation). The last parameter is a segment register, also preceeded with a $
.
Other valid mem()
syntax examples are: mem($rax)
, mem($none, 0, $rdx, 8)
, mem($none, 0x60, $none, 1, $gs)
and with VSIB mem($rdx, 0, $ymm2, 4)
.
m8()
, m16()
, m32()
, m64()
, m128()
, m256()
and m512()
are specific versions of the mem()
macro, referencing the exact size of data accessed with the exact same syntax. Use mem()
with FPU, otherwise use the specific version of the macro if data size is known. The most efficient/common size is selected with mem()
, but can cause bugs when it's smaller than you intend.
Important
Make sure to pass in $none for register parameters you are not using, as it will assume eax if you pass in 0! If you omit arguments though, $none
is assumed :)
$rip
is a valid register to use with mem()
, but it's not very useful when you might not know the byte-length of the instructions in between the ones you're trying to reference. This is where $riprel
can be used as the base register for mem()
allowing you to reference other instructions without knowing the byte-length in between! In $riprel
, just like rel()
, 0 means the current instruction. This is the answer to lea rax, [$+1]
syntax provided by many assemblers. Here's an example:
x64 code = {
{ MOV, rax, imm(1) }, // 1 Iteration
{ LEA, rcx, mem($riprel, 2) }, // ━┓
{ PUSH, rcx }, // ┃ Pushes this address on the stack. Equivalent to "call $+2"
{ DEC, rax }, // ◄┛
{ JZ, rel(2) }, // Jumps out of the loop.
{ RET }, // Pops the pushed pointer off and jumps, basically "jmp $-2"
};
This is an example of some complicated control flow you can achieve with chasm! There's also an example in examples/bf_compiler.c
.
Important
To get actual results with this syntax, you need to link your code with x64as()
! Index and scale also do not work with $rip
or $riprel
as base registers.
Assembles and soft links code, dealing with $riprel
and rel()
syntax and returning the assembled code.
- Returns NULL if an error occured, and sets the error code to the
x64error
variable. - The length of the assembled code is stored in
outlen
.
- Returns the length of the instruction in bytes. If it returns 0, an error has occurred.
- This function does not perform any linking, so it's likely much faster to loop with this function than to use x64as() if you do not have any
rel()
ormem($riprel)
s in your code.
Example of such loop:
char buf[128];
uint32_t buf_len = 0;
for(size_t i = 0; i < sizeof(code) / sizeof(code[0]); i++) {
const uint32_t len = x64emit(&code[i], buf + buf_len);
if(!len) { // Or any other kind of error handling code. This is what x64as() does internally.
fprintf(stderr, "%s", x64error(NULL));
return 1;
}
buf_len += len;
}
- Returns a function pointer to the code, which you can call to run your code.
- Free this memory with
x64exec_free()
.
- Free this memory with
Note
Store the size of the memory you requested with x64exec()
as you will need to pass it in here, at least for Unix.
- Returns a string, NULL if an error occurred which will be accessible with
x64error()
.
- Returns a string with a description of the error.
- If
errcode
is not NULL, it will be set to the error code.
Currently does not support legacy instructions (32 bit protected mode instructions) because I didn't deem it necessary to support for hobbyists in the modern era. It's easily possible though, with some slight modifications to the encoding methods! If there are enough people that want it, I will work on adding it as a separate file.
I currently do not support AVX-512!! This will change eventually, as there are some plans to add support for it, with all of its' weird syntax. I have been thinking of something like ymm(10, k1, z)
for example.
Support for other instruction sets will come when I get to them, and I when get some good tables that give me the exact information I need! I currently use a modified table from StanfordPL/x64asm.
Chasm is dual licensed under the MIT Licence and Public Domain. You can choose the licence that suits your project the best. The MIT Licence is a permissive licence that is short and to the point. The Public Domain licence is a licence that makes the software available to the public for free and with no copyright.
A chasm is a deep ravine or hole formed through millenia of erosion and natural processes. This name struck a chord in my heart, as I have been working on this library for over a year, and it's the best way I know of getting low and deep into the heart of computing. I also loved that it had "asm" and "c" in it, which are big parts of what this library is about.
- asmjit/asmjit - A popular choice for C++ developers and many times more complex and feature rich than chasm.
- aengelke/fadec - Similar library to chasm but a completely different API that might be more flexible but harder for some.
- bitdefender/bddisasm - Fast, easy to use Disassembler library.
- garc0/CTAsm - Compile time assembler for C++ using only templates.
The first time I saw a library like this was when I found https://github.com/StanfordPL/x64asm. I loved the idea, but I couldn't use their library from C or Windows, so I took the liberty to redesign some of their library. I use their table to generate my own table in
asm_x64.c
and while I haven't used any of their code, I did take inspiration from how they did instruction operands.
All source is in aqilc/rustscript. Tests testing all the features and operands are here.