You cannot select more than 25 topics
			Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
		
		
		
		
		
			
		
			
				
	
	
		
			1264 lines
		
	
	
		
			53 KiB
		
	
	
	
		
			HTML
		
	
			
		
		
	
	
			1264 lines
		
	
	
		
			53 KiB
		
	
	
	
		
			HTML
		
	
| <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
 | |
| <html>
 | |
| <head>
 | |
| <title>FFI Semantics</title>
 | |
| <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
 | |
| <meta name="Author" content="Mike Pall">
 | |
| <meta name="Copyright" content="Copyright (C) 2005-2016, Mike Pall">
 | |
| <meta name="Language" content="en">
 | |
| <link rel="stylesheet" type="text/css" href="bluequad.css" media="screen">
 | |
| <link rel="stylesheet" type="text/css" href="bluequad-print.css" media="print">
 | |
| <style type="text/css">
 | |
| table.convtable { line-height: 1.2; }
 | |
| tr.convhead td { font-weight: bold; }
 | |
| td.convop { font-style: italic; width: 40%; }
 | |
| </style>
 | |
| </head>
 | |
| <body>
 | |
| <div id="site">
 | |
| <a href="http://luajit.org"><span>Lua<span id="logo">JIT</span></span></a>
 | |
| </div>
 | |
| <div id="head">
 | |
| <h1>FFI Semantics</h1>
 | |
| </div>
 | |
| <div id="nav">
 | |
| <ul><li>
 | |
| <a href="luajit.html">LuaJIT</a>
 | |
| <ul><li>
 | |
| <a href="http://luajit.org/download.html">Download <span class="ext">»</span></a>
 | |
| </li><li>
 | |
| <a href="install.html">Installation</a>
 | |
| </li><li>
 | |
| <a href="running.html">Running</a>
 | |
| </li></ul>
 | |
| </li><li>
 | |
| <a href="extensions.html">Extensions</a>
 | |
| <ul><li>
 | |
| <a href="ext_ffi.html">FFI Library</a>
 | |
| <ul><li>
 | |
| <a href="ext_ffi_tutorial.html">FFI Tutorial</a>
 | |
| </li><li>
 | |
| <a href="ext_ffi_api.html">ffi.* API</a>
 | |
| </li><li>
 | |
| <a class="current" href="ext_ffi_semantics.html">FFI Semantics</a>
 | |
| </li></ul>
 | |
| </li><li>
 | |
| <a href="ext_jit.html">jit.* Library</a>
 | |
| </li><li>
 | |
| <a href="ext_c_api.html">Lua/C API</a>
 | |
| </li><li>
 | |
| <a href="ext_profiler.html">Profiler</a>
 | |
| </li></ul>
 | |
| </li><li>
 | |
| <a href="status.html">Status</a>
 | |
| <ul><li>
 | |
| <a href="changes.html">Changes</a>
 | |
| </li></ul>
 | |
| </li><li>
 | |
| <a href="faq.html">FAQ</a>
 | |
| </li><li>
 | |
| <a href="http://luajit.org/performance.html">Performance <span class="ext">»</span></a>
 | |
| </li><li>
 | |
| <a href="http://wiki.luajit.org/">Wiki <span class="ext">»</span></a>
 | |
| </li><li>
 | |
| <a href="http://luajit.org/list.html">Mailing List <span class="ext">»</span></a>
 | |
| </li></ul>
 | |
| </div>
 | |
| <div id="main">
 | |
| <p>
 | |
| This page describes the detailed semantics underlying the FFI library
 | |
| and its interaction with both Lua and C code.
 | |
| </p>
 | |
| <p>
 | |
| Given that the FFI library is designed to interface with C code
 | |
| and that declarations can be written in plain C syntax, <b>it
 | |
| closely follows the C language semantics</b>, wherever possible.
 | |
| Some minor concessions are needed for smoother interoperation with Lua
 | |
| language semantics.
 | |
| </p>
 | |
| <p>
 | |
| Please don't be overwhelmed by the contents of this page — this
 | |
| is a reference and you may need to consult it, if in doubt. It doesn't
 | |
| hurt to skim this page, but most of the semantics "just work" as you'd
 | |
| expect them to work. It should be straightforward to write
 | |
| applications using the LuaJIT FFI for developers with a C or C++
 | |
| background.
 | |
| </p>
 | |
| 
 | |
| <h2 id="clang">C Language Support</h2>
 | |
| <p>
 | |
| The FFI library has a built-in C parser with a minimal memory
 | |
| footprint. It's used by the <a href="ext_ffi_api.html">ffi.* library
 | |
| functions</a> to declare C types or external symbols.
 | |
| </p>
 | |
| <p>
 | |
| It's only purpose is to parse C declarations, as found e.g. in
 | |
| C header files. Although it does evaluate constant expressions,
 | |
| it's <em>not</em> a C compiler. The body of <tt>inline</tt>
 | |
| C function definitions is simply ignored.
 | |
| </p>
 | |
| <p>
 | |
| Also, this is <em>not</em> a validating C parser. It expects and
 | |
| accepts correctly formed C declarations, but it may choose to
 | |
| ignore bad declarations or show rather generic error messages. If in
 | |
| doubt, please check the input against your favorite C compiler.
 | |
| </p>
 | |
| <p>
 | |
| The C parser complies to the <b>C99 language standard</b> plus
 | |
| the following extensions:
 | |
| </p>
 | |
| <ul>
 | |
| 
 | |
| <li>The <tt>'\e'</tt> escape in character and string literals.</li>
 | |
| 
 | |
| <li>The C99/C++ boolean type, declared with the keywords <tt>bool</tt>
 | |
| or <tt>_Bool</tt>.</li>
 | |
| 
 | |
| <li>Complex numbers, declared with the keywords <tt>complex</tt> or
 | |
| <tt>_Complex</tt>.</li>
 | |
| 
 | |
| <li>Two complex number types: <tt>complex</tt> (aka
 | |
| <tt>complex double</tt>) and <tt>complex float</tt>.</li>
 | |
| 
 | |
| <li>Vector types, declared with the GCC <tt>mode</tt> or
 | |
| <tt>vector_size</tt> attribute.</li>
 | |
| 
 | |
| <li>Unnamed ('transparent') <tt>struct</tt>/<tt>union</tt> fields
 | |
| inside a <tt>struct</tt>/<tt>union</tt>.</li>
 | |
| 
 | |
| <li>Incomplete <tt>enum</tt> declarations, handled like incomplete
 | |
| <tt>struct</tt> declarations.</li>
 | |
| 
 | |
| <li>Unnamed <tt>enum</tt> fields inside a
 | |
| <tt>struct</tt>/<tt>union</tt>. This is similar to a scoped C++
 | |
| <tt>enum</tt>, except that declared constants are visible in the
 | |
| global namespace, too.</li>
 | |
| 
 | |
| <li>Scoped <tt>static const</tt> declarations inside a
 | |
| <tt>struct</tt>/<tt>union</tt> (from C++).</li>
 | |
| 
 | |
| <li>Zero-length arrays (<tt>[0]</tt>), empty
 | |
| <tt>struct</tt>/<tt>union</tt>, variable-length arrays (VLA,
 | |
| <tt>[?]</tt>) and variable-length structs (VLS, with a trailing
 | |
| VLA).</li>
 | |
| 
 | |
| <li>C++ reference types (<tt>int &x</tt>).</li>
 | |
| 
 | |
| <li>Alternate GCC keywords with '<tt>__</tt>', e.g.
 | |
| <tt>__const__</tt>.</li>
 | |
| 
 | |
| <li>GCC <tt>__attribute__</tt> with the following attributes:
 | |
| <tt>aligned</tt>, <tt>packed</tt>, <tt>mode</tt>,
 | |
| <tt>vector_size</tt>, <tt>cdecl</tt>, <tt>fastcall</tt>,
 | |
| <tt>stdcall</tt>, <tt>thiscall</tt>.</li>
 | |
| 
 | |
| <li>The GCC <tt>__extension__</tt> keyword and the GCC
 | |
| <tt>__alignof__</tt> operator.</li>
 | |
| 
 | |
| <li>GCC <tt>__asm__("symname")</tt> symbol name redirection for
 | |
| function declarations.</li>
 | |
| 
 | |
| <li>MSVC keywords for fixed-length types: <tt>__int8</tt>,
 | |
| <tt>__int16</tt>, <tt>__int32</tt> and <tt>__int64</tt>.</li>
 | |
| 
 | |
| <li>MSVC <tt>__cdecl</tt>, <tt>__fastcall</tt>, <tt>__stdcall</tt>,
 | |
| <tt>__thiscall</tt>, <tt>__ptr32</tt>, <tt>__ptr64</tt>,
 | |
| <tt>__declspec(align(n))</tt> and <tt>#pragma pack</tt>.</li>
 | |
| 
 | |
| <li>All other GCC/MSVC-specific attributes are ignored.</li>
 | |
| 
 | |
| </ul>
 | |
| <p>
 | |
| The following C types are pre-defined by the C parser (like
 | |
| a <tt>typedef</tt>, except re-declarations will be ignored):
 | |
| </p>
 | |
| <ul>
 | |
| 
 | |
| <li>Vararg handling: <tt>va_list</tt>, <tt>__builtin_va_list</tt>,
 | |
| <tt>__gnuc_va_list</tt>.</li>
 | |
| 
 | |
| <li>From <tt><stddef.h></tt>: <tt>ptrdiff_t</tt>,
 | |
| <tt>size_t</tt>, <tt>wchar_t</tt>.</li>
 | |
| 
 | |
| <li>From <tt><stdint.h></tt>: <tt>int8_t</tt>, <tt>int16_t</tt>,
 | |
| <tt>int32_t</tt>, <tt>int64_t</tt>, <tt>uint8_t</tt>,
 | |
| <tt>uint16_t</tt>, <tt>uint32_t</tt>, <tt>uint64_t</tt>,
 | |
| <tt>intptr_t</tt>, <tt>uintptr_t</tt>.</li>
 | |
| 
 | |
| <li>From <tt><unistd.h></tt> (POSIX): <tt>ssize_t</tt>.</li>
 | |
| 
 | |
| </ul>
 | |
| <p>
 | |
| You're encouraged to use these types in preference to
 | |
| compiler-specific extensions or target-dependent standard types.
 | |
| E.g. <tt>char</tt> differs in signedness and <tt>long</tt> differs in
 | |
| size, depending on the target architecture and platform ABI.
 | |
| </p>
 | |
| <p>
 | |
| The following C features are <b>not</b> supported:
 | |
| </p>
 | |
| <ul>
 | |
| 
 | |
| <li>A declaration must always have a type specifier; it doesn't
 | |
| default to an <tt>int</tt> type.</li>
 | |
| 
 | |
| <li>Old-style empty function declarations (K&R) are not allowed.
 | |
| All C functions must have a proper prototype declaration. A
 | |
| function declared without parameters (<tt>int foo();</tt>) is
 | |
| treated as a function taking zero arguments, like in C++.</li>
 | |
| 
 | |
| <li>The <tt>long double</tt> C type is parsed correctly, but
 | |
| there's no support for the related conversions, accesses or arithmetic
 | |
| operations.</li>
 | |
| 
 | |
| <li>Wide character strings and character literals are not
 | |
| supported.</li>
 | |
| 
 | |
| <li><a href="#status">See below</a> for features that are currently
 | |
| not implemented.</li>
 | |
| 
 | |
| </ul>
 | |
| 
 | |
| <h2 id="convert">C Type Conversion Rules</h2>
 | |
| 
 | |
| <h3 id="convert_tolua">Conversions from C types to Lua objects</h3>
 | |
| <p>
 | |
| These conversion rules apply for <em>read accesses</em> to
 | |
| C types: indexing pointers, arrays or
 | |
| <tt>struct</tt>/<tt>union</tt> types; reading external variables or
 | |
| constant values; retrieving return values from C calls:
 | |
| </p>
 | |
| <table class="convtable">
 | |
| <tr class="convhead">
 | |
| <td class="convin">Input</td>
 | |
| <td class="convop">Conversion</td>
 | |
| <td class="convout">Output</td>
 | |
| </tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin"><tt>int8_t</tt>, <tt>int16_t</tt></td><td class="convop">→<sup>sign-ext</sup> <tt>int32_t</tt> → <tt>double</tt></td><td class="convout">number</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin"><tt>uint8_t</tt>, <tt>uint16_t</tt></td><td class="convop">→<sup>zero-ext</sup> <tt>int32_t</tt> → <tt>double</tt></td><td class="convout">number</td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin"><tt>int32_t</tt>, <tt>uint32_t</tt></td><td class="convop">→ <tt>double</tt></td><td class="convout">number</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin"><tt>int64_t</tt>, <tt>uint64_t</tt></td><td class="convop">boxed value</td><td class="convout">64 bit int cdata</td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin"><tt>double</tt>, <tt>float</tt></td><td class="convop">→ <tt>double</tt></td><td class="convout">number</td></tr>
 | |
| <tr class="even separate">
 | |
| <td class="convin"><tt>bool</tt></td><td class="convop">0 → <tt>false</tt>, otherwise <tt>true</tt></td><td class="convout">boolean</td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin"><tt>enum</tt></td><td class="convop">boxed value</td><td class="convout">enum cdata</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">Complex number</td><td class="convop">boxed value</td><td class="convout">complex cdata</td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin">Vector</td><td class="convop">boxed value</td><td class="convout">vector cdata</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">Pointer</td><td class="convop">boxed value</td><td class="convout">pointer cdata</td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">Array</td><td class="convop">boxed reference</td><td class="convout">reference cdata</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin"><tt>struct</tt>/<tt>union</tt></td><td class="convop">boxed reference</td><td class="convout">reference cdata</td></tr>
 | |
| </table>
 | |
| <p>
 | |
| Bitfields are treated like their underlying type.
 | |
| </p>
 | |
| <p>
 | |
| Reference types are dereferenced <em>before</em> a conversion can take
 | |
| place — the conversion is applied to the C type pointed to
 | |
| by the reference.
 | |
| </p>
 | |
| 
 | |
| <h3 id="convert_fromlua">Conversions from Lua objects to C types</h3>
 | |
| <p>
 | |
| These conversion rules apply for <em>write accesses</em> to
 | |
| C types: indexing pointers, arrays or
 | |
| <tt>struct</tt>/<tt>union</tt> types; initializing cdata objects;
 | |
| casts to C types; writing to external variables; passing
 | |
| arguments to C calls:
 | |
| </p>
 | |
| <table class="convtable">
 | |
| <tr class="convhead">
 | |
| <td class="convin">Input</td>
 | |
| <td class="convop">Conversion</td>
 | |
| <td class="convout">Output</td>
 | |
| </tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">number</td><td class="convop">→</td><td class="convout"><tt>double</tt></td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">boolean</td><td class="convop"><tt>false</tt> → 0, <tt>true</tt> → 1</td><td class="convout"><tt>bool</tt></td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">nil</td><td class="convop"><tt>NULL</tt> →</td><td class="convout"><tt>(void *)</tt></td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">lightuserdata</td><td class="convop">lightuserdata address →</td><td class="convout"><tt>(void *)</tt></td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin">userdata</td><td class="convop">userdata payload →</td><td class="convout"><tt>(void *)</tt></td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">io.* file</td><td class="convop">get FILE * handle →</td><td class="convout"><tt>(void *)</tt></td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">string</td><td class="convop">match against <tt>enum</tt> constant</td><td class="convout"><tt>enum</tt></td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">string</td><td class="convop">copy string data + zero-byte</td><td class="convout"><tt>int8_t[]</tt>, <tt>uint8_t[]</tt></td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin">string</td><td class="convop">string data →</td><td class="convout"><tt>const char[]</tt></td></tr>
 | |
| <tr class="even separate">
 | |
| <td class="convin">function</td><td class="convop"><a href="#callback">create callback</a> →</td><td class="convout">C function type</td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">table</td><td class="convop"><a href="#init_table">table initializer</a></td><td class="convout">Array</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">table</td><td class="convop"><a href="#init_table">table initializer</a></td><td class="convout"><tt>struct</tt>/<tt>union</tt></td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">cdata</td><td class="convop">cdata payload →</td><td class="convout">C type</td></tr>
 | |
| </table>
 | |
| <p>
 | |
| If the result type of this conversion doesn't match the
 | |
| C type of the destination, the
 | |
| <a href="#convert_between">conversion rules between C types</a>
 | |
| are applied.
 | |
| </p>
 | |
| <p>
 | |
| Reference types are immutable after initialization ("no re-seating of
 | |
| references"). For initialization purposes or when passing values to
 | |
| reference parameters, they are treated like pointers. Note that unlike
 | |
| in C++, there's no way to implement automatic reference generation of
 | |
| variables under the Lua language semantics. If you want to call a
 | |
| function with a reference parameter, you need to explicitly pass a
 | |
| one-element array.
 | |
| </p>
 | |
| 
 | |
| <h3 id="convert_between">Conversions between C types</h3>
 | |
| <p>
 | |
| These conversion rules are more or less the same as the standard
 | |
| C conversion rules. Some rules only apply to casts, or require
 | |
| pointer or type compatibility:
 | |
| </p>
 | |
| <table class="convtable">
 | |
| <tr class="convhead">
 | |
| <td class="convin">Input</td>
 | |
| <td class="convop">Conversion</td>
 | |
| <td class="convout">Output</td>
 | |
| </tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">Signed integer</td><td class="convop">→<sup>narrow or sign-extend</sup></td><td class="convout">Integer</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">Unsigned integer</td><td class="convop">→<sup>narrow or zero-extend</sup></td><td class="convout">Integer</td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin">Integer</td><td class="convop">→<sup>round</sup></td><td class="convout"><tt>double</tt>, <tt>float</tt></td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin"><tt>double</tt>, <tt>float</tt></td><td class="convop">→<sup>trunc</sup> <tt>int32_t</tt> →<sup>narrow</sup></td><td class="convout"><tt>(u)int8_t</tt>, <tt>(u)int16_t</tt></td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin"><tt>double</tt>, <tt>float</tt></td><td class="convop">→<sup>trunc</sup></td><td class="convout"><tt>(u)int32_t</tt>, <tt>(u)int64_t</tt></td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin"><tt>double</tt>, <tt>float</tt></td><td class="convop">→<sup>round</sup></td><td class="convout"><tt>float</tt>, <tt>double</tt></td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">Number</td><td class="convop">n == 0 → 0, otherwise 1</td><td class="convout"><tt>bool</tt></td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin"><tt>bool</tt></td><td class="convop"><tt>false</tt> → 0, <tt>true</tt> → 1</td><td class="convout">Number</td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">Complex number</td><td class="convop">convert real part</td><td class="convout">Number</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">Number</td><td class="convop">convert real part, imag = 0</td><td class="convout">Complex number</td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin">Complex number</td><td class="convop">convert real and imag part</td><td class="convout">Complex number</td></tr>
 | |
| <tr class="even separate">
 | |
| <td class="convin">Number</td><td class="convop">convert scalar and replicate</td><td class="convout">Vector</td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin">Vector</td><td class="convop">copy (same size)</td><td class="convout">Vector</td></tr>
 | |
| <tr class="even separate">
 | |
| <td class="convin"><tt>struct</tt>/<tt>union</tt></td><td class="convop">take base address (compat)</td><td class="convout">Pointer</td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin">Array</td><td class="convop">take base address (compat)</td><td class="convout">Pointer</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">Function</td><td class="convop">take function address</td><td class="convout">Function pointer</td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">Number</td><td class="convop">convert via <tt>uintptr_t</tt> (cast)</td><td class="convout">Pointer</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">Pointer</td><td class="convop">convert address (compat/cast)</td><td class="convout">Pointer</td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin">Pointer</td><td class="convop">convert address (cast)</td><td class="convout">Integer</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">Array</td><td class="convop">convert base address (cast)</td><td class="convout">Integer</td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">Array</td><td class="convop">copy (compat)</td><td class="convout">Array</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin"><tt>struct</tt>/<tt>union</tt></td><td class="convop">copy (identical type)</td><td class="convout"><tt>struct</tt>/<tt>union</tt></td></tr>
 | |
| </table>
 | |
| <p>
 | |
| Bitfields or <tt>enum</tt> types are treated like their underlying
 | |
| type.
 | |
| </p>
 | |
| <p>
 | |
| Conversions not listed above will raise an error. E.g. it's not
 | |
| possible to convert a pointer to a complex number or vice versa.
 | |
| </p>
 | |
| 
 | |
| <h3 id="convert_vararg">Conversions for vararg C function arguments</h3>
 | |
| <p>
 | |
| The following default conversion rules apply when passing Lua objects
 | |
| to the variable argument part of vararg C functions:
 | |
| </p>
 | |
| <table class="convtable">
 | |
| <tr class="convhead">
 | |
| <td class="convin">Input</td>
 | |
| <td class="convop">Conversion</td>
 | |
| <td class="convout">Output</td>
 | |
| </tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">number</td><td class="convop">→</td><td class="convout"><tt>double</tt></td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">boolean</td><td class="convop"><tt>false</tt> → 0, <tt>true</tt> → 1</td><td class="convout"><tt>bool</tt></td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin">nil</td><td class="convop"><tt>NULL</tt> →</td><td class="convout"><tt>(void *)</tt></td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">userdata</td><td class="convop">userdata payload →</td><td class="convout"><tt>(void *)</tt></td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin">lightuserdata</td><td class="convop">lightuserdata address →</td><td class="convout"><tt>(void *)</tt></td></tr>
 | |
| <tr class="even separate">
 | |
| <td class="convin">string</td><td class="convop">string data →</td><td class="convout"><tt>const char *</tt></td></tr>
 | |
| <tr class="odd separate">
 | |
| <td class="convin"><tt>float</tt> cdata</td><td class="convop">→</td><td class="convout"><tt>double</tt></td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">Array cdata</td><td class="convop">take base address</td><td class="convout">Element pointer</td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin"><tt>struct</tt>/<tt>union</tt> cdata</td><td class="convop">take base address</td><td class="convout"><tt>struct</tt>/<tt>union</tt> pointer</td></tr>
 | |
| <tr class="even">
 | |
| <td class="convin">Function cdata</td><td class="convop">take function address</td><td class="convout">Function pointer</td></tr>
 | |
| <tr class="odd">
 | |
| <td class="convin">Any other cdata</td><td class="convop">no conversion</td><td class="convout">C type</td></tr>
 | |
| </table>
 | |
| <p>
 | |
| To pass a Lua object, other than a cdata object, as a specific type,
 | |
| you need to override the conversion rules: create a temporary cdata
 | |
| object with a constructor or a cast and initialize it with the value
 | |
| to pass:
 | |
| </p>
 | |
| <p>
 | |
| Assuming <tt>x</tt> is a Lua number, here's how to pass it as an
 | |
| integer to a vararg function:
 | |
| </p>
 | |
| <pre class="code">
 | |
| ffi.cdef[[
 | |
| int printf(const char *fmt, ...);
 | |
| ]]
 | |
| ffi.C.printf("integer value: %d\n", ffi.new("int", x))
 | |
| </pre>
 | |
| <p>
 | |
| If you don't do this, the default Lua number → <tt>double</tt>
 | |
| conversion rule applies. A vararg C function expecting an integer
 | |
| will see a garbled or uninitialized value.
 | |
| </p>
 | |
| 
 | |
| <h2 id="init">Initializers</h2>
 | |
| <p>
 | |
| Creating a cdata object with
 | |
| <a href="ext_ffi_api.html#ffi_new"><tt>ffi.new()</tt></a> or the
 | |
| equivalent constructor syntax always initializes its contents, too.
 | |
| Different rules apply, depending on the number of optional
 | |
| initializers and the C types involved:
 | |
| </p>
 | |
| <ul>
 | |
| <li>If no initializers are given, the object is filled with zero bytes.</li>
 | |
| 
 | |
| <li>Scalar types (numbers and pointers) accept a single initializer.
 | |
| The Lua object is <a href="#convert_fromlua">converted to the scalar
 | |
| C type</a>.</li>
 | |
| 
 | |
| <li>Valarrays (complex numbers and vectors) are treated like scalars
 | |
| when a single initializer is given. Otherwise they are treated like
 | |
| regular arrays.</li>
 | |
| 
 | |
| <li>Aggregate types (arrays and structs) accept either a single cdata
 | |
| initializer of the same type (copy constructor), a single
 | |
| <a href="#init_table">table initializer</a>, or a flat list of
 | |
| initializers.</li>
 | |
| 
 | |
| <li>The elements of an array are initialized, starting at index zero.
 | |
| If a single initializer is given for an array, it's repeated for all
 | |
| remaining elements. This doesn't happen if two or more initializers
 | |
| are given: all remaining uninitialized elements are filled with zero
 | |
| bytes.</li>
 | |
| 
 | |
| <li>Byte arrays may also be initialized with a Lua string. This copies
 | |
| the whole string plus a terminating zero-byte. The copy stops early only
 | |
| if the array has a known, fixed size.</li>
 | |
| 
 | |
| <li>The fields of a <tt>struct</tt> are initialized in the order of
 | |
| their declaration. Uninitialized fields are filled with zero
 | |
| bytes.</li>
 | |
| 
 | |
| <li>Only the first field of a <tt>union</tt> can be initialized with a
 | |
| flat initializer.</li>
 | |
| 
 | |
| <li>Elements or fields which are aggregates themselves are initialized
 | |
| with a <em>single</em> initializer, but this may be a table
 | |
| initializer or a compatible aggregate.</li>
 | |
| 
 | |
| <li>Excess initializers cause an error.</li>
 | |
| 
 | |
| </ul>
 | |
| 
 | |
| <h2 id="init_table">Table Initializers</h2>
 | |
| <p>
 | |
| The following rules apply if a Lua table is used to initialize an
 | |
| Array or a <tt>struct</tt>/<tt>union</tt>:
 | |
| </p>
 | |
| <ul>
 | |
| 
 | |
| <li>If the table index <tt>[0]</tt> is non-<tt>nil</tt>, then the
 | |
| table is assumed to be zero-based. Otherwise it's assumed to be
 | |
| one-based.</li>
 | |
| 
 | |
| <li>Array elements, starting at index zero, are initialized one-by-one
 | |
| with the consecutive table elements, starting at either index
 | |
| <tt>[0]</tt> or <tt>[1]</tt>. This process stops at the first
 | |
| <tt>nil</tt> table element.</li>
 | |
| 
 | |
| <li>If exactly one array element was initialized, it's repeated for
 | |
| all the remaining elements. Otherwise all remaining uninitialized
 | |
| elements are filled with zero bytes.</li>
 | |
| 
 | |
| <li>The above logic only applies to arrays with a known fixed size.
 | |
| A VLA is only initialized with the element(s) given in the table.
 | |
| Depending on the use case, you may need to explicitly add a
 | |
| <tt>NULL</tt> or <tt>0</tt> terminator to a VLA.</li>
 | |
| 
 | |
| <li>A <tt>struct</tt>/<tt>union</tt> can be initialized in the
 | |
| order of the declaration of its fields. Each field is initialized with
 | |
| consecutive table elements, starting at either index <tt>[0]</tt>
 | |
| or <tt>[1]</tt>. This process stops at the first <tt>nil</tt> table
 | |
| element.</li>
 | |
| 
 | |
| <li>Otherwise, if neither index <tt>[0]</tt> nor <tt>[1]</tt> is present,
 | |
| a <tt>struct</tt>/<tt>union</tt> is initialized by looking up each field
 | |
| name (as a string key) in the table. Each non-<tt>nil</tt> value is
 | |
| used to initialize the corresponding field.</li>
 | |
| 
 | |
| <li>Uninitialized fields of a <tt>struct</tt> are filled with zero
 | |
| bytes, except for the trailing VLA of a VLS.</li>
 | |
| 
 | |
| <li>Initialization of a <tt>union</tt> stops after one field has been
 | |
| initialized. If no field has been initialized, the <tt>union</tt> is
 | |
| filled with zero bytes.</li>
 | |
| 
 | |
| <li>Elements or fields which are aggregates themselves are initialized
 | |
| with a <em>single</em> initializer, but this may be a nested table
 | |
| initializer (or a compatible aggregate).</li>
 | |
| 
 | |
| <li>Excess initializers for an array cause an error. Excess
 | |
| initializers for a <tt>struct</tt>/<tt>union</tt> are ignored.
 | |
| Unrelated table entries are ignored, too.</li>
 | |
| 
 | |
| </ul>
 | |
| <p>
 | |
| Example:
 | |
| </p>
 | |
| <pre class="code">
 | |
| local ffi = require("ffi")
 | |
| 
 | |
| ffi.cdef[[
 | |
| struct foo { int a, b; };
 | |
| union bar { int i; double d; };
 | |
| struct nested { int x; struct foo y; };
 | |
| ]]
 | |
| 
 | |
| ffi.new("int[3]", {})            --> 0, 0, 0
 | |
| ffi.new("int[3]", {1})           --> 1, 1, 1
 | |
| ffi.new("int[3]", {1,2})         --> 1, 2, 0
 | |
| ffi.new("int[3]", {1,2,3})       --> 1, 2, 3
 | |
| ffi.new("int[3]", {[0]=1})       --> 1, 1, 1
 | |
| ffi.new("int[3]", {[0]=1,2})     --> 1, 2, 0
 | |
| ffi.new("int[3]", {[0]=1,2,3})   --> 1, 2, 3
 | |
| ffi.new("int[3]", {[0]=1,2,3,4}) --> error: too many initializers
 | |
| 
 | |
| ffi.new("struct foo", {})            --> a = 0, b = 0
 | |
| ffi.new("struct foo", {1})           --> a = 1, b = 0
 | |
| ffi.new("struct foo", {1,2})         --> a = 1, b = 2
 | |
| ffi.new("struct foo", {[0]=1,2})     --> a = 1, b = 2
 | |
| ffi.new("struct foo", {b=2})         --> a = 0, b = 2
 | |
| ffi.new("struct foo", {a=1,b=2,c=3}) --> a = 1, b = 2  'c' is ignored
 | |
| 
 | |
| ffi.new("union bar", {})        --> i = 0, d = 0.0
 | |
| ffi.new("union bar", {1})       --> i = 1, d = ?
 | |
| ffi.new("union bar", {[0]=1,2}) --> i = 1, d = ?    '2' is ignored
 | |
| ffi.new("union bar", {d=2})     --> i = ?, d = 2.0
 | |
| 
 | |
| ffi.new("struct nested", {1,{2,3}})     --> x = 1, y.a = 2, y.b = 3
 | |
| ffi.new("struct nested", {x=1,y={2,3}}) --> x = 1, y.a = 2, y.b = 3
 | |
| </pre>
 | |
| 
 | |
| <h2 id="cdata_ops">Operations on cdata Objects</h2>
 | |
| <p>
 | |
| All of the standard Lua operators can be applied to cdata objects or a
 | |
| mix of a cdata object and another Lua object. The following list shows
 | |
| the pre-defined operations.
 | |
| </p>
 | |
| <p>
 | |
| Reference types are dereferenced <em>before</em> performing each of
 | |
| the operations below — the operation is applied to the
 | |
| C type pointed to by the reference.
 | |
| </p>
 | |
| <p>
 | |
| The pre-defined operations are always tried first before deferring to a
 | |
| metamethod or index table (if any) for the corresponding ctype (except
 | |
| for <tt>__new</tt>). An error is raised if the metamethod lookup or
 | |
| index table lookup fails.
 | |
| </p>
 | |
| 
 | |
| <h3 id="cdata_array">Indexing a cdata object</h3>
 | |
| <ul>
 | |
| 
 | |
| <li><b>Indexing a pointer/array</b>: a cdata pointer/array can be
 | |
| indexed by a cdata number or a Lua number. The element address is
 | |
| computed as the base address plus the number value multiplied by the
 | |
| element size in bytes. A read access loads the element value and
 | |
| <a href="#convert_tolua">converts it to a Lua object</a>. A write
 | |
| access <a href="#convert_fromlua">converts a Lua object to the element
 | |
| type</a> and stores the converted value to the element. An error is
 | |
| raised if the element size is undefined or a write access to a
 | |
| constant element is attempted.</li>
 | |
| 
 | |
| <li><b>Dereferencing a <tt>struct</tt>/<tt>union</tt> field</b>: a
 | |
| cdata <tt>struct</tt>/<tt>union</tt> or a pointer to a
 | |
| <tt>struct</tt>/<tt>union</tt> can be dereferenced by a string key,
 | |
| giving the field name. The field address is computed as the base
 | |
| address plus the relative offset of the field. A read access loads the
 | |
| field value and <a href="#convert_tolua">converts it to a Lua
 | |
| object</a>. A write access <a href="#convert_fromlua">converts a Lua
 | |
| object to the field type</a> and stores the converted value to the
 | |
| field. An error is raised if a write access to a constant
 | |
| <tt>struct</tt>/<tt>union</tt> or a constant field is attempted.
 | |
| Scoped enum constants or static constants are treated like a constant
 | |
| field.</li>
 | |
| 
 | |
| <li><b>Indexing a complex number</b>: a complex number can be indexed
 | |
| either by a cdata number or a Lua number with the values 0 or 1, or by
 | |
| the strings <tt>"re"</tt> or <tt>"im"</tt>. A read access loads the
 | |
| real part (<tt>[0]</tt>, <tt>.re</tt>) or the imaginary part
 | |
| (<tt>[1]</tt>, <tt>.im</tt>) part of a complex number and
 | |
| <a href="#convert_tolua">converts it to a Lua number</a>. The
 | |
| sub-parts of a complex number are immutable — assigning to an
 | |
| index of a complex number raises an error. Accessing out-of-bound
 | |
| indexes returns unspecified results, but is guaranteed not to trigger
 | |
| memory access violations.</li>
 | |
| 
 | |
| <li><b>Indexing a vector</b>: a vector is treated like an array for
 | |
| indexing purposes, except the vector elements are immutable —
 | |
| assigning to an index of a vector raises an error.</li>
 | |
| 
 | |
| </ul>
 | |
| <p>
 | |
| A ctype object can be indexed with a string key, too. The only
 | |
| pre-defined operation is reading scoped constants of
 | |
| <tt>struct</tt>/<tt>union</tt> types. All other accesses defer
 | |
| to the corresponding metamethods or index tables (if any).
 | |
| </p>
 | |
| <p>
 | |
| Note: since there's (deliberately) no address-of operator, a cdata
 | |
| object holding a value type is effectively immutable after
 | |
| initialization. The JIT compiler benefits from this fact when applying
 | |
| certain optimizations.
 | |
| </p>
 | |
| <p>
 | |
| As a consequence, the <em>elements</em> of complex numbers and
 | |
| vectors are immutable. But the elements of an aggregate holding these
 | |
| types <em>may</em> be modified of course. I.e. you cannot assign to
 | |
| <tt>foo.c.im</tt>, but you can assign a (newly created) complex number
 | |
| to <tt>foo.c</tt>.
 | |
| </p>
 | |
| <p>
 | |
| The JIT compiler implements strict aliasing rules: accesses to different
 | |
| types do <b>not</b> alias, except for differences in signedness (this
 | |
| applies even to <tt>char</tt> pointers, unlike C99). Type punning
 | |
| through unions is explicitly detected and allowed.
 | |
| </p>
 | |
| 
 | |
| <h3 id="cdata_call">Calling a cdata object</h3>
 | |
| <ul>
 | |
| 
 | |
| <li><b>Constructor</b>: a ctype object can be called and used as a
 | |
| <a href="ext_ffi_api.html#ffi_new">constructor</a>. This is equivalent
 | |
| to <tt>ffi.new(ct, ...)</tt>, unless a <tt>__new</tt> metamethod is
 | |
| defined. The <tt>__new</tt> metamethod is called with the ctype object
 | |
| plus any other arguments passed to the contructor. Note that you have to
 | |
| use <tt>ffi.new</tt> inside of it, since calling <tt>ct(...)</tt> would
 | |
| cause infinite recursion.</li>
 | |
| 
 | |
| <li><b>C function call</b>: a cdata function or cdata function
 | |
| pointer can be called. The passed arguments are
 | |
| <a href="#convert_fromlua">converted to the C types</a> of the
 | |
| parameters given by the function declaration. Arguments passed to the
 | |
| variable argument part of vararg C function use
 | |
| <a href="#convert_vararg">special conversion rules</a>. This
 | |
| C function is called and the return value (if any) is
 | |
| <a href="#convert_tolua">converted to a Lua object</a>.<br>
 | |
| On Windows/x86 systems, <tt>__stdcall</tt> functions are automatically
 | |
| detected and a function declared as <tt>__cdecl</tt> (the default) is
 | |
| silently fixed up after the first call.</li>
 | |
| 
 | |
| </ul>
 | |
| 
 | |
| <h3 id="cdata_arith">Arithmetic on cdata objects</h3>
 | |
| <ul>
 | |
| 
 | |
| <li><b>Pointer arithmetic</b>: a cdata pointer/array and a cdata
 | |
| number or a Lua number can be added or subtracted. The number must be
 | |
| on the right hand side for a subtraction. The result is a pointer of
 | |
| the same type with an address plus or minus the number value
 | |
| multiplied by the element size in bytes. An error is raised if the
 | |
| element size is undefined.</li>
 | |
| 
 | |
| <li><b>Pointer difference</b>: two compatible cdata pointers/arrays
 | |
| can be subtracted. The result is the difference between their
 | |
| addresses, divided by the element size in bytes. An error is raised if
 | |
| the element size is undefined or zero.</li>
 | |
| 
 | |
| <li><b>64 bit integer arithmetic</b>: the standard arithmetic
 | |
| operators (<tt>+ - * / % ^</tt> and unary
 | |
| minus) can be applied to two cdata numbers, or a cdata number and a
 | |
| Lua number. If one of them is an <tt>uint64_t</tt>, the other side is
 | |
| converted to an <tt>uint64_t</tt> and an unsigned arithmetic operation
 | |
| is performed. Otherwise both sides are converted to an
 | |
| <tt>int64_t</tt> and a signed arithmetic operation is performed. The
 | |
| result is a boxed 64 bit cdata object.<br>
 | |
| 
 | |
| If one of the operands is an <tt>enum</tt> and the other operand is a
 | |
| string, the string is converted to the value of a matching <tt>enum</tt>
 | |
| constant before the above conversion.<br>
 | |
| 
 | |
| These rules ensure that 64 bit integers are "sticky". Any
 | |
| expression involving at least one 64 bit integer operand results
 | |
| in another one. The undefined cases for the division, modulo and power
 | |
| operators return <tt>2LL ^ 63</tt> or
 | |
| <tt>2ULL ^ 63</tt>.<br>
 | |
| 
 | |
| You'll have to explicitly convert a 64 bit integer to a Lua
 | |
| number (e.g. for regular floating-point calculations) with
 | |
| <tt>tonumber()</tt>. But note this may incur a precision loss.</li>
 | |
| 
 | |
| <li><b>64 bit bitwise operations</b>: the rules for 64 bit
 | |
| arithmetic operators apply analogously.<br>
 | |
| 
 | |
| Unlike the other <tt>bit.*</tt> operations, <tt>bit.tobit()</tt>
 | |
| converts a cdata number via <tt>int64_t</tt> to <tt>int32_t</tt> and
 | |
| returns a Lua number.<br>
 | |
| 
 | |
| For <tt>bit.band()</tt>, <tt>bit.bor()</tt> and <tt>bit.bxor()</tt>, the
 | |
| conversion to <tt>int64_t</tt> or <tt>uint64_t</tt> applies to
 | |
| <em>all</em> arguments, if <em>any</em> argument is a cdata number.<br>
 | |
| 
 | |
| For all other operations, only the first argument is used to determine
 | |
| the output type. This implies that a cdata number as a shift count for
 | |
| shifts and rotates is accepted, but that alone does <em>not</em> cause
 | |
| a cdata number output.
 | |
| 
 | |
| </ul>
 | |
| 
 | |
| <h3 id="cdata_comp">Comparisons of cdata objects</h3>
 | |
| <ul>
 | |
| 
 | |
| <li><b>Pointer comparison</b>: two compatible cdata pointers/arrays
 | |
| can be compared. The result is the same as an unsigned comparison of
 | |
| their addresses. <tt>nil</tt> is treated like a <tt>NULL</tt> pointer,
 | |
| which is compatible with any other pointer type.</li>
 | |
| 
 | |
| <li><b>64 bit integer comparison</b>: two cdata numbers, or a
 | |
| cdata number and a Lua number can be compared with each other. If one
 | |
| of them is an <tt>uint64_t</tt>, the other side is converted to an
 | |
| <tt>uint64_t</tt> and an unsigned comparison is performed. Otherwise
 | |
| both sides are converted to an <tt>int64_t</tt> and a signed
 | |
| comparison is performed.<br>
 | |
| 
 | |
| If one of the operands is an <tt>enum</tt> and the other operand is a
 | |
| string, the string is converted to the value of a matching <tt>enum</tt>
 | |
| constant before the above conversion.<br>
 | |
| 
 | |
| <li><b>Comparisons for equality/inequality</b> never raise an error.
 | |
| Even incompatible pointers can be compared for equality by address. Any
 | |
| other incompatible comparison (also with non-cdata objects) treats the
 | |
| two sides as unequal.</li>
 | |
| 
 | |
| </ul>
 | |
| 
 | |
| <h3 id="cdata_key">cdata objects as table keys</h3>
 | |
| <p>
 | |
| Lua tables may be indexed by cdata objects, but this doesn't provide
 | |
| any useful semantics — <b>cdata objects are unsuitable as table
 | |
| keys!</b>
 | |
| </p>
 | |
| <p>
 | |
| A cdata object is treated like any other garbage-collected object and
 | |
| is hashed and compared by its address for table indexing. Since
 | |
| there's no interning for cdata value types, the same value may be
 | |
| boxed in different cdata objects with different addresses. Thus
 | |
| <tt>t[1LL+1LL]</tt> and <tt>t[2LL]</tt> usually <b>do not</b> point to
 | |
| the same hash slot and they certainly <b>do not</b> point to the same
 | |
| hash slot as <tt>t[2]</tt>.
 | |
| </p>
 | |
| <p>
 | |
| It would seriously drive up implementation complexity and slow down
 | |
| the common case, if one were to add extra handling for by-value
 | |
| hashing and comparisons to Lua tables. Given the ubiquity of their use
 | |
| inside the VM, this is not acceptable.
 | |
| </p>
 | |
| <p>
 | |
| There are three viable alternatives, if you really need to use cdata
 | |
| objects as keys:
 | |
| </p>
 | |
| <ul>
 | |
| 
 | |
| <li>If you can get by with the precision of Lua numbers
 | |
| (52 bits), then use <tt>tonumber()</tt> on a cdata number or
 | |
| combine multiple fields of a cdata aggregate to a Lua number. Then use
 | |
| the resulting Lua number as a key when indexing tables.<br>
 | |
| One obvious benefit: <tt>t[tonumber(2LL)]</tt> <b>does</b> point to
 | |
| the same slot as <tt>t[2]</tt>.</li>
 | |
| 
 | |
| <li>Otherwise use either <tt>tostring()</tt> on 64 bit integers
 | |
| or complex numbers or combine multiple fields of a cdata aggregate to
 | |
| a Lua string (e.g. with
 | |
| <a href="ext_ffi_api.html#ffi_string"><tt>ffi.string()</tt></a>). Then
 | |
| use the resulting Lua string as a key when indexing tables.</li>
 | |
| 
 | |
| <li>Create your own specialized hash table implementation using the
 | |
| C types provided by the FFI library, just like you would in
 | |
| C code. Ultimately this may give much better performance than the
 | |
| other alternatives or what a generic by-value hash table could
 | |
| possibly provide.</li>
 | |
| 
 | |
| </ul>
 | |
| 
 | |
| <h2 id="param">Parameterized Types</h2>
 | |
| <p>
 | |
| To facilitate some abstractions, the two functions
 | |
| <a href="ext_ffi_api.html#ffi_typeof"><tt>ffi.typeof</tt></a> and
 | |
| <a href="ext_ffi_api.html#ffi_cdef"><tt>ffi.cdef</tt></a> support
 | |
| parameterized types in C declarations. Note: none of the other API
 | |
| functions taking a cdecl allow this.
 | |
| </p>
 | |
| <p>
 | |
| Any place you can write a <b><tt>typedef</tt> name</b>, an
 | |
| <b>identifier</b> or a <b>number</b> in a declaration, you can write
 | |
| <tt>$</tt> (the dollar sign) instead. These placeholders are replaced in
 | |
| order of appearance with the arguments following the cdecl string:
 | |
| </p>
 | |
| <pre class="code">
 | |
| -- Declare a struct with a parameterized field type and name:
 | |
| ffi.cdef([[
 | |
| typedef struct { $ $; } foo_t;
 | |
| ]], type1, name1)
 | |
| 
 | |
| -- Anonymous struct with dynamic names:
 | |
| local bar_t = ffi.typeof("struct { int $, $; }", name1, name2)
 | |
| -- Derived pointer type:
 | |
| local bar_ptr_t = ffi.typeof("$ *", bar_t)
 | |
| 
 | |
| -- Parameterized dimensions work even where a VLA won't work:
 | |
| local matrix_t = ffi.typeof("uint8_t[$][$]", width, height)
 | |
| </pre>
 | |
| <p>
 | |
| Caveat: this is <em>not</em> simple text substitution! A passed ctype or
 | |
| cdata object is treated like the underlying type, a passed string is
 | |
| considered an identifier and a number is considered a number. You must
 | |
| not mix this up: e.g. passing <tt>"int"</tt> as a string doesn't work in
 | |
| place of a type, you'd need to use <tt>ffi.typeof("int")</tt> instead.
 | |
| </p>
 | |
| <p>
 | |
| The main use for parameterized types are libraries implementing abstract
 | |
| data types
 | |
| (<a href="http://www.freelists.org/post/luajit/ffi-type-of-pointer-to,8"><span class="ext">»</span> example</a>),
 | |
| similar to what can be achieved with C++ template metaprogramming.
 | |
| Another use case are derived types of anonymous structs, which avoids
 | |
| pollution of the global struct namespace.
 | |
| </p>
 | |
| <p>
 | |
| Please note that parameterized types are a nice tool and indispensable
 | |
| for certain use cases. But you'll want to use them sparingly in regular
 | |
| code, e.g. when all types are actually fixed.
 | |
| </p>
 | |
| 
 | |
| <h2 id="gc">Garbage Collection of cdata Objects</h2>
 | |
| <p>
 | |
| All explicitly (<tt>ffi.new()</tt>, <tt>ffi.cast()</tt> etc.) or
 | |
| implicitly (accessors) created cdata objects are garbage collected.
 | |
| You need to ensure to retain valid references to cdata objects
 | |
| somewhere on a Lua stack, an upvalue or in a Lua table while they are
 | |
| still in use. Once the last reference to a cdata object is gone, the
 | |
| garbage collector will automatically free the memory used by it (at
 | |
| the end of the next GC cycle).
 | |
| </p>
 | |
| <p>
 | |
| Please note that pointers themselves are cdata objects, however they
 | |
| are <b>not</b> followed by the garbage collector. So e.g. if you
 | |
| assign a cdata array to a pointer, you must keep the cdata object
 | |
| holding the array alive as long as the pointer is still in use:
 | |
| </p>
 | |
| <pre class="code">
 | |
| ffi.cdef[[
 | |
| typedef struct { int *a; } foo_t;
 | |
| ]]
 | |
| 
 | |
| local s = ffi.new("foo_t", ffi.new("int[10]")) -- <span style="color:#c00000;">WRONG!</span>
 | |
| 
 | |
| local a = ffi.new("int[10]") -- <span style="color:#00a000;">OK</span>
 | |
| local s = ffi.new("foo_t", a)
 | |
| -- Now do something with 's', but keep 'a' alive until you're done.
 | |
| </pre>
 | |
| <p>
 | |
| Similar rules apply for Lua strings which are implicitly converted to
 | |
| <tt>"const char *"</tt>: the string object itself must be
 | |
| referenced somewhere or it'll be garbage collected eventually. The
 | |
| pointer will then point to stale data, which may have already been
 | |
| overwritten. Note that <em>string literals</em> are automatically kept
 | |
| alive as long as the function containing it (actually its prototype)
 | |
| is not garbage collected.
 | |
| </p>
 | |
| <p>
 | |
| Objects which are passed as an argument to an external C function
 | |
| are kept alive until the call returns. So it's generally safe to
 | |
| create temporary cdata objects in argument lists. This is a common
 | |
| idiom for <a href="#convert_vararg">passing specific C types to
 | |
| vararg functions</a>.
 | |
| </p>
 | |
| <p>
 | |
| Memory areas returned by C functions (e.g. from <tt>malloc()</tt>)
 | |
| must be manually managed, of course (or use
 | |
| <a href="ext_ffi_api.html#ffi_gc"><tt>ffi.gc()</tt></a>). Pointers to
 | |
| cdata objects are indistinguishable from pointers returned by C
 | |
| functions (which is one of the reasons why the GC cannot follow them).
 | |
| </p>
 | |
| 
 | |
| <h2 id="callback">Callbacks</h2>
 | |
| <p>
 | |
| The LuaJIT FFI automatically generates special callback functions
 | |
| whenever a Lua function is converted to a C function pointer. This
 | |
| associates the generated callback function pointer with the C type
 | |
| of the function pointer and the Lua function object (closure).
 | |
| </p>
 | |
| <p>
 | |
| This can happen implicitly due to the usual conversions, e.g. when
 | |
| passing a Lua function to a function pointer argument. Or you can use
 | |
| <tt>ffi.cast()</tt> to explicitly cast a Lua function to a
 | |
| C function pointer.
 | |
| </p>
 | |
| <p>
 | |
| Currently only certain C function types can be used as callback
 | |
| functions. Neither C vararg functions nor functions with
 | |
| pass-by-value aggregate argument or result types are supported. There
 | |
| are no restrictions for the kind of Lua functions that can be called
 | |
| from the callback — no checks for the proper number of arguments
 | |
| are made. The return value of the Lua function will be converted to the
 | |
| result type and an error will be thrown for invalid conversions.
 | |
| </p>
 | |
| <p>
 | |
| It's allowed to throw errors across a callback invocation, but it's not
 | |
| advisable in general. Do this only if you know the C function, that
 | |
| called the callback, copes with the forced stack unwinding and doesn't
 | |
| leak resources.
 | |
| </p>
 | |
| <p>
 | |
| One thing that's not allowed, is to let an FFI call into a C function
 | |
| get JIT-compiled, which in turn calls a callback, calling into Lua again.
 | |
| Usually this attempt is caught by the interpreter first and the
 | |
| C function is blacklisted for compilation.
 | |
| </p>
 | |
| <p>
 | |
| However, this heuristic may fail under specific circumstances: e.g. a
 | |
| message polling function might not run Lua callbacks right away and the call
 | |
| gets JIT-compiled. If it later happens to call back into Lua (e.g. a rarely
 | |
| invoked error callback), you'll get a VM PANIC with the message
 | |
| <tt>"bad callback"</tt>. Then you'll need to manually turn off
 | |
| JIT-compilation with
 | |
| <a href="ext_jit.html#jit_onoff_func"><tt>jit.off()</tt></a> for the
 | |
| surrounding Lua function that invokes such a message polling function (or
 | |
| similar).
 | |
| </p>
 | |
| 
 | |
| <h3 id="callback_resources">Callback resource handling</h3>
 | |
| <p>
 | |
| Callbacks take up resources — you can only have a limited number
 | |
| of them at the same time (500 - 1000, depending on the
 | |
| architecture). The associated Lua functions are anchored to prevent
 | |
| garbage collection, too.
 | |
| </p>
 | |
| <p>
 | |
| <b>Callbacks due to implicit conversions are permanent!</b> There is no
 | |
| way to guess their lifetime, since the C side might store the
 | |
| function pointer for later use (typical for GUI toolkits). The associated
 | |
| resources cannot be reclaimed until termination:
 | |
| </p>
 | |
| <pre class="code">
 | |
| ffi.cdef[[
 | |
| typedef int (__stdcall *WNDENUMPROC)(void *hwnd, intptr_t l);
 | |
| int EnumWindows(WNDENUMPROC func, intptr_t l);
 | |
| ]]
 | |
| 
 | |
| -- Implicit conversion to a callback via function pointer argument.
 | |
| local count = 0
 | |
| ffi.C.EnumWindows(function(hwnd, l)
 | |
|   count = count + 1
 | |
|   return true
 | |
| end, 0)
 | |
| -- The callback is permanent and its resources cannot be reclaimed!
 | |
| -- Ok, so this may not be a problem, if you do this only once.
 | |
| </pre>
 | |
| <p>
 | |
| Note: this example shows that you <em>must</em> properly declare
 | |
| <tt>__stdcall</tt> callbacks on Windows/x86 systems. The calling
 | |
| convention cannot be automatically detected, unlike for
 | |
| <tt>__stdcall</tt> calls <em>to</em> Windows functions.
 | |
| </p>
 | |
| <p>
 | |
| For some use cases it's necessary to free up the resources or to
 | |
| dynamically redirect callbacks. Use an explicit cast to a
 | |
| C function pointer and keep the resulting cdata object. Then use
 | |
| the <a href="ext_ffi_api.html#callback_free"><tt>cb:free()</tt></a>
 | |
| or <a href="ext_ffi_api.html#callback_set"><tt>cb:set()</tt></a> methods
 | |
| on the cdata object:
 | |
| </p>
 | |
| <pre class="code">
 | |
| -- Explicitly convert to a callback via cast.
 | |
| local count = 0
 | |
| local cb = ffi.cast("WNDENUMPROC", function(hwnd, l)
 | |
|   count = count + 1
 | |
|   return true
 | |
| end)
 | |
| 
 | |
| -- Pass it to a C function.
 | |
| ffi.C.EnumWindows(cb, 0)
 | |
| -- EnumWindows doesn't need the callback after it returns, so free it.
 | |
| 
 | |
| cb:free()
 | |
| -- The callback function pointer is no longer valid and its resources
 | |
| -- will be reclaimed. The created Lua closure will be garbage collected.
 | |
| </pre>
 | |
| 
 | |
| <h3 id="callback_performance">Callback performance</h3>
 | |
| <p>
 | |
| <b>Callbacks are slow!</b> First, the C to Lua transition itself
 | |
| has an unavoidable cost, similar to a <tt>lua_call()</tt> or
 | |
| <tt>lua_pcall()</tt>. Argument and result marshalling add to that cost.
 | |
| And finally, neither the C compiler nor LuaJIT can inline or
 | |
| optimize across the language barrier and hoist repeated computations out
 | |
| of a callback function.
 | |
| </p>
 | |
| <p>
 | |
| Do not use callbacks for performance-sensitive work: e.g. consider a
 | |
| numerical integration routine which takes a user-defined function to
 | |
| integrate over. It's a bad idea to call a user-defined Lua function from
 | |
| C code millions of times. The callback overhead will be absolutely
 | |
| detrimental for performance.
 | |
| </p>
 | |
| <p>
 | |
| It's considerably faster to write the numerical integration routine
 | |
| itself in Lua — the JIT compiler will be able to inline the
 | |
| user-defined function and optimize it together with its calling context,
 | |
| with very competitive performance.
 | |
| </p>
 | |
| <p>
 | |
| As a general guideline: <b>use callbacks only when you must</b>, because
 | |
| of existing C APIs. E.g. callback performance is irrelevant for a
 | |
| GUI application, which waits for user input most of the time, anyway.
 | |
| </p>
 | |
| <p>
 | |
| For new designs <b>avoid push-style APIs</b>: a C function repeatedly
 | |
| calling a callback for each result. Instead <b>use pull-style APIs</b>:
 | |
| call a C function repeatedly to get a new result. Calls from Lua
 | |
| to C via the FFI are much faster than the other way round. Most well-designed
 | |
| libraries already use pull-style APIs (read/write, get/put).
 | |
| </p>
 | |
| 
 | |
| <h2 id="clib">C Library Namespaces</h2>
 | |
| <p>
 | |
| A C library namespace is a special kind of object which allows
 | |
| access to the symbols contained in shared libraries or the default
 | |
| symbol namespace. The default
 | |
| <a href="ext_ffi_api.html#ffi_C"><tt>ffi.C</tt></a> namespace is
 | |
| automatically created when the FFI library is loaded. C library
 | |
| namespaces for specific shared libraries may be created with the
 | |
| <a href="ext_ffi_api.html#ffi_load"><tt>ffi.load()</tt></a> API
 | |
| function.
 | |
| </p>
 | |
| <p>
 | |
| Indexing a C library namespace object with a symbol name (a Lua
 | |
| string) automatically binds it to the library. First the symbol type
 | |
| is resolved — it must have been declared with
 | |
| <a href="ext_ffi_api.html#ffi_cdef"><tt>ffi.cdef</tt></a>. Then the
 | |
| symbol address is resolved by searching for the symbol name in the
 | |
| associated shared libraries or the default symbol namespace. Finally,
 | |
| the resulting binding between the symbol name, the symbol type and its
 | |
| address is cached. Missing symbol declarations or nonexistent symbol
 | |
| names cause an error.
 | |
| </p>
 | |
| <p>
 | |
| This is what happens on a <b>read access</b> for the different kinds of
 | |
| symbols:
 | |
| </p>
 | |
| <ul>
 | |
| 
 | |
| <li>External functions: a cdata object with the type of the function
 | |
| and its address is returned.</li>
 | |
| 
 | |
| <li>External variables: the symbol address is dereferenced and the
 | |
| loaded value is <a href="#convert_tolua">converted to a Lua object</a>
 | |
| and returned.</li>
 | |
| 
 | |
| <li>Constant values (<tt>static const</tt> or <tt>enum</tt>
 | |
| constants): the constant is <a href="#convert_tolua">converted to a
 | |
| Lua object</a> and returned.</li>
 | |
| 
 | |
| </ul>
 | |
| <p>
 | |
| This is what happens on a <b>write access</b>:
 | |
| </p>
 | |
| <ul>
 | |
| 
 | |
| <li>External variables: the value to be written is
 | |
| <a href="#convert_fromlua">converted to the C type</a> of the
 | |
| variable and then stored at the symbol address.</li>
 | |
| 
 | |
| <li>Writing to constant variables or to any other symbol type causes
 | |
| an error, like any other attempted write to a constant location.</li>
 | |
| 
 | |
| </ul>
 | |
| <p>
 | |
| C library namespaces themselves are garbage collected objects. If
 | |
| the last reference to the namespace object is gone, the garbage
 | |
| collector will eventually release the shared library reference and
 | |
| remove all memory associated with the namespace. Since this may
 | |
| trigger the removal of the shared library from the memory of the
 | |
| running process, it's generally <em>not safe</em> to use function
 | |
| cdata objects obtained from a library if the namespace object may be
 | |
| unreferenced.
 | |
| </p>
 | |
| <p>
 | |
| Performance notice: the JIT compiler specializes to the identity of
 | |
| namespace objects and to the strings used to index it. This
 | |
| effectively turns function cdata objects into constants. It's not
 | |
| useful and actually counter-productive to explicitly cache these
 | |
| function objects, e.g. <tt>local strlen = ffi.C.strlen</tt>. OTOH it
 | |
| <em>is</em> useful to cache the namespace itself, e.g. <tt>local C =
 | |
| ffi.C</tt>.
 | |
| </p>
 | |
| 
 | |
| <h2 id="policy">No Hand-holding!</h2>
 | |
| <p>
 | |
| The FFI library has been designed as <b>a low-level library</b>. The
 | |
| goal is to interface with C code and C data types with a
 | |
| minimum of overhead. This means <b>you can do anything you can do
 | |
| from C</b>: access all memory, overwrite anything in memory, call
 | |
| machine code at any memory address and so on.
 | |
| </p>
 | |
| <p>
 | |
| The FFI library provides <b>no memory safety</b>, unlike regular Lua
 | |
| code. It will happily allow you to dereference a <tt>NULL</tt>
 | |
| pointer, to access arrays out of bounds or to misdeclare
 | |
| C functions. If you make a mistake, your application might crash,
 | |
| just like equivalent C code would.
 | |
| </p>
 | |
| <p>
 | |
| This behavior is inevitable, since the goal is to provide full
 | |
| interoperability with C code. Adding extra safety measures, like
 | |
| bounds checks, would be futile. There's no way to detect
 | |
| misdeclarations of C functions, since shared libraries only
 | |
| provide symbol names, but no type information. Likewise there's no way
 | |
| to infer the valid range of indexes for a returned pointer.
 | |
| </p>
 | |
| <p>
 | |
| Again: the FFI library is a low-level library. This implies it needs
 | |
| to be used with care, but it's flexibility and performance often
 | |
| outweigh this concern. If you're a C or C++ developer, it'll be easy
 | |
| to apply your existing knowledge. OTOH writing code for the FFI
 | |
| library is not for the faint of heart and probably shouldn't be the
 | |
| first exercise for someone with little experience in Lua, C or C++.
 | |
| </p>
 | |
| <p>
 | |
| As a corollary of the above, the FFI library is <b>not safe for use by
 | |
| untrusted Lua code</b>. If you're sandboxing untrusted Lua code, you
 | |
| definitely don't want to give this code access to the FFI library or
 | |
| to <em>any</em> cdata object (except 64 bit integers or complex
 | |
| numbers). Any properly engineered Lua sandbox needs to provide safety
 | |
| wrappers for many of the standard Lua library functions —
 | |
| similar wrappers need to be written for high-level operations on FFI
 | |
| data types, too.
 | |
| </p>
 | |
| 
 | |
| <h2 id="status">Current Status</h2>
 | |
| <p>
 | |
| The initial release of the FFI library has some limitations and is
 | |
| missing some features. Most of these will be fixed in future releases.
 | |
| </p>
 | |
| <p>
 | |
| <a href="#clang">C language support</a> is
 | |
| currently incomplete:
 | |
| </p>
 | |
| <ul>
 | |
| <li>C declarations are not passed through a C pre-processor,
 | |
| yet.</li>
 | |
| <li>The C parser is able to evaluate most constant expressions
 | |
| commonly found in C header files. However it doesn't handle the
 | |
| full range of C expression semantics and may fail for some
 | |
| obscure constructs.</li>
 | |
| <li><tt>static const</tt> declarations only work for integer types
 | |
| up to 32 bits. Neither declaring string constants nor
 | |
| floating-point constants is supported.</li>
 | |
| <li>Packed <tt>struct</tt> bitfields that cross container boundaries
 | |
| are not implemented.</li>
 | |
| <li>Native vector types may be defined with the GCC <tt>mode</tt> or
 | |
| <tt>vector_size</tt> attribute. But no operations other than loading,
 | |
| storing and initializing them are supported, yet.</li>
 | |
| <li>The <tt>volatile</tt> type qualifier is currently ignored by
 | |
| compiled code.</li>
 | |
| <li><a href="ext_ffi_api.html#ffi_cdef"><tt>ffi.cdef</tt></a> silently
 | |
| ignores most re-declarations. Note: avoid re-declarations which do not
 | |
| conform to C99. The implementation will eventually be changed to
 | |
| perform strict checks.</li>
 | |
| </ul>
 | |
| <p>
 | |
| The JIT compiler already handles a large subset of all FFI operations.
 | |
| It automatically falls back to the interpreter for unimplemented
 | |
| operations (you can check for this with the
 | |
| <a href="running.html#opt_j"><tt>-jv</tt></a> command line option).
 | |
| The following operations are currently not compiled and may exhibit
 | |
| suboptimal performance, especially when used in inner loops:
 | |
| </p>
 | |
| <ul>
 | |
| <li>Bitfield accesses and initializations.</li>
 | |
| <li>Vector operations.</li>
 | |
| <li>Table initializers.</li>
 | |
| <li>Initialization of nested <tt>struct</tt>/<tt>union</tt> types.</li>
 | |
| <li>Non-default initialization of VLA/VLS or large C types
 | |
| (> 128 bytes or > 16 array elements.</li>
 | |
| <li>Conversions from lightuserdata to <tt>void *</tt>.</li>
 | |
| <li>Pointer differences for element sizes that are not a power of
 | |
| two.</li>
 | |
| <li>Calls to C functions with aggregates passed or returned by
 | |
| value.</li>
 | |
| <li>Calls to ctype metamethods which are not plain functions.</li>
 | |
| <li>ctype <tt>__newindex</tt> tables and non-string lookups in ctype
 | |
| <tt>__index</tt> tables.</li>
 | |
| <li><tt>tostring()</tt> for cdata types.</li>
 | |
| <li>Calls to <tt>ffi.cdef()</tt>, <tt>ffi.load()</tt> and
 | |
| <tt>ffi.metatype()</tt>.</li>
 | |
| </ul>
 | |
| <p>
 | |
| Other missing features:
 | |
| </p>
 | |
| <ul>
 | |
| <li>Arithmetic for <tt>complex</tt> numbers.</li>
 | |
| <li>Passing structs by value to vararg C functions.</li>
 | |
| <li><a href="extensions.html#exceptions">C++ exception interoperability</a>
 | |
| does not extend to C functions called via the FFI, if the call is
 | |
| compiled.</li>
 | |
| </ul>
 | |
| <br class="flush">
 | |
| </div>
 | |
| <div id="foot">
 | |
| <hr class="hide">
 | |
| Copyright © 2005-2016 Mike Pall
 | |
| <span class="noprint">
 | |
| ·
 | |
| <a href="contact.html">Contact</a>
 | |
| </span>
 | |
| </div>
 | |
| </body>
 | |
| </html>
 |