Expand description
Syn is a parsing library for parsing a stream of Rust tokens into a syntax tree of Rust source code.
Currently this library is geared toward use in Rust procedural macros, but contains some APIs that may be useful more generally.
-
Data structures — Syn provides a complete syntax tree that can represent any valid Rust source code. The syntax tree is rooted at
syn::Filewhich represents a full source file, but there are other entry points that may be useful to procedural macros includingsyn::Item,syn::Exprandsyn::Type. -
Derives — Of particular interest to derive macros is
syn::DeriveInputwhich is any of the three legal input items to a derive macro. An example below shows using this type in a library that can derive implementations of a user-defined trait. -
Parsing — Parsing in Syn is built around parser functions with the signature
fn(ParseStream) -> Result<T>. Every syntax tree node defined by Syn is individually parsable and may be used as a building block for custom syntaxes, or you may dream up your own brand new syntax without involving any of our syntax tree types. -
Location information — Every token parsed by Syn is associated with a
Spanthat tracks line and column information back to the source of that token. These spans allow a procedural macro to display detailed error messages pointing to all the right places in the user’s code. There is an example of this below. -
Feature flags — Functionality is aggressively feature gated so your procedural macros enable only what they need, and do not pay in compile time for all the rest.
§Example of a derive macro
The canonical derive macro using Syn looks like this. We write an ordinary
Rust function tagged with a proc_macro_derive attribute and the name of
the trait we are deriving. Any time that derive appears in the user’s code,
the Rust compiler passes their data structure as tokens into our macro. We
get to execute arbitrary Rust code to figure out what to do with those
tokens, then hand some tokens back to the compiler to compile into the
user’s crate.
[dependencies]
syn = "2.0"
quote = "1.0"
[lib]
proc-macro = trueuse proc_macro::TokenStream;
use quote::quote;
use syn::{parse_macro_input, DeriveInput};
#[proc_macro_derive(MyMacro)]
pub fn my_macro(input: TokenStream) -> TokenStream {
// Parse the input tokens into a syntax tree
let input = parse_macro_input!(input as DeriveInput);
// Build the output, possibly using quasi-quotation
let expanded = quote! {
// ...
};
// Hand the output tokens back to the compiler
TokenStream::from(expanded)
}The heapsize example directory shows a complete working implementation
of a derive macro. The example derives a HeapSize trait which computes an
estimate of the amount of heap memory owned by a value.
pub trait HeapSize {
/// Total number of bytes of heap memory owned by `self`.
fn heap_size_of_children(&self) -> usize;
}The derive macro allows users to write #[derive(HeapSize)] on data
structures in their program.
#[derive(HeapSize)]
struct Demo<'a, T: ?Sized> {
a: Box<T>,
b: u8,
c: &'a str,
d: String,
}§Spans and error reporting
The token-based procedural macro API provides great control over where the
compiler’s error messages are displayed in user code. Consider the error the
user sees if one of their field types does not implement HeapSize.
#[derive(HeapSize)]
struct Broken {
ok: String,
bad: std::thread::Thread,
}By tracking span information all the way through the expansion of a
procedural macro as shown in the heapsize example, token-based macros in
Syn are able to trigger errors that directly pinpoint the source of the
problem.
error[E0277]: the trait bound `std::thread::Thread: HeapSize` is not satisfied
--> src/main.rs:7:5
|
7 | bad: std::thread::Thread,
| ^^^^^^^^^^^^^^^^^^^^^^^^ the trait `HeapSize` is not implemented for `Thread`§Parsing a custom syntax
The lazy-static example directory shows the implementation of a
functionlike!(...) procedural macro in which the input tokens are parsed
using Syn’s parsing API.
The example reimplements the popular lazy_static crate from crates.io as a
procedural macro.
lazy_static! {
static ref USERNAME: Regex = Regex::new("^[a-z0-9_-]{3,16}$").unwrap();
}The implementation shows how to trigger custom warnings and error messages on the macro input.
warning: come on, pick a more creative name
--> src/main.rs:10:16
|
10 | static ref FOO: String = "lazy_static".to_owned();
| ^^^§Testing
When testing macros, we often care not just that the macro can be used
successfully but also that when the macro is provided with invalid input it
produces maximally helpful error messages. Consider using the trybuild
crate to write tests for errors that are emitted by your macro or errors
detected by the Rust compiler in the expanded code following misuse of the
macro. Such tests help avoid regressions from later refactors that
mistakenly make an error no longer trigger or be less helpful than it used
to be.
§Debugging
When developing a procedural macro it can be helpful to look at what the
generated code looks like. Use cargo rustc -- -Zunstable-options --pretty=expanded or the cargo expand subcommand.
To show the expanded code for some crate that uses your procedural macro,
run cargo expand from that crate. To show the expanded code for one of
your own test cases, run cargo expand --test the_test_case where the last
argument is the name of the test file without the .rs extension.
This write-up by Brandon W Maister discusses debugging in more detail: Debugging Rust’s new Custom Derive system.
§Optional features
Syn puts a lot of functionality behind optional features in order to optimize compile time for the most common use cases. The following features are available.
derive(enabled by default) — Data structures for representing the possible input to a derive macro, including structs and enums and types.full— Data structures for representing the syntax tree of all valid Rust source code, including items and expressions.parsing(enabled by default) — Ability to parse input tokens into a syntax tree node of a chosen type.printing(enabled by default) — Ability to print a syntax tree node as tokens of Rust source code.visit— Trait for traversing a syntax tree.visit-mut— Trait for traversing and mutating in place a syntax tree.fold— Trait for transforming an owned syntax tree.clone-impls(enabled by default) — Clone impls for all syntax tree types.extra-traits— Debug, Eq, PartialEq, Hash impls for all syntax tree types.proc-macro(enabled by default) — Runtime dependency on the dynamic library libproc_macro from rustc toolchain.
Modules§
- buffer
parsing - A stably addressed token buffer supporting efficient traversal based on a cheaply copyable cursor.
- ext
parsing - Extension traits to provide parsing methods on foreign types.
- meta
parsingand (fullorderive) - Facility for interpreting structured content inside of an
Attribute. - parse
parsing - Parsing interface for parsing a token stream into a syntax tree node.
- punctuated
- A punctuated sequence of syntax tree nodes separated by punctuation.
- spanned
parsingandprinting - A trait that can provide the
Spanof the complete contents of a syntax tree node. - token
- Tokens representing Rust punctuation, keywords, and delimiters.
Macros§
- Token
- A type-macro that expands to the name of the Rust type representation of a given token.
- braced
parsing - Parse a set of curly braces and expose their content to subsequent parsers.
- bracketed
parsing - Parse a set of square brackets and expose their content to subsequent parsers.
- custom_
keyword - Define a type that supports parsing and printing a given identifier as if it were a keyword.
- custom_
punctuation - Define a type that supports parsing and printing a multi-character symbol as if it were a punctuation token.
- parenthesized
parsing - Parse a set of parentheses and expose their content to subsequent parsers.
- parse_
macro_ input parsingandproc-macro - Parse the input TokenStream of a macro, triggering a compile error if the tokens fail to parse.
- parse_
quote parsingandprinting - Quasi-quotation macro that accepts input like the
quote!macro but uses type inference to figure out a return type for those tokens. - parse_
quote_ spanned parsingandprinting - This macro is
parse_quote!+quote_spanned!.
Structs§
- Abi
fullorderive - The binary interface of a function:
extern "C". - Angle
Bracketed Generic Arguments fullorderive - Angle bracketed arguments of a path segment: the
<K, V>inHashMap<K, V>. - Assoc
Const fullorderive - An equality constraint on an associated constant: the
PANIC = falseinTrait<PANIC = false>. - Assoc
Type fullorderive - A binding (equality constraint) on an associated type: the
Item = u8inIterator<Item = u8>. - Attribute
fullorderive - An attribute, like
#[repr(transparent)]. - Bare
FnArg fullorderive - An argument in a function type: the
usizeinfn(usize) -> bool. - Bare
Variadic fullorderive - The variadic argument of a function pointer like
fn(usize, ...). - Bound
Lifetimes fullorderive - A set of bound lifetimes:
for<'a, 'b, 'c>. - Const
Param fullorderive - A const generic parameter:
const LENGTH: usize. - Constraint
fullorderive - An associated type bound:
Iterator<Item: Display>. - Data
Enum derive - An enum input to a
proc_macro_derivemacro. - Data
Struct derive - A struct input to a
proc_macro_derivemacro. - Data
Union derive - An untagged union input to a
proc_macro_derivemacro. - Derive
Input derive - Data structure sent to a
proc_macro_derivemacro. - Error
- Error returned when a Syn parser cannot parse the input tokens.
- Expr
Array full - A slice literal expression:
[a, b, c, d]. - Expr
Assign full - An assignment expression:
a = compute(). - Expr
Async full - An async block:
async { ... }. - Expr
Await full - An await expression:
fut.await. - Expr
Binary fullorderive - A binary operation:
a + b,a += b. - Expr
Block full - A blocked scope:
{ ... }. - Expr
Break full - A
break, with an optional label to break and an optional expression. - Expr
Call fullorderive - A function call expression:
invoke(a, b). - Expr
Cast fullorderive - A cast expression:
foo as f64. - Expr
Closure full - A closure expression:
|a, b| a + b. - Expr
Const full - A const block:
const { ... }. - Expr
Continue full - A
continue, with an optional label. - Expr
Field fullorderive - Access of a named struct field (
obj.k) or unnamed tuple struct field (obj.0). - Expr
ForLoop full - A for loop:
for pat in expr { ... }. - Expr
Group full - An expression contained within invisible delimiters.
- ExprIf
full - An
ifexpression with an optionalelseblock:if expr { ... } else { ... }. - Expr
Index fullorderive - A square bracketed indexing expression:
vector[2]. - Expr
Infer full - The inferred value of a const generic argument, denoted
_. - ExprLet
full - A
letguard:let Some(x) = opt. - ExprLit
fullorderive - A literal in place of an expression:
1,"foo". - Expr
Loop full - Conditionless loop:
loop { ... }. - Expr
Macro fullorderive - A macro invocation expression:
format!("{}", q). - Expr
Match full - A
matchexpression:match n { Some(n) => {}, None => {} }. - Expr
Method Call fullorderive - A method call expression:
x.foo::<T>(a, b). - Expr
Paren fullorderive - A parenthesized expression:
(a + b). - Expr
Path fullorderive - A path like
std::mem::replacepossibly containing generic parameters and a qualified self-type. - Expr
Range full - A range expression:
1..2,1..,..2,1..=2,..=2. - Expr
RawAddr full - Address-of operation:
&raw const placeor&raw mut place. - Expr
Reference fullorderive - A referencing operation:
&aor&mut a. - Expr
Repeat full - An array literal constructed from one repeated element:
[0u8; N]. - Expr
Return full - A
return, with an optional value to be returned. - Expr
Struct fullorderive - A struct literal expression:
Point { x: 1, y: 1 }. - ExprTry
full - A try-expression:
expr?. - Expr
TryBlock full - A try block:
try { ... }. - Expr
Tuple full - A tuple expression:
(a, b, c, d). - Expr
Unary fullorderive - A unary operation:
!x,*x. - Expr
Unsafe full - An unsafe block:
unsafe { ... }. - Expr
While full - A while loop:
while expr { ... }. - Expr
Yield full - A yield expression:
yield expr. - Field
fullorderive - A field of a struct or enum variant.
- Field
Value fullorderive - A field-value pair in a struct literal.
- Fields
Named fullorderive - Named fields of a struct or struct variant such as
Point { x: f64, y: f64 }. - Fields
Unnamed fullorderive - Unnamed fields of a tuple struct or tuple variant such as
Some(T). - Generics
fullorderive - Lifetimes and type parameters attached to a declaration of a function, enum, trait, etc.
- Ident
- A word of Rust code, which may be a keyword or legal variable name.
- Impl
Generics ( fullorderive) andprinting - Returned by
Generics::split_for_impl. - Index
fullorderive - The index of an unnamed tuple struct field.
- Lifetime
- A Rust lifetime:
'a. - Lifetime
Param fullorderive - A lifetime definition:
'a: 'b + 'c + 'd. - LitBool
- A boolean literal:
trueorfalse. - LitByte
- A byte literal:
b'f'. - LitByte
Str - A byte string literal:
b"foo". - LitCStr
- A nul-terminated C-string literal:
c"foo". - LitChar
- A character literal:
'a'. - LitFloat
- A floating point literal:
1f64or1.0e10f64. - LitInt
- An integer literal:
1or1u16. - LitStr
- A UTF-8 string literal:
"foo". - Macro
fullorderive - A macro invocation:
println!("{}", mac). - Meta
List fullorderive - A structured list within an attribute, like
derive(Copy, Clone). - Meta
Name Value fullorderive - A name-value pair within an attribute, like
feature = "nightly". - Parenthesized
Generic Arguments fullorderive - Arguments of a function path segment: the
(A, B) -> CinFn(A,B) -> C. - Path
fullorderive - A path at which a named item is exported (e.g.
std::collections::HashMap). - Path
Segment fullorderive - A segment of a path together with any path arguments on that segment.
- Predicate
Lifetime fullorderive - A lifetime predicate in a
whereclause:'a: 'b + 'c. - Predicate
Type fullorderive - A type predicate in a
whereclause:for<'c> Foo<'c>: Trait<'c>. - QSelf
fullorderive - The explicit Self type in a qualified path: the
Tin<T as Display>::fmt. - Trait
Bound fullorderive - A trait used as a bound on a type parameter.
- Turbofish
( fullorderive) andprinting - Returned by
TypeGenerics::as_turbofish. - Type
Array fullorderive - A fixed size array type:
[T; n]. - Type
Bare Fn fullorderive - A bare function type:
fn(usize) -> bool. - Type
Generics ( fullorderive) andprinting - Returned by
Generics::split_for_impl. - Type
Group fullorderive - A type contained within invisible delimiters.
- Type
Impl Trait fullorderive - An
impl Bound1 + Bound2 + Bound3type whereBoundis a trait or a lifetime. - Type
Infer fullorderive - Indication that a type should be inferred by the compiler:
_. - Type
Macro fullorderive - A macro in the type position.
- Type
Never fullorderive - The never type:
!. - Type
Param fullorderive - A generic type parameter:
T: Into<String>. - Type
Paren fullorderive - A parenthesized type equivalent to the inner type.
- Type
Path fullorderive - A path like
std::slice::Iter, optionally qualified with a self-type as in<Vec<T> as SomeTrait>::Associated. - TypePtr
fullorderive - A raw pointer type:
*const Tor*mut T. - Type
Reference fullorderive - A reference type:
&'a Tor&'a mut T. - Type
Slice fullorderive - A dynamically sized slice type:
[T]. - Type
Trait Object fullorderive - A trait object type
dyn Bound1 + Bound2 + Bound3whereBoundis a trait or a lifetime. - Type
Tuple fullorderive - A tuple type:
(A, B, C, String). - Variant
fullorderive - An enum variant.
- VisRestricted
fullorderive - A visibility level restricted to some path:
pub(self)orpub(super)orpub(crate)orpub(in some::module). - Where
Clause fullorderive - A
whereclause in a definition:where T: Deserialize<'de>, D: 'static.
Enums§
- Attr
Style fullorderive - Distinguishes between attributes that decorate an item and attributes that are contained within an item.
- BinOp
fullorderive - A binary operator:
+,+=,&. - Data
derive - The storage of a struct, enum or union data structure.
- Expr
fullorderive - A Rust expression.
- Field
Mutability fullorderive - Unused, but reserved for RFC 3323 restrictions.
- Fields
fullorderive - Data stored within an enum variant or struct.
- Generic
Argument fullorderive - An individual generic argument, like
'a,T, orItem = T. - Generic
Param fullorderive - A generic type parameter, lifetime, or const generic:
T: Into<String>,'a: 'b,const LEN: usize. - Lit
- A Rust literal such as a string or integer or boolean.
- Macro
Delimiter fullorderive - A grouping token that surrounds a macro body:
m!(...)orm!{...}orm![...]. - Member
fullorderive - A struct or tuple struct field accessed in a struct literal or field expression.
- Meta
fullorderive - Content of a compile-time structured attribute.
- Path
Arguments fullorderive - Angle bracketed or parenthesized arguments of a path segment.
- Return
Type fullorderive - Return type of a function signature.
- Trait
Bound Modifier fullorderive - A modifier on a trait bound, currently only used for the
?in?Sized. - Type
fullorderive - The possible types that a Rust value could have.
- Type
Param Bound fullorderive - A trait or lifetime used as a bound on a type parameter.
- UnOp
fullorderive - A unary operator:
*,!,-. - Visibility
fullorderive - The visibility level of an item: inherited or
puborpub(restricted). - Where
Predicate fullorderive - A single predicate in a
whereclause:T: Deserialize<'de>.
Functions§
- parse
parsingandproc-macro - Parse tokens of source code into the chosen syntax tree node.
- parse2
parsing - Parse a proc-macro2 token stream into the chosen syntax tree node.
- parse_
str parsing - Parse a string of Rust code into the chosen syntax tree node.
Type Aliases§
- Result
- The result of a Syn parser.