RCParsing 4.1.0
See the version list below for details.
dotnet add package RCParsing --version 4.1.0
NuGet\Install-Package RCParsing -Version 4.1.0
<PackageReference Include="RCParsing" Version="4.1.0" />
<PackageVersion Include="RCParsing" Version="4.1.0" />
<PackageReference Include="RCParsing" />
paket add RCParsing --version 4.1.0
#r "nuget: RCParsing, 4.1.0"
#:package RCParsing@4.1.0
#addin nuget:?package=RCParsing&version=4.1.0
#tool nuget:?package=RCParsing&version=4.1.0
RCParsing
A Fluent, Lexerless Parser Builder for .NET — Define ANY grammars with the elegance of BNF and the power of C#.
This library focuses on Developer-experience (DX) first, providing best toolkit for creating your programming languages, file formats or even data extraction tools with declarative API, debugging tools, and more. This allows you to design your parser directly in code and easily fix it using rule stack traces and detailed error messages.
Why RCParsing?
- 🐍 Hybrid Power: Unique support for barrier tokens to parse indent-sensitive languages like Python and YAML.
- ☄️ Incremental Parsing: Edit large documents with instant feedback. Our persistent AST enables efficient re-parsing of only changed sections, perfect for LSP servers and real-time editing scenarios.
- 💪 Regex on Steroids: You can find all matches for target structure in the input text with detailed AST information and transformed value.
- 🌀 Lexerless Freedom: No token priority headaches. Parse directly from raw text, even with keywords embedded in strings. Tokens are used just as lightweight matching primitives.
- 🎨 Fluent API: Write parsers in C# that read like clean BNF grammars, boosting readability and maintainability compared to imperative, functional or code-generation approaches.
- 🧩 Combinator Style: Unlock maximum performance by defining complex tokens with immediate value transformation, bypassing the AST construction entirely for a direct, allocation-free result. Perfect for high-speed parsing of well-defined formats. Also can be used with AST mode.
- 🐛 Superior Debugging: Get detailed, actionable error messages with stack traces, walk traces and precise source locations. Richest API for manual error information included.
- 🚑 Error Recovery: Define custom recovery strategies per rule to handle syntax errors and go further.
- ⚡ Blazing Fast: Performance is now on par with the fastest .NET parsing libraries (see benchmarks below).
- 🌳 Rich AST: Parser makes an AST (Abstract Syntax Tree) from raw text, with ability to optimize, fully analyze and calculate the result value entirely lazy, reducing unnecessary allocations.
- 🔧 Configurable Skipping: Advanced strategies for whitespace and comments, allowing you to use conflicting tokens in your main rules.
- 📦 Batteries Included: Useful built-in tokens and rules (regex, identifiers, numbers, escaped strings, separated lists, custom tokens, and more...).
- 🖥️ Broad Compatibility: Targets
.NET Standard 2.0
(runs on.NET Framework 4.6.1+
),.NET 6.0
, and.NET 8.0
.
Table of contents
- Installation
- Tutorials, docs and examples
- Simple examples - The examples that you can copy, paste, run or look!
- A + B - Basic arithmetic expression parser with result calculation.
- JSON (with incremental parsing) - A complete JSON parser with comments and skipping (with incremental parsing example included).
- Python-like - Demonstrating barrier tokens for indentation.
- JSON token combination - A maximum speed approach for getting values without AST.
- Finding patterns - How to find all occurrences of a rule in a string.
- Errors example - Just a simple example of how errors look in default and debug modes.
- Comparison with other parsing libraries
- Benchmarks
- Projects using RCParsing
- Roadmap
- Contributing
Installation
You can install the package via NuGet Package Manager or console window, using one of these commands:
dotnet add package RCParsing
Install-Package RCParsing
Or do it manually by cloning this repository.
Tutorials, docs and examples
- Tutorials - detailed tutorials, explaining features and mechanics of this library, highly recommended to read!
Simple examples
A + B
Here is simple example how to make simple parser that parses "a + b" string with numbers and transforms the result:
using RCParsing;
using RCParsing.Building;
// First, you need to create a builder
var builder = new ParserBuilder();
// Enable and configure the auto-skip for 'Whitespaces' (you can replace it with any other rule)
builder.Settings.SkipWhitespaces();
// Create a main sequential expression rule
builder.CreateMainRule("expression")
.Number<double>()
.LiteralChoice("+", "-")
.Number<double>()
.Transform(v => {
var value1 = v.GetValue<double>(0);
var op = v.GetValue<string>(1);
var value2 = v.GetValue<double>(2);
return op == "+" ? value1 + value2 : value1 - value2;
});
// Build the parser
var parser = builder.Build();
// Parse a string using 'expression' rule and get the raw AST (value will be calculated lazily)
var parsedRule = parser.Parse("10 + 15");
// We can now get the value from our 'Transform' functions (value calculates now)
var transformedValue = parsedRule.GetValue<double>();
Console.WriteLine(transformedValue); // 25
JSON (with incremental parsing)
And here is JSON example that also shows the partial re-parsing of parse tree:
var builder = new ParserBuilder();
// Configure AST type and skip-rule for whitespace and comments
builder.Settings
.Skip(r => r.Rule("skip"), ParserSkippingStrategy.SkipBeforeParsingGreedy)
.UseLazyAST(); // Use lazy AST type to store cached resuls
// The rule that will be skipped before every parsing attempt
builder.CreateRule("skip")
.Choice(
b => b.Whitespaces(),
b => b.Literal("//").TextUntil('\n', '\r'))
.ConfigureForSkip();
builder.CreateToken("string")
.Literal('"')
.EscapedTextPrefix(prefix: '\\', '\\', '\"') // This sub-token automatically escapes the source string and puts it into intermediate value
.Literal('"')
.Pass(index: 1); // Pass the EscapedTextPrefix's intermediate value up (it will be used as token's result value)
builder.CreateToken("number")
.Number<double>();
builder.CreateToken("boolean")
.LiteralChoice("true", "false").Transform(v => v.Text == "true");
builder.CreateToken("null")
.Literal("null").Transform(v => null);
builder.CreateRule("value")
.Choice(
c => c.Token("string"),
c => c.Token("number"),
c => c.Token("boolean"),
c => c.Token("null"),
c => c.Rule("array"),
c => c.Rule("object")
); // Choice rule propagates child's value by default
builder.CreateRule("array")
.Literal("[")
.ZeroOrMoreSeparated(v => v.Rule("value"), s => s.Literal(","),
allowTrailingSeparator: true, includeSeparatorsInResult: false)
.TransformLast(v => v.SelectArray())
.Literal("]")
.TransformSelect(index: 1); // Selects the Children[1]'s value
builder.CreateRule("object")
.Literal("{")
.ZeroOrMoreSeparated(v => v.Rule("pair"), s => s.Literal(","),
allowTrailingSeparator: true, includeSeparatorsInResult: false)
.TransformLast(v => v.SelectValues<KeyValuePair<string, object>>().ToDictionary(k => k.Key, v => v.Value))
.Literal("}")
.TransformSelect(index: 1);
builder.CreateRule("pair")
.Token("string")
.Literal(":")
.Rule("value")
.Transform(v => KeyValuePair.Create(v.GetValue<string>(0), v.GetValue(2)));
builder.CreateMainRule("content")
.Rule("value")
.EOF() // Sure that we captured all the input
.TransformSelect(0);
var jsonParser = builder.Build();
var json =
"""
{
"id": 1,
"name": "Sample Data",
"created": "2023-01-01T00:00:00", // This is a comment
"tags": ["tag1", "tag2", "tag3"],
"isActive": true,
"nested": {
"value": 123.456,
"description": "Nested description"
}
}
""";
// The same JSON, but with 'tags' value changed
var changedJson =
"""
{
"id": 1,
"name": "Sample Data",
"created": "2023-01-01T00:00:00", // This is a comment
"tags": { "nested": ["tag1", "tag2", "tag3"] },
"isActive": true,
"nested": {
"value": 123.456,
"description": "Nested description"
}
}
""";
// Parse the input text and calculate values (them will be recorded into the cache because we're using lazy AST)
var ast = jsonParser.Parse(json);
var value = ast.Value as Dictionary<string, object>;
var tags = value!["tags"] as object[];
var nested = value!["nested"] as Dictionary<string, object>;
// Prints: Sample Data
Console.WriteLine(value["name"]);
// Prints: tag1
Console.WriteLine(tags![0]);
// Re-parse the sligtly changed input string and get the values
var changedAst = ast.Reparsed(changedJson);
var changedValue = changedAst.Value as Dictionary<string, object>;
var changedTags = changedValue!["tags"] as Dictionary<string, object>;
var nestedTags = changedTags!["nested"] as object[];
var changedNested = changedValue!["nested"] as Dictionary<string, object>;
// Prints type: System.Object[]
Console.WriteLine(changedTags["nested"]);
// Prints: tag1
Console.WriteLine(nestedTags![0]);
// And untouched values remains the same!
// Prints: True
Console.WriteLine(ReferenceEquals(nested, changedNested));
Python-like
This example involves our killer-feature, barrier tokens that allows to parse indentations without missing them:
using RCParsing;
using RCParsing.Building;
var builder = new ParserBuilder();
builder.Settings.SkipWhitespaces();
// Add the 'INDENT' and 'DEDENT' barrier tokenizer
// 'INDENT' is emitted when indentation grows
// And 'DEDENT' is emitted when indentation cuts
// They are indentation delta tokens
builder.BarrierTokenizers
.AddIndent(indentSize: 4, "INDENT", "DEDENT");
// Create the statement rule
builder.CreateRule("statement")
.Choice(
b => b
.Literal("def")
.Identifier()
.Literal("():")
.Rule("block"),
b => b
.Literal("if")
.Identifier()
.Literal(":")
.Rule("block"),
b => b
.Identifier()
.Literal("=")
.Identifier()
.Literal(";"));
// Create the 'block' rule that matches our 'INDENT' and 'DEDENT' barrier tokens
builder.CreateRule("block")
.Token("INDENT")
.OneOrMore(b => b.Rule("statement"))
.Token("DEDENT");
builder.CreateMainRule("program")
.ZeroOrMore(b => b.Rule("statement"))
.EOF();
var parser = builder.Build();
string inputStr =
"""
def a():
b = c;
c = a;
a = p;
if c:
h = i;
if b:
a = aa;
""";
// Get the optimized AST...
var ast = parser.Parse(inputStr).Optimized();
// And print it!
foreach (var statement in ast.Children)
{
Console.WriteLine(statement.Text);
Console.Write("\n\n");
}
// Outputs:
/*
def a():
b = c;
c = a;
a = p;
if c:
h = i;
if b:
a = aa;
*/
JSON token combination
Tokens in this parser can be complex enough to act like the combinators, with immediate value transformation without AST:
var builder = new ParserBuilder();
// Use lookahead for 'Choice' tokens
builder.Settings.UseFirstCharacterMatch();
builder.CreateToken("string")
// 'Between' token pattern matches a sequence of three elements,
// but calculates and propagates intermediate value of second element
.Between(
b => b.Literal('"'),
b => b.TextUntil('"'),
b => b.Literal('"'));
builder.CreateToken("number")
.Number<double>();
builder.CreateToken("boolean")
// 'Map' token pattern applies intermediate value transformer to child's value
.Map<string>(b => b.LiteralChoice("true", "false"), m => m == "true");
builder.CreateToken("null")
// 'Return' does not calculates value for child element, just returns 'null' here
.Return(b => b.Literal("null"), null);
builder.CreateToken("value")
// Skip whitespaces before value token
.SkipWhitespaces(b =>
// 'Choice' token selects the matched token's value
b.Choice(
c => c.Token("string"),
c => c.Token("number"),
c => c.Token("boolean"),
c => c.Token("null"),
c => c.Token("array"),
c => c.Token("object")
));
builder.CreateToken("value_list")
.ZeroOrMoreSeparated(
b => b.Token("value"),
b => b.SkipWhitespaces(b => b.Literal(',')),
includeSeparatorsInResult: false)
// You can apply passage function for tokens that
// matches multiple and variable amount of child elements
.Pass(v =>
{
return v.ToArray();
});
builder.CreateToken("array")
.Between(
b => b.Literal('['),
b => b.Token("value_list"),
b => b.SkipWhitespaces(b => b.Literal(']')));
builder.CreateToken("pair")
.SkipWhitespaces(b => b.Token("string"))
.SkipWhitespaces(b => b.Literal(':'))
.Token("value")
.Pass(v =>
{
return KeyValuePair.Create((string)v[0]!, v[2]);
});
builder.CreateToken("pair_list")
.ZeroOrMoreSeparated(
b => b.Token("pair"),
b => b.SkipWhitespaces(b => b.Literal(',')))
.Pass(v =>
{
return v.Cast<KeyValuePair<string, object>>().ToDictionary();
});
builder.CreateToken("object")
.Between(
b => b.Literal('{'),
b => b.Token("pair_list"),
b => b.SkipWhitespaces(b => b.Literal('}')));
var parser = builder.Build();
var json =
"""
{
"id": 1,
"name": "Sample Data",
"created": "2023-01-01T00:00:00",
"tags": ["tag1", "tag2", "tag3"],
"isActive": true,
"nested": {
"value": 123.456,
"description": "Nested description"
}
}
""";
// Match the token directly and produce intermediate value
var result = parser.MatchToken<Dictionary<string, object>>("value", json);
Console.WriteLine(result["name"]); // Outputs: Sample Data
Finding patterns
The FindAllMatches
method allows you to extract all occurrences of a pattern from a string, even in complex inputs, while handling optional transformations. Here's an example where will find the Price: *PRICE* (USD|EUR)
pattern:
var builder = new ParserBuilder();
// Skip unnecessary whitespace (you can configure comments here and they will be ignored when matching)
builder.Settings.SkipWhitespaces();
// Create the rule that we will find in text
builder.CreateMainRule()
.Literal("Price:")
.Number<double>() // 1
.LiteralChoice("USD", "EUR") // 2
.Transform(v =>
{
var number = v[1].Value; // Get the number value
var currency = v[2].Text; // Get the 'USD' or 'EUR' text
return new { Amount = number, Currency = currency };
});
var input =
"""
Some log entries.
Price: 42.99 USD
Error: something happened.
Price: 99.50 EUR
Another line.
Price: 2.50 USD
""";
// Find all transformed matches
var prices = builder.Build().FindAllMatches<dynamic>(input).ToList();
foreach (var price in prices)
{
Console.WriteLine($"Price: {price.Amount}; Currency: {price.Currency}");
}
Errors example
There is how errors are displayed in the default mode:
RCParsing.ParsingException : An error occurred during parsing:
The line where the error occurred (position 130):
"tags": ["tag1", "tag2", "tag3"],,
line 5, column 35 ^
',' is unexpected character, expected one of:
'string'
literal '}'
... and more errors omitted
And there is errors when using the builder.Settings.UseDebug()
setting:
RCParsing.ParsingException : An error occurred during parsing:
['string']: Failed to parse token.
['pair']: Failed to parse sequence rule.
[literal '}']: Failed to parse token.
['object']: Failed to parse sequence rule.
The line where the error occurred (position 130):
"tags": ["tag1", "tag2", "tag3"],,
line 5, column 35 ^
',' is unexpected character, expected one of:
'string'
'pair'
literal '}'
'object'
['string'] Stack trace (top call recently):
- Sequence 'pair':
'string' <-- here
literal ':'
'value'
- SeparatedRepeat[0..] (allow trailing): 'pair' <-- here
sep literal ','
- Sequence 'object':
literal '{'
SeparatedRepeat[0..] (allow trailing)... <-- here
literal '}'
- Choice 'value':
'string'
'number'
'boolean'
'null'
'array'
'object' <-- here
- Sequence 'content':
'value' <-- here
end of file
[literal '}'] Stack trace (top call recently):
- Sequence 'object':
literal '{'
SeparatedRepeat[0..] (allow trailing)...
literal '}' <-- here
- Choice 'value':
'string'
'number'
'boolean'
'null'
'array'
'object' <-- here
- Sequence 'content':
'value' <-- here
end of file
... and more errors omitted
Walk Trace:
... 316 hidden parsing steps. Total: 356 ...
[ENTER] pos:128 literal '//'
[FAIL] pos:128 literal '//' failed to match: '],,\r\n\t"isActive...'
[FAIL] pos:128 Sequence... failed to match: '],,\r\n\t"isActive...'
[FAIL] pos:128 'skip' failed to match: '],,\r\n\t"isActive...'
[ENTER] pos:128 literal ','
[FAIL] pos:128 literal ',' failed to match: '],,\r\n\t"isActive...'
[SUCCESS] pos:106 SeparatedRepeat[0..] (allow trailing)... matched: '"tag1", "tag2", "tag3"' [22 chars]
[ENTER] pos:128 literal ']'
[SUCCESS] pos:128 literal ']' matched: ']' [1 chars]
[SUCCESS] pos:105 'array' matched: '["tag1", "tag2", "tag3"]' [24 chars]
[SUCCESS] pos:105 'value' matched: '["tag1", "tag2", "tag3"]' [24 chars]
[SUCCESS] pos:97 'pair' matched: '"tags": ["tag1" ..... ", "tag3"]' [32 chars]
[ENTER] pos:129 'skip'
[ENTER] pos:129 whitespaces
[FAIL] pos:129 whitespaces failed to match: ',,\r\n\t"isActive"...'
[ENTER] pos:129 Sequence...
[ENTER] pos:129 literal '//'
[FAIL] pos:129 literal '//' failed to match: ',,\r\n\t"isActive"...'
[FAIL] pos:129 Sequence... failed to match: ',,\r\n\t"isActive"...'
[FAIL] pos:129 'skip' failed to match: ',,\r\n\t"isActive"...'
[ENTER] pos:129 literal ','
[SUCCESS] pos:129 literal ',' matched: ',' [1 chars]
[ENTER] pos:130 'skip'
[ENTER] pos:130 whitespaces
[FAIL] pos:130 whitespaces failed to match: ',\r\n\t"isActive":...'
[ENTER] pos:130 Sequence...
[ENTER] pos:130 literal '//'
[FAIL] pos:130 literal '//' failed to match: ',\r\n\t"isActive":...'
[FAIL] pos:130 Sequence... failed to match: ',\r\n\t"isActive":...'
[FAIL] pos:130 'skip' failed to match: ',\r\n\t"isActive":...'
[ENTER] pos:130 'pair'
[ENTER] pos:130 'string'
[FAIL] pos:130 'string' failed to match: ',\r\n\t"isActive":...'
[FAIL] pos:130 'pair' failed to match: ',\r\n\t"isActive":...'
[SUCCESS] pos:4 SeparatedRepeat[0..] (allow trailing)... matched: '"id": 1,\r\n\t"nam ..... , "tag3"],' [126 chars]
[ENTER] pos:130 literal '}'
[FAIL] pos:130 literal '}' failed to match: ',\r\n\t"isActive":...'
[FAIL] pos:0 'object' failed to match: '{\r\n\t"id": 1,\r\n\t...'
[FAIL] pos:0 'value' failed to match: '{\r\n\t"id": 1,\r\n\t...'
[FAIL] pos:0 'content' failed to match: '{\r\n\t"id": 1,\r\n\t...'
... End of walk trace ...
Comparison with Other Parsing Libraries
RCParsing
is designed to outstand with unique features, and easy developer experience, but it is good enough to compete with other fastest parser tools.
Performance at a Glance (based on benchmarks)
Library | Speed (Relative to RCParsing default mode) | Speed (Relative to RCParsing token combination style) | Memory Efficiency | Type |
---|---|---|---|---|
RCParsing | 1.00x (baseline) | 1.00x (baseline), ~5.00x faster than default | High or Excellent (based on style) | Both |
Parlot | ~3.50x-3.70x faster | ~1.20x-1.45x slower | Excellent | Combinator |
Pidgin | ~1.45x-3.00x slower | ~6.75x-13.55x slower | Excellent | Combinator |
ANTLR | ~1.20x-1.30x slower | ~6.60x-7.30x slower | High | AST-based |
Superpower | ~8.00x-8.10x slower | ~40.75x slower | Medium | Combinator |
Sprache | ~7.50x-8.10x slower | ~41.00x slower | Very low | Combinator |
Feature Comparison
This table highlights the unique architectural and usability features of each library.
Feature | RCParsing | Pidgin | Parlot | Superpower | ANTLR4 |
---|---|---|---|---|---|
Architecture | Scannerless hybrid | Scannerless | Scannerless | Lexer-based | Lexer-based with modes |
API | Fluent, lambda-based | Functional | Fluent/functional | Fluent/functional | Grammar Files |
Barrier/complex Tokens | Yes, built-in or manual | None | None | Yes, manual | Yes, manual |
Skipping | 6 strategies, global or manual | Manual | Global or manual | Lexer-based | Lexer-based |
Error Messages | Extremely Detailed, extendable with API | Simple | Manual messages | Simple | Simple by default, extendable |
Minimum .NET Target | .NET Standard 2.0 | .NET 7.0 | .NET Standard 2.0 | .NET Standard 2.0 | .NET Framework 4.5 |
Comparison with each library
ANTLR
This is a powerful tool for creating own DSLs with a wide ecosystem and multiple languages support (C#, Java, Python, C++ and others). But it requires a step of code generation and it can be a bit complex to set up for beginners. Also it uses a lexer-based algorithm, so you need to carefully setup a lexer for complex languages, also it barely fits for code/text mixed grammars. Therefore, RCParsing can be a bit slow for complex grammars, but it not tested yet.
Parlot
Parlot is known as fastest parser combinator library for .NET with support of context-specific parsing, global skipping and compilation via expression trees. But it's not that friendly for debug, you required to manually place errors in parsers, otherwise you just get nothing on parsing. But RCParsing shown that it is faster than Parlot, and it handles errors automatically, but it does not have global skip-tokens in the combinator style.
Pidgin
Pidgin is a memory-efficient and fast parser combinator library for .NET with some kind of incremental parsing support. It was created before Parlot and supports streams of any type of input, even binary. It supports LINQ-based syntax for creating parsers, but needs to manually place skip parsers in everything.
Sprache and Superpower
The legacy parser combinators for .NET, came out more than 10 years ago, but somewhat unefficient in performance, especially memory usage. But Sprache has the most readable API comparing than Pidgin and Parlot (in my opinion). Superpower is a more modern alternative to Sprache, but uses lexer-based approach and requires more amount of code.
Why RCParsing outstands
It designed to be a more convenient than other libraries, and later it been optimized and now it has a better performance than other libraries. It also supports both modes: AST and immediate calculations, or them together. RCParsing can produce stack and walk traces for errors, recover from them, and supports incremental parsing.
Benchmarks
All benchmarks are done via BenchmarkDotNet
.
Here is machine and runtime information:
BenchmarkDotNet v0.15.2, Windows 10 (10.0.19045.3448/22H2/2022Update)
AMD Ryzen 5 5600 3.60GHz, 1 CPU, 12 logical and 6 physical cores
.NET SDK 9.0.302
[Host] : .NET 8.0.18 (8.0.1825.31117), X64 RyuJIT AVX2
Job-KTXINV : .NET 8.0.18 (8.0.1825.31117), X64 RyuJIT AVX2
JSON
The JSON value calculation with the typeset Dictionary<string, object>
, object[]
, string
, int
and null
.
Method | Mean | Error | StdDev | Ratio | RatioSD | Gen0 | Gen1 | Allocated | Alloc Ratio |
---|---|---|---|---|---|---|---|---|---|
JsonBig_RCParsing | 166,563.6 ns | 1,550.19 ns | 688.30 ns | 1.00 | 0.01 | 13.1836 | 3.6621 | 222336 B | 1.00 |
JsonBig_RCParsing_Optimized | 99,578.1 ns | 1,203.19 ns | 429.07 ns | 0.60 | 0.00 | 9.2773 | 2.1973 | 156712 B | 0.70 |
JsonBig_RCParsing_TokenCombination | 30,124.3 ns | 279.09 ns | 99.53 ns | 0.18 | 0.00 | 2.5635 | 0.1831 | 43096 B | 0.19 |
JsonBig_SystemTextJson | 12,501.0 ns | 51.21 ns | 22.74 ns | 0.08 | 0.00 | 0.5035 | 0.0153 | 8648 B | 0.04 |
JsonBig_NewtonsoftJson | 48,258.1 ns | 314.92 ns | 139.83 ns | 0.29 | 0.00 | 4.7607 | 0.9766 | 80176 B | 0.36 |
JsonBig_ANTLR | 184,954.4 ns | 498.21 ns | 177.67 ns | 1.11 | 0.00 | 19.5313 | 7.5684 | 330584 B | 1.49 |
JsonBig_Parlot | 41,351.8 ns | 516.85 ns | 229.48 ns | 0.25 | 0.00 | 1.9531 | 0.1221 | 32848 B | 0.15 |
JsonBig_Pidgin | 213,947.0 ns | 1,219.96 ns | 541.67 ns | 1.28 | 0.01 | 3.9063 | 0.2441 | 66816 B | 0.30 |
JsonBig_Superpower | 1,191,550.3 ns | 3,853.95 ns | 1,374.36 ns | 7.15 | 0.03 | 39.0625 | 5.8594 | 653627 B | 2.94 |
JsonBig_Sprache | 1,232,307.4 ns | 28,065.79 ns | 12,461.38 ns | 7.40 | 0.08 | 232.4219 | 27.3438 | 3899736 B | 17.54 |
JsonShort_RCParsing | 8,965.7 ns | 64.95 ns | 28.84 ns | 1.00 | 0.00 | 0.6561 | - | 10992 B | 1.00 |
JsonShort_RCParsing_Optimized | 5,620.1 ns | 135.28 ns | 60.06 ns | 0.63 | 0.01 | 0.5341 | 0.0076 | 8976 B | 0.82 |
JsonShort_RCParsing_TokenCombination | 1,530.2 ns | 5.98 ns | 2.13 ns | 0.17 | 0.00 | 0.1354 | - | 2280 B | 0.21 |
JsonShort_SystemTextJson | 790.0 ns | 11.70 ns | 5.19 ns | 0.09 | 0.00 | 0.0401 | - | 672 B | 0.06 |
JsonShort_NewtonsoftJson | 2,824.3 ns | 287.70 ns | 127.74 ns | 0.32 | 0.01 | 0.3891 | - | 6552 B | 0.60 |
JsonShort_ANTLR | 10,575.1 ns | 43.55 ns | 15.53 ns | 1.18 | 0.00 | 1.1444 | 0.0305 | 19360 B | 1.76 |
JsonShort_Parlot | 2,199.4 ns | 16.57 ns | 5.91 ns | 0.25 | 0.00 | 0.1144 | - | 1960 B | 0.18 |
JsonShort_Pidgin | 10,794.7 ns | 109.21 ns | 48.49 ns | 1.20 | 0.01 | 0.2136 | - | 3664 B | 0.33 |
JsonShort_Superpower | 66,359.1 ns | 220.78 ns | 98.03 ns | 7.40 | 0.02 | 1.9531 | - | 34117 B | 3.10 |
JsonShort_Sprache | 64,819.0 ns | 617.46 ns | 220.19 ns | 7.23 | 0.03 | 12.6953 | 0.2441 | 213168 B | 19.39 |
Notes:
RCParsing
uses its default configuration, without any optimizations and settings applied.RCParsing_Optimized
usesUseInlining()
,UseFirstCharacterMatch()
,IgnoreErrors()
andSkipWhitespacesOptimized()
settings.RCParsing_TokenCombination
uses complex manual tokens with immediate transformations instead of rules, andUseFirstCharacterMatch()
setting.Parlot
usesCompiled()
version of parser.JsonShort
methods uses ~20 lines of hardcoded (not generated) JSON with simple content.JsonBig
methods uses ~180 lines of hardcoded (not generated) JSON with various content (deep, long objects/arrays).
Expressions
The int
value calculation from expression with parentheses ()
, spaces and operators +-/*
with priorities.
Method | Mean | Error | StdDev | Ratio | RatioSD | Gen0 | Gen1 | Allocated | Alloc Ratio |
---|---|---|---|---|---|---|---|---|---|
ExpressionBig_RCParsing | 258,484.1 ns | 4,221.70 ns | 653.31 ns | 1.00 | 0.00 | 23.4375 | 11.2305 | 399704 B | 1.00 |
ExpressionBig_RCParsing_Optimized | 179,193.0 ns | 622.43 ns | 96.32 ns | 0.69 | 0.00 | 19.7754 | 8.3008 | 334080 B | 0.84 |
ExpressionBig_RCParsing_TokenCombination | 54,463.7 ns | 1,455.89 ns | 225.30 ns | 0.21 | 0.00 | 4.1504 | 0.0610 | 70288 B | 0.18 |
ExpressionBig_Parlot | 63,761.8 ns | 339.88 ns | 88.27 ns | 0.25 | 0.00 | 3.2959 | - | 56608 B | 0.14 |
ExpressionBig_Pidgin | 700,906.7 ns | 7,006.51 ns | 1,819.57 ns | 2.71 | 0.01 | 0.9766 | - | 23540 B | 0.06 |
ExpressionShort_RCParsing | 2,310.8 ns | 34.09 ns | 8.85 ns | 1.00 | 0.00 | 0.2174 | - | 3696 B | 1.00 |
ExpressionShort_RCParsing_Optimized | 1,638.7 ns | 22.90 ns | 5.95 ns | 0.71 | 0.00 | 0.2117 | - | 3544 B | 0.96 |
ExpressionShort_RCParsing_TokenCombination | 459.0 ns | 4.37 ns | 1.14 ns | 0.20 | 0.00 | 0.0391 | - | 656 B | 0.18 |
ExpressionShort_Parlot | 603.4 ns | 5.78 ns | 1.50 ns | 0.26 | 0.00 | 0.0534 | - | 896 B | 0.24 |
ExpressionShort_Pidgin | 6,655.6 ns | 256.88 ns | 66.71 ns | 2.88 | 0.03 | 0.0153 | - | 344 B | 0.09 |
Notes:
RCParsing
uses its default configuration, without any optimizations and settings applied.RCParsing_Optimized
usesUseInlining()
,IgnoreErrors()
andSkipWhitespacesOptimized()
settings.RCParsing_TokenCombination
uses complex manual tokens with immediate transformations instead of rules, andUseFirstCharacterMatch()
setting.Parlot
usesCompiled()
version of parser.ExpressionShort
methods uses single line with 4 operators of hardcoded (not generated) expression.ExpressionBig
methods uses single line with ~400 operators of hardcoded (not generated) expression.
Regex
Matching identifiers and emails in the plain text.
Method | Mean | Error | StdDev | Ratio | RatioSD | Gen0 | Gen1 | Allocated | Alloc Ratio |
---|---|---|---|---|---|---|---|---|---|
EmailsBig_RCParsing | 236,175.3 ns | 26,801.07 ns | 6,960.15 ns | 1.00 | 0.04 | 0.9766 | - | 16568 B | 1.00 |
EmailsBig_RCParsing_Optimized | 157,271.9 ns | 5,076.92 ns | 1,318.46 ns | 0.67 | 0.02 | 0.9766 | - | 16568 B | 1.00 |
EmailsBig_Regex | 27,638.6 ns | 711.08 ns | 184.66 ns | 0.12 | 0.00 | 1.5564 | 0.1221 | 26200 B | 1.58 |
EmailsShort_RCParsing | 6,658.5 ns | 78.57 ns | 20.40 ns | 1.00 | 0.00 | 0.0916 | - | 1600 B | 1.00 |
EmailsShort_RCParsing_Optimized | 3,799.0 ns | 35.69 ns | 5.52 ns | 0.57 | 0.00 | 0.0954 | - | 1600 B | 1.00 |
EmailsShort_Regex | 931.5 ns | 13.52 ns | 3.51 ns | 0.14 | 0.00 | 0.0601 | - | 1008 B | 0.63 |
IdentifiersBig_RCParsing | 158,034.1 ns | 4,041.56 ns | 625.44 ns | 1.00 | 0.01 | 5.8594 | - | 101664 B | 1.00 |
IdentifiersBig_RCParsing_Optimized | 99,086.9 ns | 1,619.80 ns | 420.66 ns | 0.63 | 0.00 | 5.9814 | - | 101664 B | 1.00 |
IdentifiersBig_Regex | 71,439.8 ns | 4,727.93 ns | 731.65 ns | 0.45 | 0.00 | 11.1084 | 3.6621 | 187248 B | 1.84 |
IdentifiersShort_RCParsing | 4,041.5 ns | 172.86 ns | 44.89 ns | 1.00 | 0.01 | 0.2518 | - | 4240 B | 1.00 |
IdentifiersShort_RCParsing_Optimized | 2,930.9 ns | 56.37 ns | 14.64 ns | 0.73 | 0.01 | 0.2518 | - | 4240 B | 1.00 |
IdentifiersShort_Regex | 2,386.2 ns | 160.57 ns | 41.70 ns | 0.59 | 0.01 | 0.3624 | 0.0076 | 6104 B | 1.44 |
Notes:
RCParsing
uses naive pattern for matching, without any optimization settings applied.RCParsing_Optimized
uses the same pattern, but with configured skip-rule for making it faster.Regex
usesRegexOptions.Compiled
flags.Identifiers
pattern is[a-zA-Z_][a-zA-Z0-9_]*
.Emails
pattern is[a-zA-Z0-9]+@[a-zA-Z0-9]+\.[a-zA-Z0-9]+
.
More benchmarks will be later here...
Projects using RCParsing
- RCLargeLangugeModels: My project, used for
LLT
, the template Razor-like language with VERY specific syntax.
Using RCParsing in your project? We'd love to feature it here! Submit a pull request to add your project to the list.
Roadmap
The future development of RCParsing
is focused on:
- Performance: Continued profiling and optimization, especially for large files with deep structures.
- API Ergonomics: Introducing even more expressive and fluent methods (such as expression builder).
- New Built-in Rules: Adding common patterns (e.g., number with wide range of notations).
- Visualization Tooling: Exploring tools for debugging and visualizing resulting AST.
Contributing
Contributions are welcome!
This framework is born recently (2 months ago) and some little features may not be tested and be buggy.
If you have an idea about this project, you can report it to Issues
.
For contributing code, please fork the repository and make your changes in a new branch. Once you're ready, create a pull request to merge your changes into the main branch. Pull requests should include a clear description of what was changed and why.
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net5.0 was computed. net5.0-windows was computed. net6.0 is compatible. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 was computed. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
.NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
.NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
.NET Framework | net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
MonoAndroid | monoandroid was computed. |
MonoMac | monomac was computed. |
MonoTouch | monotouch was computed. |
Tizen | tizen40 was computed. tizen60 was computed. |
Xamarin.iOS | xamarinios was computed. |
Xamarin.Mac | xamarinmac was computed. |
Xamarin.TVOS | xamarintvos was computed. |
Xamarin.WatchOS | xamarinwatchos was computed. |
-
.NETStandard 2.0
- System.Memory (>= 4.6.3)
-
net6.0
- No dependencies.
-
net8.0
- No dependencies.
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last Updated |
---|---|---|
4.2.0 | 61 | 9/30/2025 |
4.1.0 | 151 | 9/23/2025 |
4.0.0 | 155 | 9/22/2025 |
3.2.0 | 204 | 9/21/2025 |
3.1.0 | 136 | 9/21/2025 |
3.0.0 | 275 | 9/18/2025 |
2.4.0 | 199 | 9/14/2025 |
2.3.0 | 148 | 9/7/2025 |
2.2.0 | 141 | 9/1/2025 |
2.1.0 | 151 | 8/30/2025 |
2.0.2 | 182 | 8/28/2025 |
2.0.1 | 183 | 8/28/2025 |
2.0.0 | 186 | 8/28/2025 |
1.1.0 | 171 | 8/24/2025 |
1.0.0 | 136 | 8/21/2025 |