Parser Simulation Lab
Realistic test documents with real-world problems. Click any scenario to see the original document side-by-side with what our parser extracted.
Mixed Date Formats
MediumA single CSV where the date column uses 5 different formats — exactly what happens when a bank exports records that span a system migration …
Multi-Line Descriptions & Truncation
MediumBank descriptions that wrap to multiple lines (common in PDF-to-CSV conversions), descriptions truncated at 30 or 40 characters, and memo lines mixed in as separate …
POS Terminal Settlement Report
MediumDaily settlement report from 12 POS terminals across 4 locations. Machine IDs, cashier codes, and batch numbers embedded in descriptions. This is the GL side …
Encoding & Special Characters
MediumInternational transactions with special characters, BOM markers, and mixed encodings. Common when dealing with foreign vendors or multinational bank accounts.
Separate Debit/Credit Columns
MediumInstead of one Amount column, this statement has separate Debit and Credit columns — exactly how many bank PDFs and older systems present data. Some …