Parser Simulation Lab

Realistic test documents with real-world problems. Click any scenario to see the original document side-by-side with what our parser extracted.

12
Test Documents
12
Parsed OK
0
Partial Parse
0
Parse Failed
Filter: All Easy Medium Hard

Mixed Date Formats

Medium

A single CSV where the date column uses 5 different formats — exactly what happens when a bank exports records that span a system migration …

BANK
22 rows 100% valid
5 known problems

Non-Standard Amount Formats

Hard

Amounts expressed in every way banks and accounting systems do it: parentheses for negatives, DR/CR suffixes, currency symbols, spaces in numbers, and European comma decimals.

BANK
17 rows 100% valid
7 known problems

No Column Headers

Hard

Exported data with NO header row — just raw data lines. This is common with older banking systems, mainframe exports, and some ERP batch reports.

BANK
15 rows 100% valid
4 known problems

Duplicates, Reversals & Corrections

Hard

An Excel file with duplicate entries (same txn posted twice), reversal pairs, and corrected entries. This happens constantly with ACH/wire processing errors and batch resubmissions.

BANK
20 rows 100% valid
4 known problems

Multi-Line Descriptions & Truncation

Medium

Bank descriptions that wrap to multiple lines (common in PDF-to-CSV conversions), descriptions truncated at 30 or 40 characters, and memo lines mixed in as separate …

BANK
12 rows 100% valid
5 known problems

Real Chase Bank Export

Easy

Mimics actual Chase Business checking CSV export format. Uses their exact column names and formatting conventions.

BANK
12 rows 100% valid
4 known problems

Real Bank of America Export

Easy

Mimics actual Bank of America CSV download. BofA has its own column naming and includes summary rows at top.

BANK
10 rows 100% valid
4 known problems

Messy ERP GL Export

Hard

A typical GL export from an older ERP system — multi-row headers, subtotal rows mixed in, blank rows between sections, merged cells, and department totals …

GL
23 rows 100% valid
6 known problems

POS Terminal Settlement Report

Medium

Daily settlement report from 12 POS terminals across 4 locations. Machine IDs, cashier codes, and batch numbers embedded in descriptions. This is the GL side …

GL
108 rows 100% valid
5 known problems

Encoding & Special Characters

Medium

International transactions with special characters, BOM markers, and mixed encodings. Common when dealing with foreign vendors or multinational bank accounts.

BANK
10 rows 100% valid
6 known problems

Separate Debit/Credit Columns

Medium

Instead of one Amount column, this statement has separate Debit and Credit columns — exactly how many bank PDFs and older systems present data. Some …

BANK
13 rows 100% valid
4 known problems

Vendor Batch Payments (GL Side)

Hard

GL entries from an AP batch run — 10 vendors paid in one ACH batch. Individual GL lines must match to a single bank withdrawal. …

GL
15 rows 100% valid
5 known problems