How to extract financial tables from a pdf into json
- Step 1Upload the financial report PDF — Drop the document into the table extractor.
- Step 2Extract financial tables to JSON — Convert P&L, balance sheet, or cash flow tables to JSON.
- Step 3Normalise numeric values — Strip currency symbols and convert strings to numbers in the JSON.
- Step 4Use in your model or API — Feed the structured JSON to your financial model or dashboard API.
Frequently asked questions
Will negative numbers in parentheses (123) convert correctly?+
Parenthetical negatives may extract as strings. Convert them to negative numbers (e.g., (123) → -123) in your post-processing script.
Can I extract multiple tables from a PDF with one operation?+
Yes — all tables in the PDF are extracted and returned as a JSON array of table objects, each labelled by page and table index.
Is this suitable for extracting tables from XBRL-tagged annual reports?+
This tool extracts from the visual PDF table. For structured XBRL data, use an XBRL parser that reads the tagged inline XBRL data directly.
Privacy first
All PDF processing runs locally in your browser using PDF-lib and pdf.js. No file is ever uploaded — only metadata counters are saved for signed-in dashboard stats.