Sharing a data angle in case it’s useful.
US public companies disclose disaggregated revenue (by product and by geography) in their 10-K/10-Q/20-F filings, tagged as XBRL dimensional facts. It’s all free and public on SEC EDGAR, but it’s genuinely hard to use raw:
the geography axis is tagged inconsistently (some filers use ISO country codes, some US state codes, some their own “rest of world” catch-alls), companies mix subtotals and leaves on the product axis, and 10-Qs report cumulative half-year/nine-month figures instead of standalone quarters.
If you’re assembling this yourself, the things that bit me: keep single-axis facts only (the filings rarely tag product×geography as one crossed fact), preserve subtotal members rather than pruning them, and reconstruct standalone quarters by subtracting the cumulative periods. Period-classify each fact against the company’s real fiscal-year end, not the calendar.
I maintain a cleaned-up version of this as the StockFit API, but the underlying data is all on EDGAR if you want to parse it yourself with Arelle.
Happy to answer any questions.
submitted by /u/Either_Door_5500
[link] [comments]