| REVENUE-FOCUSED FINANCIAL DATA EXTRACTION | |
| === DOCUMENT TO ANALYZE === | |
| File: {file_path} | |
| (Document will be provided directly to you for analysis) | |
| === YOUR MISSION === | |
| Extract ONLY revenue-related financial data from the provided document with 100% accuracy. | |
| Focus exclusively on company name and revenue data - ignore all other financial metrics. | |
| You are using gemini-2.5-pro with thinking budget optimization. | |
| === WHAT TO EXTRACT (REVENUE ONLY) === | |
| **MANDATORY (Must Extract):** | |
| 1. **Company Name** - Official legal company name | |
| 2. **Total Revenue** - Consolidated revenue/net sales for most recent period | |
| 3. **Document Type** - 10-K, 10-Q, Annual Report, Quarterly Report, etc. | |
| 4. **Reporting Period** - FY 2023, Q1 2024, etc. | |
| 5. **Currency** - USD, EUR, etc. | |
| **HIGH VALUE (Extract if clearly present):** | |
| 6. **Segment Revenue** - Revenue by business segment/division/product line | |
| 7. **Regional Revenue** - Revenue by geographic region/country | |
| **IGNORE COMPLETELY:** | |
| - Net income, profit, losses | |
| - Assets, liabilities, equity | |
| - Cash flow data | |
| - Expenses, costs, operating income | |
| - Balance sheet items | |
| - Ratios, per-share metrics | |
| - Non-financial data | |
| === SYSTEMATIC EXTRACTION PROCESS === | |
| **Step 1: Document Structure Analysis** | |
| - Scan document to understand structure and layout | |
| - Identify document type and reporting period | |
| - Locate revenue-related sections (Income Statement, Segment Reporting, Geographic Data) | |
| **Step 2: Company Identification** | |
| - Extract official company name from document header/title | |
| - Verify name consistency throughout document | |
| - Use parent company name if multiple entities present | |
| **Step 3: Total Revenue Extraction (CRITICAL)** | |
| - Find consolidated revenue figure for most recent period | |
| - Look in: Consolidated Statements of Operations, Income Statement | |
| - Search terms: "Revenue", "Net Sales", "Total Revenue", "Net Revenue" | |
| - Record exact value with currency and time period | |
| **Step 4: Segment Revenue Analysis** | |
| - Locate segment reporting section (usually separate section after financial statements) | |
| - Extract revenue by business segment, division, or product line | |
| - Common segments: Products, Services, Geographic regions, Business units | |
| - Ensure segment revenues sum to total revenue for validation | |
| **Step 5: Regional Revenue Analysis** | |
| - Find geographic revenue breakdown section | |
| - Look for revenue by: Americas, EMEA, APAC, US vs International, specific countries | |
| - Extract revenue figures for major geographic regions | |
| - Validate regional totals match consolidated revenue | |
| **Step 6: Data Validation** | |
| - Verify company name is not empty | |
| - Confirm total revenue has high confidence score (>0.7) | |
| - Check that segment/regional breakdowns sum to total | |
| - Ensure all mandatory fields are extracted | |
| === CONFIDENCE SCORING (REVENUE DATA ONLY) === | |
| - **1.0**: Revenue clearly stated in financial table with proper labels | |
| - **0.8**: Revenue stated in structured text with clear context | |
| - **0.6**: Revenue derived from segment/regional totals | |
| - **0.4**: Revenue estimated or context somewhat unclear | |
| - **0.2**: Revenue barely visible or questionable source | |
| - **0.0**: Revenue not found or completely unclear | |
| === OUTPUT REQUIREMENTS === | |
| Return ExtractedFinancialData with ONLY revenue-related data: | |
| ```json | |
| { | |
| "company_name": "[Official Company Name]", | |
| "document_type": "[10-K|10-Q|Annual Report|etc.]", | |
| "reporting_period": "[FY 2023|Q1 2024|etc.]", | |
| "currency": "[USD|EUR|etc.]", | |
| "data_points": [ | |
| { | |
| "field_name": "Total Revenue", | |
| "value": "$50.3 billion", | |
| "category": "Revenue", | |
| "period": "FY 2023", | |
| "unit": "USD billions", | |
| "confidence": 1.0 | |
| }, | |
| { | |
| "field_name": "Product Revenue", | |
| "value": "$30.2 billion", | |
| "category": "Segment Revenue", | |
| "period": "FY 2023", | |
| "unit": "USD billions", | |
| "confidence": 0.9 | |
| }, | |
| { | |
| "field_name": "Americas Revenue", | |
| "value": "$25.1 billion", | |
| "category": "Regional Revenue", | |
| "period": "FY 2023", | |
| "unit": "USD billions", | |
| "confidence": 0.8 | |
| } | |
| ], | |
| "summary": "[2-3 sentences describing key revenue findings and trends]" | |
| } | |
| ``` | |
| === SUCCESS CRITERIA === | |
| Extraction is successful ONLY if: | |
| β Company name extracted (never empty) | |
| β Total revenue extracted with confidence > 0.5 | |
| β Document type and period identified | |
| β Currency specified | |
| β All data points are revenue-related only | |
| β Summary focuses on revenue insights (2-3 sentences) | |
| β Segment/regional data sums to total (if present) | |
| === REVENUE EXTRACTION STRATEGY === | |
| 1. **Income Statement First** - Look for consolidated revenue in primary financial statements | |
| 2. **Segment Section Second** - Find detailed segment revenue breakdowns | |
| 3. **Geographic Section Third** - Locate regional revenue data | |
| 4. **Management Discussion** - Check for revenue highlights and explanations | |
| 5. **Tables Over Text** - Prioritize tabular data over narrative mentions | |
| **Remember**: Focus EXCLUSIVELY on revenue data. Ignore all other financial metrics. Your goal is 100% accuracy on revenue extraction with proper segment and regional breakdowns. | |