|
REVENUE-FOCUSED FINANCIAL DATA EXTRACTION |
|
|
|
=== DOCUMENT TO ANALYZE === |
|
File: {file_path} |
|
(Document will be provided directly to you for analysis) |
|
|
|
=== YOUR MISSION === |
|
Extract ONLY revenue-related financial data from the provided document with 100% accuracy. |
|
Focus exclusively on company name and revenue data - ignore all other financial metrics. |
|
You are using gemini-2.5-pro with thinking budget optimization. |
|
|
|
=== WHAT TO EXTRACT (REVENUE ONLY) === |
|
|
|
**MANDATORY (Must Extract):** |
|
1. **Company Name** - Official legal company name |
|
2. **Total Revenue** - Consolidated revenue/net sales for most recent period |
|
3. **Document Type** - 10-K, 10-Q, Annual Report, Quarterly Report, etc. |
|
4. **Reporting Period** - FY 2023, Q1 2024, etc. |
|
5. **Currency** - USD, EUR, etc. |
|
|
|
**HIGH VALUE (Extract if clearly present):** |
|
6. **Segment Revenue** - Revenue by business segment/division/product line |
|
7. **Regional Revenue** - Revenue by geographic region/country |
|
|
|
**IGNORE COMPLETELY:** |
|
- Net income, profit, losses |
|
- Assets, liabilities, equity |
|
- Cash flow data |
|
- Expenses, costs, operating income |
|
- Balance sheet items |
|
- Ratios, per-share metrics |
|
- Non-financial data |
|
|
|
=== SYSTEMATIC EXTRACTION PROCESS === |
|
|
|
**Step 1: Document Structure Analysis** |
|
- Scan document to understand structure and layout |
|
- Identify document type and reporting period |
|
- Locate revenue-related sections (Income Statement, Segment Reporting, Geographic Data) |
|
|
|
**Step 2: Company Identification** |
|
- Extract official company name from document header/title |
|
- Verify name consistency throughout document |
|
- Use parent company name if multiple entities present |
|
|
|
**Step 3: Total Revenue Extraction (CRITICAL)** |
|
- Find consolidated revenue figure for most recent period |
|
- Look in: Consolidated Statements of Operations, Income Statement |
|
- Search terms: "Revenue", "Net Sales", "Total Revenue", "Net Revenue" |
|
- Record exact value with currency and time period |
|
|
|
**Step 4: Segment Revenue Analysis** |
|
- Locate segment reporting section (usually separate section after financial statements) |
|
- Extract revenue by business segment, division, or product line |
|
- Common segments: Products, Services, Geographic regions, Business units |
|
- Ensure segment revenues sum to total revenue for validation |
|
|
|
**Step 5: Regional Revenue Analysis** |
|
- Find geographic revenue breakdown section |
|
- Look for revenue by: Americas, EMEA, APAC, US vs International, specific countries |
|
- Extract revenue figures for major geographic regions |
|
- Validate regional totals match consolidated revenue |
|
|
|
**Step 6: Data Validation** |
|
- Verify company name is not empty |
|
- Confirm total revenue has high confidence score (>0.7) |
|
- Check that segment/regional breakdowns sum to total |
|
- Ensure all mandatory fields are extracted |
|
|
|
=== CONFIDENCE SCORING (REVENUE DATA ONLY) === |
|
- **1.0**: Revenue clearly stated in financial table with proper labels |
|
- **0.8**: Revenue stated in structured text with clear context |
|
- **0.6**: Revenue derived from segment/regional totals |
|
- **0.4**: Revenue estimated or context somewhat unclear |
|
- **0.2**: Revenue barely visible or questionable source |
|
- **0.0**: Revenue not found or completely unclear |
|
|
|
=== OUTPUT REQUIREMENTS === |
|
Return ExtractedFinancialData with ONLY revenue-related data: |
|
|
|
```json |
|
{ |
|
"company_name": "[Official Company Name]", |
|
"document_type": "[10-K|10-Q|Annual Report|etc.]", |
|
"reporting_period": "[FY 2023|Q1 2024|etc.]", |
|
"currency": "[USD|EUR|etc.]", |
|
"data_points": [ |
|
{ |
|
"field_name": "Total Revenue", |
|
"value": "$50.3 billion", |
|
"category": "Revenue", |
|
"period": "FY 2023", |
|
"unit": "USD billions", |
|
"confidence": 1.0 |
|
}, |
|
{ |
|
"field_name": "Product Revenue", |
|
"value": "$30.2 billion", |
|
"category": "Segment Revenue", |
|
"period": "FY 2023", |
|
"unit": "USD billions", |
|
"confidence": 0.9 |
|
}, |
|
{ |
|
"field_name": "Americas Revenue", |
|
"value": "$25.1 billion", |
|
"category": "Regional Revenue", |
|
"period": "FY 2023", |
|
"unit": "USD billions", |
|
"confidence": 0.8 |
|
} |
|
], |
|
"summary": "[2-3 sentences describing key revenue findings and trends]" |
|
} |
|
``` |
|
|
|
=== SUCCESS CRITERIA === |
|
Extraction is successful ONLY if: |
|
β Company name extracted (never empty) |
|
β Total revenue extracted with confidence > 0.5 |
|
β Document type and period identified |
|
β Currency specified |
|
β All data points are revenue-related only |
|
β Summary focuses on revenue insights (2-3 sentences) |
|
β Segment/regional data sums to total (if present) |
|
|
|
=== REVENUE EXTRACTION STRATEGY === |
|
1. **Income Statement First** - Look for consolidated revenue in primary financial statements |
|
2. **Segment Section Second** - Find detailed segment revenue breakdowns |
|
3. **Geographic Section Third** - Locate regional revenue data |
|
4. **Management Discussion** - Check for revenue highlights and explanations |
|
5. **Tables Over Text** - Prioritize tabular data over narrative mentions |
|
|
|
**Remember**: Focus EXCLUSIVELY on revenue data. Ignore all other financial metrics. Your goal is 100% accuracy on revenue extraction with proper segment and regional breakdowns. |
|
|
|
|