Spaces: DVampire (Running)
Commit 583741e · 1 Parent(s): bf5c0e0
DVampire committed: update website
- DATABASE_MIGRATION_SUMMARY.md +147 -0
- DATABASE_USAGE.md +182 -0
- Dockerfile +1 -1
- PROJECT_STRUCTURE.md +87 -0
- agents/__init__.py +0 -2
- app.py +715 -6
- cli.py +5 -58
- configs/paper_agent.py +5 -0
- frontend/index.html +12 -3
- frontend/main.js +611 -77
- frontend/paper.html +1 -1
- frontend/paper.js +29 -9
- frontend/styles.css +399 -18
- server.py +0 -731
- src/__init__.py +1 -0
- src/agents/__init__.py +3 -0
- {agents → src/agents}/evaluator.py +138 -45
- {agents → src/agents}/prompt.py +0 -11
- src/cli/__init__.py +1 -0
- src/cli/cli.py +80 -0
- src/config/config.py +4 -51
- src/crawl/__init__.py +5 -0
- src/crawl/huggingface_daily.py +309 -0
- src/database/db.py +138 -1
- src/logger/__init__.py +3 -6
- src/logger/log.py +136 -0
- src/logger/logger.py +0 -229
- src/utils/__init__.py +1 -1
- src/utils/hf_utils.py +0 -0
- test_evaluation.py +181 -0
- workdir/2508.05629.json +0 -57
- papers_cache.db → workdir/paper_agent/papers_cache.db +2 -2
DATABASE_MIGRATION_SUMMARY.md
ADDED
@@ -0,0 +1,147 @@
# Database Migration Summary

## Overview

The system has been migrated from JSON-file storage to a SQLite database. Each arXiv paper's evaluation is now stored in the database, which enables better data management and querying.

## Main Changes

### 1. Database schema (`src/database/db.py`)

**New `papers` table:**
- `arxiv_id`: unique paper identifier
- `title`, `authors`, `abstract`: basic paper metadata
- `evaluation_content`: evaluation content (JSON)
- `evaluation_score`: overall automatability score
- `evaluation_tags`: evaluation tags
- `is_evaluated`: evaluation status flag
- `evaluation_date`: evaluation time
- `created_at`, `updated_at`: timestamps

**New database methods (a schema sketch follows this list):**
- `insert_paper()`: insert a new paper
- `get_paper()`: fetch a single paper
- `update_paper_evaluation()`: update evaluation content
- `get_evaluated_papers()`: list evaluated papers
- `get_unevaluated_papers()`: list unevaluated papers
- `search_papers()`: search papers
- `get_papers_count()`: get count statistics
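For reference, here is a minimal sketch of what the `papers` table and `insert_paper()` could look like with the standard `sqlite3` module. The exact DDL and method signatures inside `src/database/db.py` are not shown in this commit summary, so the column types and the duplicate-handling policy below are assumptions based on the field list above.

```python
import sqlite3

# Illustrative schema only; column names follow the list above, types are assumed.
SCHEMA = """
CREATE TABLE IF NOT EXISTS papers (
    arxiv_id            TEXT PRIMARY KEY,
    title               TEXT NOT NULL,
    authors             TEXT NOT NULL,
    abstract            TEXT,
    categories          TEXT,
    published_date      TEXT,
    evaluation_content  TEXT,
    evaluation_score    REAL,
    evaluation_tags     TEXT,
    is_evaluated        BOOLEAN DEFAULT 0,
    evaluation_date     TIMESTAMP,
    created_at          TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
    updated_at          TIMESTAMP DEFAULT CURRENT_TIMESTAMP
)
"""

def insert_paper(conn: sqlite3.Connection, arxiv_id: str, title: str, authors: str,
                 abstract: str = None, categories: str = None,
                 published_date: str = None) -> None:
    """Insert one paper row; duplicates are ignored (assumed behaviour)."""
    conn.execute(
        "INSERT OR IGNORE INTO papers "
        "(arxiv_id, title, authors, abstract, categories, published_date) "
        "VALUES (?, ?, ?, ?, ?, ?)",
        (arxiv_id, title, authors, abstract, categories, published_date),
    )
    conn.commit()

conn = sqlite3.connect("papers_cache.db")  # db_path configured in configs/paper_agent.py
conn.execute(SCHEMA)
insert_paper(conn, "2508.05629", "Your Paper Title", "Author 1, Author 2")
```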
### 2. Evaluator changes (`src/agents/evaluator.py`)

**`ConversationState` class:**
- added an `arxiv_id` field

**`save_node` function:**
- saves to the database instead of a JSON file
- automatically extracts the score and tag information (see the sketch below)
- supports structured data storage

**`run_evaluation` function:**
- accepts an `arxiv_id` parameter
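A rough sketch of the score/tag extraction step, assuming the evaluation result carries a `scorecard` object with the keys used by the API code later in this commit (`overall_automatability`, `three_year_feasibility_pct`); the real `save_node` implementation may differ.

```python
import json

def extract_score_and_tags(evaluation_content: str):
    """Pull a headline score and compact tag string out of the evaluation JSON (illustrative only)."""
    data = json.loads(evaluation_content)
    scorecard = data.get("scorecard", {})

    score = scorecard.get("overall_automatability")             # 0-4 scale
    feasibility = scorecard.get("three_year_feasibility_pct")   # percentage

    tags = []
    if feasibility is not None:
        tags.append(f"3yr_feasibility:{feasibility}%")
    if score is not None:
        tags.append(f"overall_automatability:{score}/4")

    return score, ",".join(tags)

# Matches the tag format shown in DATABASE_USAGE.md
score, tags = extract_score_and_tags(
    '{"scorecard": {"overall_automatability": 3, "three_year_feasibility_pct": 75}}'
)
print(score, tags)  # 3 3yr_feasibility:75%,overall_automatability:3/4
```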
### 3. API changes (`app.py`)

**Modified endpoints:**
- `/api/evals`: read the evaluation list from the database
- `/api/has-eval/{paper_id}`: check evaluation status in the database
- `/api/eval/{paper_id}`: read evaluation content from the database

**New endpoints:**
- `/api/papers/status`: paper statistics
- `/api/papers/insert`: insert a new paper
- `/api/papers/evaluate/{arxiv_id}`: evaluate a paper

### 4. CLI changes (`src/cli/cli.py`)

**New argument:**
- `--arxiv-id`: the paper's arXiv ID

**Enhancements:**
- evaluation results can be saved to the database
- backward compatible (saving to a file still works)

## Usage Examples

### 1. Evaluate a paper with the CLI and save it to the database

```bash
# Evaluate a paper and save the result to the database
python cli.py https://arxiv.org/pdf/2508.05629 --arxiv-id 2508.05629

# Save to both a file and the database
python cli.py https://arxiv.org/pdf/2508.05629 --arxiv-id 2508.05629 -o /path/to/output
```

### 2. Insert a paper via the API

```bash
curl -X POST "http://localhost:8000/api/papers/insert" \
  -H "Content-Type: application/json" \
  -d '{
    "arxiv_id": "2508.05629",
    "title": "Your Paper Title",
    "authors": "Author 1, Author 2",
    "abstract": "Paper abstract...",
    "categories": "cs.AI, cs.LG",
    "published_date": "2024-08-01"
  }'
```

### 3. Get evaluation statistics

```bash
curl "http://localhost:8000/api/papers/status"
```

## Database Advantages

1. **Structured storage**: paper metadata and evaluation content are stored separately, easing management
2. **Status tracking**: the `is_evaluated` field tracks evaluation status
3. **Tag system**: evaluations can be tagged for classification and filtering
4. **Search**: search by title, author, or abstract
5. **Statistics**: paper statistics are easy to compute
6. **API support**: a complete RESTful API
7. **Data integrity**: SQLite provides ACID guarantees

## Migration Notes

1. **Existing JSON files**: a script can import existing JSON files into the database
2. **Database backups**: back up the database file regularly
3. **Backward compatibility**: the CLI still supports saving to files
4. **Configured path**: the database file path is set in `configs/paper_agent.py`

## Test Verification

A test script was created and run to verify all database features:
- ✅ paper insertion
- ✅ paper queries
- ✅ evaluation updates
- ✅ status checks
- ✅ statistics
- ✅ search

## Next Steps

1. **Data migration**: write a script to import existing JSON files into the database
2. **Frontend updates**: update the frontend to use the new database features
3. **Batch operations**: add batch paper insertion and evaluation
4. **Data export**: add data export functionality
5. **Performance**: add indexes for large datasets

## File List

**Modified files:**
- `src/database/db.py` - database schema and operations
- `src/agents/evaluator.py` - evaluator changes
- `app.py` - API changes
- `src/cli/cli.py` - CLI changes

**New files:**
- `DATABASE_USAGE.md` - usage documentation
- `DATABASE_MIGRATION_SUMMARY.md` - this summary

**Configuration:**
- `configs/paper_agent.py` - database path configuration

The system now fully supports database storage and can manage paper evaluation data much more effectively.
DATABASE_USAGE.md
ADDED
@@ -0,0 +1,182 @@
# Papers Database Usage Guide

## Overview

The system now stores arXiv papers and their evaluations in a SQLite database instead of JSON files. This enables better data management and supports querying, statistics, and tag management.

## Database Schema

### papers table

| Column | Type | Description |
|------|------|------|
| arxiv_id | TEXT PRIMARY KEY | arXiv paper ID |
| title | TEXT NOT NULL | paper title |
| authors | TEXT NOT NULL | author list |
| abstract | TEXT | paper abstract |
| categories | TEXT | paper categories |
| published_date | TEXT | publication date |
| evaluation_content | TEXT | evaluation content (JSON) |
| evaluation_score | REAL | overall automatability score |
| evaluation_tags | TEXT | evaluation tags |
| is_evaluated | BOOLEAN | whether the paper has been evaluated |
| evaluation_date | TIMESTAMP | evaluation date |
| created_at | TIMESTAMP | creation time |
| updated_at | TIMESTAMP | last update time |

## Usage

### 1. Insert a paper

```python
from src.database.db import db

# Insert a new paper
db.insert_paper(
    arxiv_id="2508.05629",
    title="Your Paper Title",
    authors="Author 1, Author 2",
    abstract="Paper abstract...",
    categories="cs.AI, cs.LG",
    published_date="2024-08-01"
)
```

### 2. Update an evaluation

```python
# Update a paper's evaluation
db.update_paper_evaluation(
    arxiv_id="2508.05629",
    evaluation_content='{"overall_automatability": 3, "three_year_feasibility": 75}',
    evaluation_score=3.0,
    evaluation_tags="3yr_feasibility:75%,overall_automatability:3/4"
)
```

### 3. Query papers

```python
# Fetch a single paper
paper = db.get_paper("2508.05629")

# List all evaluated papers
evaluated_papers = db.get_evaluated_papers()

# List all unevaluated papers
unevaluated_papers = db.get_unevaluated_papers()

# Search papers
search_results = db.search_papers("AI")
```

### 4. Statistics

```python
# Get paper counts
count = db.get_papers_count()
print(f"Total papers: {count['total']}")
print(f"Evaluated: {count['evaluated']}")
print(f"Unevaluated: {count['unevaluated']}")
```

## API Endpoints

### List evaluations
```
GET /api/evals
```

### Check whether a paper has been evaluated
```
GET /api/has-eval/{paper_id}
```

### Get a paper's evaluation
```
GET /api/eval/{paper_id}
```

### Get paper statistics
```
GET /api/papers/status
```

### Insert a new paper
```
POST /api/papers/insert
Content-Type: application/json

{
  "arxiv_id": "2508.05629",
  "title": "Paper Title",
  "authors": "Author 1, Author 2",
  "abstract": "Abstract...",
  "categories": "cs.AI",
  "published_date": "2024-08-01"
}
```

### Evaluate a paper
```
POST /api/papers/evaluate/{arxiv_id}
```
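A small client sketch for the evaluate endpoint using `httpx` (which this project already depends on in `app.py`), assuming the server runs locally on port 8000:

```python
import httpx

BASE_URL = "http://localhost:8000"  # assumption: local deployment
ARXIV_ID = "2508.05629"

# Kick off an evaluation for a paper that has already been inserted.
resp = httpx.post(f"{BASE_URL}/api/papers/evaluate/{ARXIV_ID}", timeout=30)
resp.raise_for_status()
print(resp.json())  # e.g. {"message": "...", "status": "started", "pdf_url": "..."}

# The evaluation runs as a background task; check whether it has finished.
done = httpx.get(f"{BASE_URL}/api/has-eval/{ARXIV_ID}").json()
print("evaluated:", done.get("exists"))
```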
## CLI Usage

### Evaluate a paper and save it to the database

```bash
# Use --arxiv-id to save the evaluation to the database
python cli.py https://arxiv.org/pdf/2508.05629 --arxiv-id 2508.05629

# Save to both a file and the database
python cli.py https://arxiv.org/pdf/2508.05629 --arxiv-id 2508.05629 -o /path/to/output
```

## Migrating Existing Data

If you have existing JSON evaluation files, you can write a script to import them into the database:

```python
import json
import os
from src.database.db import db

def migrate_json_to_db(json_dir="workdir"):
    """Migrate JSON evaluation files into the database."""
    for filename in os.listdir(json_dir):
        if filename.endswith('.json'):
            filepath = os.path.join(json_dir, filename)
            with open(filepath, 'r') as f:
                data = json.load(f)

            # Extract the arxiv_id (assumes the filename contains it)
            arxiv_id = filename.split('_')[0]  # adjust to your actual filename format

            # Update the evaluation in the database
            if 'response' in data:
                db.update_paper_evaluation(
                    arxiv_id=arxiv_id,
                    evaluation_content=data['response'],
                    evaluation_score=None,  # parse from the content if needed
                    evaluation_tags=None
                )
                print(f"Migrated {filename} for paper {arxiv_id}")
```
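Note that this script only attaches evaluations to paper records; it assumes the corresponding rows already exist in the `papers` table (inserted via `db.insert_paper()` or `POST /api/papers/insert`), in line with the notes below about inserting paper metadata before evaluating.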
## Advantages

1. **Structured storage**: paper metadata and evaluation content are stored separately, making queries easier
2. **Tag system**: evaluations can be tagged for classification and filtering
3. **Statistics**: paper statistics are easy to compute
4. **Search**: search papers by title, author, or abstract
5. **Status management**: the `is_evaluated` field tracks evaluation status
6. **API support**: a complete RESTful API is provided

## Notes

1. Insert a paper's basic information before evaluating it
2. Evaluation content should be JSON, which is easier to parse and display
3. Back up the database file regularly
4. The `evaluation_tags` field can hold key scores for quick filtering
Dockerfile
CHANGED
@@ -10,4 +10,4 @@ COPY --chown=user ./requirements.txt requirements.txt
 RUN pip install --no-cache-dir --upgrade -r requirements.txt

 COPY --chown=user . /app
-CMD ["
+CMD ["python", "app.py"]
PROJECT_STRUCTURE.md
ADDED
@@ -0,0 +1,87 @@
# PaperIndex Project Structure

## Directory Layout

```
paperindex/
├── app.py                  # main application entry point
├── cli.py                  # command-line tool entry point
├── src/                    # source code
│   ├── __init__.py
│   ├── app.py              # internal app entry point (deprecated)
│   ├── agents/             # AI agent modules
│   │   ├── __init__.py
│   │   ├── evaluator.py    # paper evaluator
│   │   └── prompt.py       # evaluation prompts
│   ├── database/           # database module
│   │   ├── __init__.py
│   │   ├── models.py       # database models and classes
│   │   └── papers_cache.db
│   ├── server/             # server module
│   │   ├── __init__.py
│   │   └── server.py       # FastAPI server
│   └── cli/                # command-line tool module
│       ├── __init__.py
│       └── cli.py          # CLI implementation
├── frontend/               # frontend files
│   ├── index.html
│   ├── paper.html
│   ├── main.js
│   ├── paper.js
│   └── styles.css
├── data/                   # data directory
│   └── pdfs/
├── workdir/                # working directory
├── requirements.txt        # Python dependencies
├── Dockerfile              # Docker configuration
└── README.md               # project README
```

## Modules

### `src/agents/`
AI agent module responsible for paper evaluation:
- `evaluator.py`: evaluates papers using LangGraph and the Claude API
- `prompt.py`: evaluation prompts and tool definitions

### `src/database/`
Database management module:
- `models.py`: the PapersDatabase class and database operations
- holds the SQLite database file
- handles paper caching and status management

### `src/server/`
FastAPI server module:
- `server.py`: the main web server implementation
- exposes the RESTful API
- handles frontend requests

### `src/cli/`
Command-line tool module:
- `cli.py`: standalone paper-evaluation command-line tool
- supports local PDFs and online URLs

## Usage

### Start the web app
```bash
python app.py
```

### Use the CLI
```bash
python cli.py <pdf_path_or_url> [options]
```

### Development mode
```bash
# Run from the src directory
cd src
python -m uvicorn server.server:app --reload --host 0.0.0.0 --port 8000
```

## Import Paths

- From the project root: `from src.agents.evaluator import Evaluator`
- From inside `src`: `from agents.evaluator import Evaluator`
- Use relative or absolute paths for imports between modules (see the sketch below)
agents/__init__.py
DELETED
@@ -1,2 +0,0 @@
app.py
CHANGED
@@ -1,13 +1,722 @@
 import os
 import sys
 from pathlib import Path

-# Add the current directory to Python path
-sys.path.insert(0, str(Path(__file__).parent))

-
-

 if __name__ == "__main__":
-
-
1 |
import os
|
2 |
import sys
|
3 |
+
from dotenv import load_dotenv
|
4 |
+
load_dotenv(verbose=True)
|
5 |
+
|
6 |
from pathlib import Path
|
7 |
+
import argparse
|
8 |
+
from mmengine import DictAction
|
9 |
+
from datetime import date, datetime, timedelta
|
10 |
+
from typing import Any, Dict, List, Optional
|
11 |
+
from fastapi.staticfiles import StaticFiles
|
12 |
+
from fastapi import FastAPI, HTTPException
|
13 |
+
from fastapi.middleware.cors import CORSMiddleware
|
14 |
+
from fastapi.responses import FileResponse
|
15 |
+
import httpx
|
16 |
+
from bs4 import BeautifulSoup
|
17 |
+
import json
|
18 |
+
import asyncio
|
19 |
+
import uvicorn
|
20 |
+
|
21 |
+
root = str(Path(__file__).parent)
|
22 |
+
sys.path.append(root)
|
23 |
+
|
24 |
+
from src.database import db
|
25 |
+
from src.logger import logger
|
26 |
+
from src.config import config
|
27 |
+
from src.crawl import HuggingFaceDailyPapers
|
28 |
+
from src.utils import assemble_project_path
|
29 |
+
from src.agents.evaluator import run_evaluation
|
30 |
+
|
31 |
+
app = FastAPI(title="PaperAgent")
|
32 |
+
|
33 |
+
# Local development: allow same-origin and localhost
|
34 |
+
app.add_middleware(
|
35 |
+
CORSMiddleware,
|
36 |
+
allow_origins=["*"],
|
37 |
+
allow_credentials=True,
|
38 |
+
allow_methods=["*"],
|
39 |
+
allow_headers=["*"],
|
40 |
+
)
|
41 |
+
|
42 |
+
def parse_args():
|
43 |
+
parser = argparse.ArgumentParser(description='main')
|
44 |
+
parser.add_argument("--config", default=os.path.join(root, "configs", "paper_agent.py"), help="config file path")
|
45 |
+
|
46 |
+
parser.add_argument(
|
47 |
+
'--cfg-options',
|
48 |
+
nargs='+',
|
49 |
+
action=DictAction,
|
50 |
+
help='override some settings in the used config, the key-value pair '
|
51 |
+
'in xxx=yyy format will be merged into config file. If the value to '
|
52 |
+
'be overwritten is a list, it should be like key="[a,b]" or key=a,b '
|
53 |
+
'It also allows nested list/tuple values, e.g. key="[(a,b),(c,d)]" '
|
54 |
+
'Note that the quotation marks are necessary and that no white space '
|
55 |
+
'is allowed.')
|
56 |
+
args = parser.parse_args()
|
57 |
+
return args
|
58 |
+
|
59 |
+
# Remove the find_next_available_date function since we're using HuggingFace's redirect mechanism
|
60 |
+
|
61 |
+
|
62 |
+
@app.get("/api/daily")
|
63 |
+
async def get_daily(date_str: Optional[str] = None, direction: Optional[str] = None) -> Dict[str, Any]:
|
64 |
+
target_date = date_str or date.today().isoformat()
|
65 |
+
|
66 |
+
# Initialize HuggingFaceDailyPapers
|
67 |
+
hf_daily = HuggingFaceDailyPapers()
|
68 |
+
|
69 |
+
# First, check if we have fresh cache for the requested date
|
70 |
+
cached_data = db.get_cached_papers(target_date)
|
71 |
+
if cached_data and db.is_cache_fresh(target_date):
|
72 |
+
print(f"Using cached data for {target_date}")
|
73 |
+
return {
|
74 |
+
"date": target_date,
|
75 |
+
"requested_date": target_date,
|
76 |
+
"cards": cached_data['cards'],
|
77 |
+
"fallback_used": False,
|
78 |
+
"cached": True,
|
79 |
+
"cached_at": cached_data['cached_at']
|
80 |
+
}
|
81 |
+
|
82 |
+
# Handle different navigation directions
|
83 |
+
if direction == "prev":
|
84 |
+
# For previous navigation, use redirect mechanism to find the most recent available date
|
85 |
+
try:
|
86 |
+
actual_date, html = await hf_daily.fetch_daily_html(target_date)
|
87 |
+
print(f"Previous navigation: fetched {actual_date} (requested {target_date})")
|
88 |
+
|
89 |
+
# If we got redirected to a different date, that's our fallback
|
90 |
+
if actual_date != target_date:
|
91 |
+
print(f"Redirected from {target_date} to {actual_date}")
|
92 |
+
|
93 |
+
# Check if the redirected date has fresh cache
|
94 |
+
cached_data = db.get_cached_papers(actual_date)
|
95 |
+
if cached_data and db.is_cache_fresh(actual_date):
|
96 |
+
print(f"Using cached data for redirected date {actual_date}")
|
97 |
+
return {
|
98 |
+
"date": actual_date,
|
99 |
+
"requested_date": target_date,
|
100 |
+
"cards": cached_data['cards'],
|
101 |
+
"fallback_used": True,
|
102 |
+
"cached": True,
|
103 |
+
"cached_at": cached_data['cached_at']
|
104 |
+
}
|
105 |
+
|
106 |
+
# Process the HTML we got
|
107 |
+
cards = hf_daily.parse_daily_cards(html)
|
108 |
+
enriched_cards = await enrich_cards(cards)
|
109 |
+
|
110 |
+
# Cache the results for the redirected date
|
111 |
+
db.cache_papers(actual_date, html, enriched_cards)
|
112 |
+
|
113 |
+
return {
|
114 |
+
"date": actual_date,
|
115 |
+
"requested_date": target_date,
|
116 |
+
"cards": enriched_cards,
|
117 |
+
"fallback_used": True,
|
118 |
+
"cached": False
|
119 |
+
}
|
120 |
+
|
121 |
+
# If we got the exact date we requested, process normally
|
122 |
+
cards = hf_daily.parse_daily_cards(html)
|
123 |
+
enriched_cards = await enrich_cards(cards)
|
124 |
+
db.cache_papers(actual_date, html, enriched_cards)
|
125 |
+
|
126 |
+
return {
|
127 |
+
"date": actual_date,
|
128 |
+
"requested_date": target_date,
|
129 |
+
"cards": enriched_cards,
|
130 |
+
"fallback_used": False,
|
131 |
+
"cached": False
|
132 |
+
}
|
133 |
+
|
134 |
+
except Exception as e:
|
135 |
+
print(f"Failed to fetch {target_date} for previous navigation: {e}")
|
136 |
+
# Fallback to cached data if available
|
137 |
+
cached_data = db.get_cached_papers(target_date)
|
138 |
+
if cached_data:
|
139 |
+
return {
|
140 |
+
"date": target_date,
|
141 |
+
"requested_date": target_date,
|
142 |
+
"cards": cached_data['cards'],
|
143 |
+
"fallback_used": False,
|
144 |
+
"cached": True,
|
145 |
+
"cached_at": cached_data['cached_at']
|
146 |
+
}
|
147 |
+
raise HTTPException(status_code=503, detail="Unable to fetch papers and no cache available")
|
148 |
+
|
149 |
+
elif direction == "next":
|
150 |
+
# For next navigation, we need to find the next available date
|
151 |
+
# First try the exact date
|
152 |
+
try:
|
153 |
+
actual_date, html = await hf_daily.fetch_daily_html(target_date)
|
154 |
+
print(f"Next navigation: fetched {actual_date} (requested {target_date})")
|
155 |
+
|
156 |
+
# If we got the exact date we requested, that's perfect
|
157 |
+
if actual_date == target_date:
|
158 |
+
cards = hf_daily.parse_daily_cards(html)
|
159 |
+
enriched_cards = await enrich_cards(cards)
|
160 |
+
db.cache_papers(actual_date, html, enriched_cards)
|
161 |
+
|
162 |
+
return {
|
163 |
+
"date": actual_date,
|
164 |
+
"requested_date": target_date,
|
165 |
+
"cards": enriched_cards,
|
166 |
+
"fallback_used": False,
|
167 |
+
"cached": False
|
168 |
+
}
|
169 |
+
|
170 |
+
# If we got redirected, it means the requested date doesn't exist
|
171 |
+
# We need to find the next available date by incrementing
|
172 |
+
print(f"Requested date {target_date} doesn't exist, searching for next available date")
|
173 |
+
|
174 |
+
# Try to find the next available date by incrementing
|
175 |
+
next_date = await find_next_available_date_forward(target_date)
|
176 |
+
if next_date:
|
177 |
+
cached_data = db.get_cached_papers(next_date)
|
178 |
+
if cached_data and db.is_cache_fresh(next_date):
|
179 |
+
print(f"Using cached data for next available date {next_date}")
|
180 |
+
return {
|
181 |
+
"date": next_date,
|
182 |
+
"requested_date": target_date,
|
183 |
+
"cards": cached_data['cards'],
|
184 |
+
"fallback_used": True,
|
185 |
+
"cached": True,
|
186 |
+
"cached_at": cached_data['cached_at']
|
187 |
+
}
|
188 |
+
|
189 |
+
# Fetch the next available date
|
190 |
+
actual_date, html = await hf_daily.fetch_daily_html(next_date)
|
191 |
+
cards = hf_daily.parse_daily_cards(html)
|
192 |
+
enriched_cards = await enrich_cards(cards)
|
193 |
+
db.cache_papers(actual_date, html, enriched_cards)
|
194 |
+
|
195 |
+
return {
|
196 |
+
"date": actual_date,
|
197 |
+
"requested_date": target_date,
|
198 |
+
"cards": enriched_cards,
|
199 |
+
"fallback_used": True,
|
200 |
+
"cached": False
|
201 |
+
}
|
202 |
+
|
203 |
+
# If no next date found, return empty
|
204 |
+
return {
|
205 |
+
"date": target_date,
|
206 |
+
"requested_date": target_date,
|
207 |
+
"cards": [],
|
208 |
+
"fallback_used": False,
|
209 |
+
"cached": False
|
210 |
+
}
|
211 |
+
|
212 |
+
except Exception as e:
|
213 |
+
print(f"Failed to fetch {target_date} for next navigation: {e}")
|
214 |
+
# Try to find next available date
|
215 |
+
next_date = await find_next_available_date_forward(target_date)
|
216 |
+
if next_date:
|
217 |
+
cached_data = db.get_cached_papers(next_date)
|
218 |
+
if cached_data:
|
219 |
+
return {
|
220 |
+
"date": next_date,
|
221 |
+
"requested_date": target_date,
|
222 |
+
"cards": cached_data['cards'],
|
223 |
+
"fallback_used": True,
|
224 |
+
"cached": True,
|
225 |
+
"cached_at": cached_data['cached_at']
|
226 |
+
}
|
227 |
+
|
228 |
+
# If no cache available, return error
|
229 |
+
raise HTTPException(status_code=503, detail="Unable to fetch papers and no cache available")
|
230 |
+
|
231 |
+
else:
|
232 |
+
# No direction specified, try the exact date first
|
233 |
+
try:
|
234 |
+
actual_date, html = await hf_daily.fetch_daily_html(target_date)
|
235 |
+
print(f"Direct fetch: fetched {actual_date} (requested {target_date})")
|
236 |
+
|
237 |
+
# If we got redirected, that's our fallback
|
238 |
+
if actual_date != target_date:
|
239 |
+
print(f"Redirected from {target_date} to {actual_date}")
|
240 |
+
|
241 |
+
# Check if the redirected date has fresh cache
|
242 |
+
cached_data = db.get_cached_papers(actual_date)
|
243 |
+
if cached_data and db.is_cache_fresh(actual_date):
|
244 |
+
print(f"Using cached data for redirected date {actual_date}")
|
245 |
+
return {
|
246 |
+
"date": actual_date,
|
247 |
+
"requested_date": target_date,
|
248 |
+
"cards": cached_data['cards'],
|
249 |
+
"fallback_used": True,
|
250 |
+
"cached": True,
|
251 |
+
"cached_at": cached_data['cached_at']
|
252 |
+
}
|
253 |
+
|
254 |
+
# Process the HTML we got
|
255 |
+
cards = hf_daily.parse_daily_cards(html)
|
256 |
+
enriched_cards = await enrich_cards(cards)
|
257 |
+
|
258 |
+
# Cache the results for the redirected date
|
259 |
+
db.cache_papers(actual_date, html, enriched_cards)
|
260 |
+
|
261 |
+
return {
|
262 |
+
"date": actual_date,
|
263 |
+
"requested_date": target_date,
|
264 |
+
"cards": enriched_cards,
|
265 |
+
"fallback_used": True,
|
266 |
+
"cached": False
|
267 |
+
}
|
268 |
+
|
269 |
+
# If we got the exact date we requested, process normally
|
270 |
+
cards = hf_daily.parse_daily_cards(html)
|
271 |
+
enriched_cards = await enrich_cards(cards)
|
272 |
+
db.cache_papers(actual_date, html, enriched_cards)
|
273 |
+
|
274 |
+
return {
|
275 |
+
"date": actual_date,
|
276 |
+
"requested_date": target_date,
|
277 |
+
"cards": enriched_cards,
|
278 |
+
"fallback_used": False,
|
279 |
+
"cached": False
|
280 |
+
}
|
281 |
+
|
282 |
+
except Exception as e:
|
283 |
+
print(f"Failed to fetch {target_date}: {e}")
|
284 |
+
|
285 |
+
# If everything fails, return cached data if available
|
286 |
+
cached_data = db.get_cached_papers(target_date)
|
287 |
+
if cached_data:
|
288 |
+
return {
|
289 |
+
"date": target_date,
|
290 |
+
"requested_date": target_date,
|
291 |
+
"cards": cached_data['cards'],
|
292 |
+
"fallback_used": False,
|
293 |
+
"cached": True,
|
294 |
+
"cached_at": cached_data['cached_at']
|
295 |
+
}
|
296 |
+
|
297 |
+
# If no cache available, return error
|
298 |
+
raise HTTPException(status_code=503, detail="Unable to fetch papers and no cache available")
|
299 |
+
|
300 |
+
|
301 |
+
async def find_next_available_date_forward(start_date: str, max_attempts: int = 30) -> Optional[str]:
|
302 |
+
"""Find the next available date by incrementing and checking"""
|
303 |
+
from datetime import datetime, timedelta
|
304 |
+
|
305 |
+
current_date = datetime.strptime(start_date, "%Y-%m-%d")
|
306 |
+
|
307 |
+
for i in range(max_attempts):
|
308 |
+
current_date += timedelta(days=1)
|
309 |
+
date_str = current_date.strftime("%Y-%m-%d")
|
310 |
+
|
311 |
+
# Check if we have cache for this date
|
312 |
+
cached_data = db.get_cached_papers(date_str)
|
313 |
+
if cached_data:
|
314 |
+
return date_str
|
315 |
+
|
316 |
+
# Try to fetch this date (but don't wait too long)
|
317 |
+
try:
|
318 |
+
import httpx
|
319 |
+
from src.crawl.huggingface_daily import HuggingFaceDailyPapers
|
320 |
+
|
321 |
+
hf_daily = HuggingFaceDailyPapers()
|
322 |
+
|
323 |
+
# Use a shorter timeout for quick checks
|
324 |
+
async with httpx.AsyncClient(timeout=5) as client:
|
325 |
+
actual_date, html = await hf_daily.fetch_daily_html(date_str)
|
326 |
+
if actual_date == date_str:
|
327 |
+
return date_str
|
328 |
+
|
329 |
+
except Exception as e:
|
330 |
+
print(f"Failed to check {date_str}: {e}")
|
331 |
+
continue
|
332 |
+
|
333 |
+
return None
|
334 |
+
|
335 |
+
|
336 |
+
async def enrich_cards(cards):
|
337 |
+
"""Enrich cards with paper details from database"""
|
338 |
+
for c in cards:
|
339 |
+
arxiv_id = c.get("arxiv_id")
|
340 |
+
if arxiv_id:
|
341 |
+
paper = db.get_paper(arxiv_id)
|
342 |
+
if paper:
|
343 |
+
# Add evaluation status
|
344 |
+
c["has_eval"] = paper.get('is_evaluated', False)
|
345 |
+
c["is_evaluated"] = paper.get('is_evaluated', False)
|
346 |
+
|
347 |
+
# Add evaluation details if available
|
348 |
+
if paper.get('is_evaluated'):
|
349 |
+
c["evaluation_score"] = paper.get('evaluation_score')
|
350 |
+
c["overall_score"] = paper.get('overall_score')
|
351 |
+
c["evaluation_date"] = paper.get('evaluation_date')
|
352 |
+
c["evaluation_tags"] = paper.get('evaluation_tags')
|
353 |
+
|
354 |
+
# Add paper details (use cached data as fallback)
|
355 |
+
if not c.get("title") and paper.get("title"):
|
356 |
+
c["title"] = paper["title"]
|
357 |
+
if not c.get("authors") and paper.get("authors"):
|
358 |
+
c["authors"] = paper["authors"]
|
359 |
+
if not c.get("abstract") and paper.get("abstract"):
|
360 |
+
c["abstract"] = paper["abstract"]
|
361 |
+
else:
|
362 |
+
c["has_eval"] = False
|
363 |
+
c["is_evaluated"] = False
|
364 |
+
else:
|
365 |
+
c["has_eval"] = False
|
366 |
+
c["is_evaluated"] = False
|
367 |
+
|
368 |
+
return cards
|
369 |
+
|
370 |
+
|
371 |
+
@app.get("/api/evals")
|
372 |
+
def list_evals() -> Dict[str, Any]:
|
373 |
+
# Get evaluated papers from database
|
374 |
+
evaluated_papers = db.get_evaluated_papers()
|
375 |
+
items: List[Dict[str, Any]] = []
|
376 |
+
|
377 |
+
for paper in evaluated_papers:
|
378 |
+
items.append({
|
379 |
+
"arxiv_id": paper['arxiv_id'],
|
380 |
+
"title": paper['title'],
|
381 |
+
"authors": paper['authors'],
|
382 |
+
"evaluation_date": paper['evaluation_date'],
|
383 |
+
"evaluation_score": paper['evaluation_score'],
|
384 |
+
"evaluation_tags": paper['evaluation_tags']
|
385 |
+
})
|
386 |
+
|
387 |
+
return {"count": len(items), "items": items}
|
388 |
+
|
389 |
+
|
390 |
+
@app.get("/api/has-eval/{paper_id}")
|
391 |
+
def has_eval(paper_id: str) -> Dict[str, bool]:
|
392 |
+
paper = db.get_paper(paper_id)
|
393 |
+
exists = paper is not None and paper.get('is_evaluated', False)
|
394 |
+
return {"exists": exists}
|
395 |
+
|
396 |
+
|
397 |
+
@app.get("/api/paper/{paper_id}")
|
398 |
+
def get_paper_details(paper_id: str) -> Dict[str, Any]:
|
399 |
+
"""Get detailed paper information from database"""
|
400 |
+
paper = db.get_paper(paper_id)
|
401 |
+
if not paper:
|
402 |
+
raise HTTPException(status_code=404, detail="Paper not found")
|
403 |
+
|
404 |
+
return {
|
405 |
+
"arxiv_id": paper.get('arxiv_id'),
|
406 |
+
"title": paper.get('title'),
|
407 |
+
"authors": paper.get('authors'),
|
408 |
+
"abstract": paper.get('abstract'),
|
409 |
+
"categories": paper.get('categories'),
|
410 |
+
"published_date": paper.get('published_date'),
|
411 |
+
"is_evaluated": paper.get('is_evaluated', False),
|
412 |
+
"evaluation_date": paper.get('evaluation_date'),
|
413 |
+
"created_at": paper.get('created_at'),
|
414 |
+
"updated_at": paper.get('updated_at')
|
415 |
+
}
|
416 |
+
|
417 |
+
|
418 |
+
@app.get("/api/paper-score/{paper_id}")
|
419 |
+
def get_paper_score(paper_id: str) -> Dict[str, Any]:
|
420 |
+
paper = db.get_paper(paper_id)
|
421 |
+
print(f"Paper data for {paper_id}:", paper)
|
422 |
+
|
423 |
+
if not paper or not paper.get('is_evaluated', False):
|
424 |
+
print(f"Paper {paper_id} not found or not evaluated")
|
425 |
+
return {"has_score": False}
|
426 |
+
|
427 |
+
# Calculate overall score as average of all dimensions (same as radar chart)
|
428 |
+
try:
|
429 |
+
evaluation_content = paper.get('evaluation_content')
|
430 |
+
if evaluation_content:
|
431 |
+
evaluation_json = json.loads(evaluation_content)
|
432 |
+
if 'scorecard' in evaluation_json:
|
433 |
+
scorecard = evaluation_json['scorecard']
|
434 |
+
values = [
|
435 |
+
scorecard.get('task_formalization', 0),
|
436 |
+
scorecard.get('data_resource_availability', 0),
|
437 |
+
scorecard.get('input_output_complexity', 0),
|
438 |
+
scorecard.get('real_world_interaction', 0),
|
439 |
+
scorecard.get('existing_ai_coverage', 0),
|
440 |
+
scorecard.get('human_originality', 0),
|
441 |
+
scorecard.get('safety_ethics', 0),
|
442 |
+
scorecard.get('technical_maturity_needed', 0),
|
443 |
+
scorecard.get('three_year_feasibility_pct', 0) / 25, # Convert percentage to 0-4 scale
|
444 |
+
scorecard.get('overall_automatability', 0)
|
445 |
+
]
|
446 |
+
valid_scores = [v for v in values if v > 0]
|
447 |
+
overall_score = sum(valid_scores) / len(valid_scores) if valid_scores else 0
|
448 |
+
print(f"Calculated overall score: {overall_score}")
|
449 |
+
|
450 |
+
return {
|
451 |
+
"has_score": True,
|
452 |
+
"score": overall_score,
|
453 |
+
"evaluation_date": paper.get('evaluation_date')
|
454 |
+
}
|
455 |
+
except Exception as e:
|
456 |
+
print(f"Error calculating overall score: {e}")
|
457 |
+
|
458 |
+
# Fallback to stored values
|
459 |
+
overall_score = paper.get('overall_score')
|
460 |
+
evaluation_score = paper.get('evaluation_score')
|
461 |
+
print(f"Fallback - Overall score: {overall_score}, Evaluation score: {evaluation_score}")
|
462 |
+
|
463 |
+
return {
|
464 |
+
"has_score": True,
|
465 |
+
"score": overall_score if overall_score is not None else evaluation_score,
|
466 |
+
"evaluation_date": paper.get('evaluation_date')
|
467 |
+
}
|
468 |
|
|
|
|
|
469 |
|
470 |
+
@app.get("/api/eval/{paper_id}")
|
471 |
+
def get_eval(paper_id: str) -> Any:
|
472 |
+
paper = db.get_paper(paper_id)
|
473 |
+
if not paper or not paper.get('is_evaluated', False):
|
474 |
+
raise HTTPException(status_code=404, detail="Evaluation not found")
|
475 |
+
|
476 |
+
# Parse evaluation content if it's JSON
|
477 |
+
evaluation_content = paper['evaluation_content']
|
478 |
+
try:
|
479 |
+
evaluation_json = json.loads(evaluation_content)
|
480 |
+
except json.JSONDecodeError:
|
481 |
+
# If not JSON, create a simple structure
|
482 |
+
evaluation_json = {
|
483 |
+
"evaluation_content": evaluation_content,
|
484 |
+
"arxiv_id": paper_id,
|
485 |
+
"evaluation_date": paper['evaluation_date'],
|
486 |
+
"evaluation_score": paper['evaluation_score'],
|
487 |
+
"evaluation_tags": paper['evaluation_tags']
|
488 |
+
}
|
489 |
+
|
490 |
+
return evaluation_json
|
491 |
+
|
492 |
+
|
493 |
+
@app.get("/api/available-dates")
|
494 |
+
def get_available_dates() -> Dict[str, Any]:
|
495 |
+
"""Get list of available dates in the cache"""
|
496 |
+
with db.get_connection() as conn:
|
497 |
+
cursor = conn.cursor()
|
498 |
+
cursor.execute('SELECT date_str FROM papers_cache ORDER BY date_str DESC LIMIT 30')
|
499 |
+
dates = [row['date_str'] for row in cursor.fetchall()]
|
500 |
+
|
501 |
+
return {
|
502 |
+
"available_dates": dates,
|
503 |
+
"count": len(dates)
|
504 |
+
}
|
505 |
+
|
506 |
+
|
507 |
+
@app.get("/api/cache/status")
|
508 |
+
def get_cache_status() -> Dict[str, Any]:
|
509 |
+
"""Get cache status and statistics"""
|
510 |
+
with db.get_connection() as conn:
|
511 |
+
cursor = conn.cursor()
|
512 |
+
|
513 |
+
# Get total cached dates
|
514 |
+
cursor.execute('SELECT COUNT(*) as count FROM papers_cache')
|
515 |
+
total_cached = cursor.fetchone()['count']
|
516 |
+
|
517 |
+
# Get latest cached date
|
518 |
+
cursor.execute('SELECT date_str, updated_at FROM latest_date WHERE id = 1')
|
519 |
+
latest_info = cursor.fetchone()
|
520 |
+
|
521 |
+
# Get cache age distribution
|
522 |
+
cursor.execute('''
|
523 |
+
SELECT
|
524 |
+
CASE
|
525 |
+
WHEN updated_at > datetime('now', '-1 hour') THEN '1 hour'
|
526 |
+
WHEN updated_at > datetime('now', '-24 hours') THEN '24 hours'
|
527 |
+
WHEN updated_at > datetime('now', '-7 days') THEN '7 days'
|
528 |
+
ELSE 'older'
|
529 |
+
END as age_group,
|
530 |
+
COUNT(*) as count
|
531 |
+
FROM papers_cache
|
532 |
+
GROUP BY age_group
|
533 |
+
''')
|
534 |
+
age_distribution = {row['age_group']: row['count'] for row in cursor.fetchall()}
|
535 |
+
|
536 |
+
return {
|
537 |
+
"total_cached_dates": total_cached,
|
538 |
+
"latest_cached_date": latest_info['date_str'] if latest_info else None,
|
539 |
+
"latest_updated": latest_info['updated_at'] if latest_info else None,
|
540 |
+
"age_distribution": age_distribution
|
541 |
+
}
|
542 |
+
|
543 |
+
|
544 |
+
@app.get("/api/papers/status")
|
545 |
+
def get_papers_status() -> Dict[str, Any]:
|
546 |
+
"""Get papers database status and statistics"""
|
547 |
+
papers_count = db.get_papers_count()
|
548 |
+
|
549 |
+
# Get recent evaluations
|
550 |
+
recent_papers = db.get_evaluated_papers()
|
551 |
+
recent_evaluations = []
|
552 |
+
for paper in recent_papers[:10]: # Get last 10 evaluations
|
553 |
+
recent_evaluations.append({
|
554 |
+
"arxiv_id": paper['arxiv_id'],
|
555 |
+
"title": paper['title'],
|
556 |
+
"evaluation_date": paper['evaluation_date'],
|
557 |
+
"evaluation_score": paper['evaluation_score']
|
558 |
+
})
|
559 |
+
|
560 |
+
return {
|
561 |
+
"papers_count": papers_count,
|
562 |
+
"recent_evaluations": recent_evaluations
|
563 |
+
}
|
564 |
+
|
565 |
+
|
566 |
+
@app.post("/api/papers/insert")
|
567 |
+
def insert_paper(paper_data: Dict[str, Any]) -> Dict[str, Any]:
|
568 |
+
"""Insert a new paper into the database"""
|
569 |
+
try:
|
570 |
+
required_fields = ['arxiv_id', 'title', 'authors']
|
571 |
+
for field in required_fields:
|
572 |
+
if field not in paper_data:
|
573 |
+
raise HTTPException(status_code=400, detail=f"Missing required field: {field}")
|
574 |
+
|
575 |
+
db.insert_paper(
|
576 |
+
arxiv_id=paper_data['arxiv_id'],
|
577 |
+
title=paper_data['title'],
|
578 |
+
authors=paper_data['authors'],
|
579 |
+
abstract=paper_data.get('abstract'),
|
580 |
+
categories=paper_data.get('categories'),
|
581 |
+
published_date=paper_data.get('published_date')
|
582 |
+
)
|
583 |
+
|
584 |
+
return {"message": f"Paper {paper_data['arxiv_id']} inserted successfully"}
|
585 |
+
except Exception as e:
|
586 |
+
raise HTTPException(status_code=500, detail=f"Failed to insert paper: {str(e)}")
|
587 |
+
|
588 |
+
|
589 |
+
@app.post("/api/papers/evaluate/{arxiv_id}")
|
590 |
+
async def evaluate_paper(arxiv_id: str) -> Dict[str, Any]:
|
591 |
+
"""Evaluate a paper by its arxiv_id"""
|
592 |
+
try:
|
593 |
+
# Check if paper exists in database
|
594 |
+
paper = db.get_paper(arxiv_id)
|
595 |
+
if not paper:
|
596 |
+
raise HTTPException(status_code=404, detail="Paper not found in database")
|
597 |
+
|
598 |
+
# Check if already evaluated
|
599 |
+
if paper.get('is_evaluated', False):
|
600 |
+
return {"message": f"Paper {arxiv_id} already evaluated", "status": "already_evaluated"}
|
601 |
+
|
602 |
+
# Create PDF URL from arxiv_id
|
603 |
+
pdf_url = f"https://arxiv.org/pdf/{arxiv_id}.pdf"
|
604 |
+
|
605 |
+
# Run evaluation in background task
|
606 |
+
async def run_eval():
|
607 |
+
try:
|
608 |
+
# Update paper status to "evaluating"
|
609 |
+
db.update_paper_status(arxiv_id, "evaluating")
|
610 |
+
logger.info(f"Started evaluation for {arxiv_id}")
|
611 |
+
|
612 |
+
result = await run_evaluation(
|
613 |
+
pdf_path=pdf_url,
|
614 |
+
arxiv_id=arxiv_id,
|
615 |
+
api_key=os.getenv("ANTHROPIC_API_KEY")
|
616 |
+
)
|
617 |
+
|
618 |
+
# Update paper status to "completed"
|
619 |
+
db.update_paper_status(arxiv_id, "completed")
|
620 |
+
logger.info(f"Evaluation completed for {arxiv_id}")
|
621 |
+
except Exception as e:
|
622 |
+
# Update paper status to "failed"
|
623 |
+
db.update_paper_status(arxiv_id, "failed")
|
624 |
+
logger.error(f"Evaluation failed for {arxiv_id}: {str(e)}")
|
625 |
+
|
626 |
+
# Start evaluation in background
|
627 |
+
asyncio.create_task(run_eval())
|
628 |
+
|
629 |
+
return {
|
630 |
+
"message": f"Evaluation started for paper {arxiv_id}",
|
631 |
+
"status": "started",
|
632 |
+
"pdf_url": pdf_url
|
633 |
+
}
|
634 |
+
except Exception as e:
|
635 |
+
raise HTTPException(status_code=500, detail=f"Failed to evaluate paper: {str(e)}")
|
636 |
+
|
637 |
+
|
638 |
+
@app.get("/api/papers/evaluate/{arxiv_id}/status")
|
639 |
+
def get_evaluation_status(arxiv_id: str) -> Dict[str, Any]:
|
640 |
+
"""Get evaluation status for a paper"""
|
641 |
+
try:
|
642 |
+
paper = db.get_paper(arxiv_id)
|
643 |
+
if not paper:
|
644 |
+
raise HTTPException(status_code=404, detail="Paper not found")
|
645 |
+
|
646 |
+
status = paper.get('evaluation_status', 'not_started')
|
647 |
+
is_evaluated = paper.get('is_evaluated', False)
|
648 |
+
|
649 |
+
return {
|
650 |
+
"arxiv_id": arxiv_id,
|
651 |
+
"status": status,
|
652 |
+
"is_evaluated": is_evaluated,
|
653 |
+
"evaluation_date": paper.get('evaluation_date'),
|
654 |
+
"evaluation_score": paper.get('evaluation_score')
|
655 |
+
}
|
656 |
+
except Exception as e:
|
657 |
+
raise HTTPException(status_code=500, detail=f"Failed to get evaluation status: {str(e)}")
|
658 |
+
|
659 |
+
|
660 |
+
@app.post("/api/cache/clear")
|
661 |
+
def clear_cache() -> Dict[str, str]:
|
662 |
+
"""Clear all cached data"""
|
663 |
+
with db.get_connection() as conn:
|
664 |
+
cursor = conn.cursor()
|
665 |
+
cursor.execute('DELETE FROM papers_cache')
|
666 |
+
conn.commit()
|
667 |
+
return {"message": "Cache cleared successfully"}
|
668 |
+
|
669 |
+
|
670 |
+
@app.post("/api/cache/refresh/{date_str}")
|
671 |
+
async def refresh_cache(date_str: str) -> Dict[str, Any]:
|
672 |
+
"""Force refresh cache for a specific date"""
|
673 |
+
try:
|
674 |
+
# Initialize HuggingFaceDailyPapers
|
675 |
+
hf_daily = HuggingFaceDailyPapers()
|
676 |
+
|
677 |
+
# Force fetch fresh data
|
678 |
+
actual_date, html = await hf_daily.fetch_daily_html(date_str)
|
679 |
+
cards = hf_daily.parse_daily_cards(html)
|
680 |
+
|
681 |
+
# Cache the results
|
682 |
+
db.cache_papers(actual_date, html, cards)
|
683 |
+
|
684 |
+
return {
|
685 |
+
"message": f"Cache refreshed for {actual_date}",
|
686 |
+
"cards_count": len(cards)
|
687 |
+
}
|
688 |
+
except Exception as e:
|
689 |
+
raise HTTPException(status_code=500, detail=f"Failed to refresh cache: {str(e)}")
|
690 |
+
|
691 |
+
|
692 |
+
@app.get("/styles.css")
|
693 |
+
async def get_styles():
|
694 |
+
"""Serve CSS with no-cache headers to prevent caching issues during development"""
|
695 |
+
response = FileResponse("frontend/styles.css", media_type="text/css")
|
696 |
+
response.headers["Cache-Control"] = "no-cache, no-store, must-revalidate"
|
697 |
+
response.headers["Pragma"] = "no-cache"
|
698 |
+
response.headers["Expires"] = "0"
|
699 |
+
return response
|
700 |
|
701 |
if __name__ == "__main__":
|
702 |
+
# Parse command line arguments
|
703 |
+
args = parse_args()
|
704 |
+
|
705 |
+
# Initialize the configuration
|
706 |
+
config.init_config(args.config, args)
|
707 |
+
|
708 |
+
# Initialize the logger
|
709 |
+
logger.init_logger(config=config)
|
710 |
+
logger.info(f"| Logger initialized at: {config.log_path}")
|
711 |
+
logger.info(f"| Config:\n{config.pretty_text}")
|
712 |
+
|
713 |
+
# Initialize the database
|
714 |
+
db.init_db(config=config)
|
715 |
+
logger.info(f"| Database initialized at: {config.db_path}")
|
716 |
+
|
717 |
+
# Load Frontend
|
718 |
+
os.makedirs(config.frontend_path, exist_ok=True)
|
719 |
+
app.mount("/", StaticFiles(directory=config.frontend_path, html=True), name="static")
|
720 |
+
logger.info(f"| Frontend initialized at: {config.frontend_path}")
|
721 |
+
|
722 |
+
uvicorn.run(app, host="0.0.0.0", port=8000)
|
cli.py
CHANGED
@@ -1,65 +1,12 @@
-import argparse
 import os
 import sys
-from
-from dotenv import load_dotenv
-load_dotenv()
-
-from agents.evaluator import run_evaluation
-
-console = Console()
-
-
-def build_parser() -> argparse.ArgumentParser:
-    parser = argparse.ArgumentParser(
-        description="AI Automation Evaluator (LangGraph) — evaluate a paper PDF or arXiv URL",
-        epilog="Example: python cli.py https://arxiv.org/pdf/2507.14683 -o /abs/path/save_dir/eval_2507_14683",
-    )
-    parser.add_argument("pdf", help="Local PDF absolute path or URL (e.g., https://arxiv.org/pdf/xxxx)")
-    parser.add_argument(
-        "-o",
-        "--output-prefix",
-        dest="output_prefix",
-        help="Output file prefix (if provided, will save as <prefix>_YYYYMMDD_HHMMSS.md)",
-    )
-    parser.add_argument(
-        "--api-key",
-        dest="api_key",
-        default=os.getenv("ANTHROPIC_API_KEY"),
-        help="Anthropic API key (overrides ANTHROPIC_API_KEY env)",
-    )
-    return parser
-
-
-def main(argv: Optional[list[str]] = None):
-    parser = build_parser()
-    args = parser.parse_args(argv)
-
-    pdf_path: str = args.pdf
-    output_prefix: Optional[str] = args.output_prefix
-    api_key: Optional[str] = args.api_key or os.getenv("ANTHROPIC_API_KEY")
-
-    if not api_key:
-        console.print("[yellow]Warning:[/yellow] ANTHROPIC_API_KEY not set and --api-key not provided.", highlight=False)
-
-    console.print(Panel.fit(f"Evaluating: {pdf_path}"))
-    try:
-        result = run_evaluation(pdf_path=pdf_path, output_file=output_prefix, api_key=api_key)
-        console.print("\n[bold green]Done.[/bold green]\n")
-        if output_prefix:
-            console.print(f"Saved to prefix: {output_prefix}_<timestamp>.md")
-        else:
-            console.print(result)
-    except Exception as e:
-        console.print(f"[bold red]Error:[/bold red] {e}")
-        sys.exit(2)
+from pathlib import Path
+
+# Add the src directory to Python path
+sys.path.insert(0, str(Path(__file__).parent / "src"))
+
+# Import and run the CLI
+from src.cli.cli import main

 if __name__ == "__main__":
     main()
-
-
configs/paper_agent.py
CHANGED
@@ -2,3 +2,8 @@ workdir = "workdir"
 tag = "paper_agent"
 exp_path = f"{workdir}/{tag}"
 log_path = "agent.log"
+db_path = "papers_cache.db"
+frontend_path = "frontend"
+
+model_id = "claude-sonnet-4-20250514"
+version = "0.1.0"
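For context, a minimal sketch of how a Python config file like this can be loaded and overridden with `mmengine` (whose `DictAction` is what `app.py` uses for `--cfg-options`); the project's own wrapper in `src/config` may differ, so treat the exact loading path as an assumption:

```python
from mmengine import Config

# Load the experiment configuration from the Python config file.
cfg = Config.fromfile("configs/paper_agent.py")
print(cfg.db_path)    # papers_cache.db
print(cfg.model_id)   # claude-sonnet-4-20250514

# Apply key=value overrides, mirroring what --cfg-options feeds in.
cfg.merge_from_dict({"db_path": "workdir/paper_agent/papers_cache.db"})
```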
frontend/index.html
CHANGED
@@ -3,7 +3,7 @@
 <head>
   <meta charset="utf-8" />
   <meta name="viewport" content="width=device-width, initial-scale=1" />
-  <title>
+  <title>AIR Index — Daily Papers</title>
   <link rel="stylesheet" href="/styles.css?v=9" />
   <link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.0/css/all.min.css" rel="stylesheet">
 </head>
@@ -14,7 +14,7 @@
 <div class="nav-left">
   <div class="logo">
     <i class="fas fa-book-open"></i>
-    <span>
+    <span>AIR Index</span>
   </div>
 </div>
@@ -40,7 +40,7 @@
 <div class="header-container">
   <div class="header-left">
     <h1>Daily Papers</h1>
-    <p class="subtitle">by
+    <p class="subtitle">by AI Realizability Index Agent</p>
   </div>

   <div class="header-center">
@@ -81,6 +81,15 @@
   </div>
 </main>

+  <!-- Loading Overlay -->
+  <div id="loadingOverlay" class="loading-overlay">
+    <div class="loading-spinner">
+      <div class="spinner"></div>
+      <div class="loading-text">Loading papers...</div>
+      <div class="loading-subtext">Fetching data from Hugging Face</div>
+    </div>
+  </div>
+
 <script src="/main.js"></script>
 </body>
 </html>
frontend/main.js
CHANGED
@@ -36,6 +36,7 @@ class DateManager {
|
|
36 |
constructor() {
|
37 |
// Start with today's date, but it will be updated when we get the actual available date
|
38 |
this.currentDate = new Date();
|
|
|
39 |
this.init();
|
40 |
}
|
41 |
|
@@ -44,6 +45,10 @@ class DateManager {
|
|
44 |
this.bindEvents();
|
45 |
}
|
46 |
|
|
|
|
|
|
|
|
|
47 |
formatDate(date) {
|
48 |
const options = {
|
49 |
year: 'numeric',
|
@@ -53,27 +58,121 @@ class DateManager {
|
|
53 |
return date.toLocaleDateString('en-US', options);
|
54 |
}
|
55 |
|
56 |
-
updateDateDisplay() {
|
57 |
const dateDisplay = document.getElementById('dateDisplay');
|
58 |
-
|
|
|
|
|
|
|
|
|
|
|
59 |
}
|
60 |
|
61 |
-
|
62 |
-
|
63 |
-
|
64 |
-
|
65 |
-
|
66 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
67 |
}
|
68 |
|
69 |
bindEvents() {
|
70 |
-
document.getElementById('prevDate')
|
71 |
-
|
72 |
-
|
|
|
|
|
|
|
|
|
|
|
73 |
|
74 |
-
|
75 |
-
|
76 |
-
|
|
|
|
|
77 |
}
|
78 |
|
79 |
getDateString() {
|
@@ -258,13 +357,24 @@ class PaperCardRenderer {
|
|
258 |
|
259 |
${paper.arxiv_id ? `
|
260 |
<div class="card-actions">
|
261 |
-
<
|
262 |
-
<i class="fas fa-
|
263 |
-
|
|
|
|
|
264 |
</div>
|
265 |
` : ''}
|
266 |
`;
|
267 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
268 |
return card;
|
269 |
}
|
270 |
|
@@ -287,6 +397,326 @@ class PaperCardRenderer {
|
|
287 |
this.cardsContainer.appendChild(card);
|
288 |
});
|
289 |
}
|
290 |
}
|
291 |
|
292 |
// Main Application
|
@@ -294,6 +724,7 @@ class PaperIndexApp {
|
|
294 |
constructor() {
|
295 |
this.themeManager = new ThemeManager();
|
296 |
this.dateManager = new DateManager();
|
|
|
297 |
this.searchManager = new SearchManager();
|
298 |
this.cardRenderer = new PaperCardRenderer();
|
299 |
this.init();
|
@@ -301,6 +732,7 @@ class PaperIndexApp {
|
|
301 |
|
302 |
init() {
|
303 |
this.bindEvents();
|
|
|
304 |
this.loadDaily();
|
305 |
}
|
306 |
|
@@ -319,11 +751,17 @@ class PaperIndexApp {
|
|
319 |
});
|
320 |
}
|
321 |
|
322 |
-
async loadDaily() {
|
323 |
const dateStr = this.dateManager.getDateString();
|
324 |
|
325 |
try {
|
326 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
327 |
|
328 |
if (!response.ok) {
|
329 |
throw new Error('Failed to load daily papers');
|
@@ -335,18 +773,24 @@ class PaperIndexApp {
|
|
335 |
requested_date: data.requested_date,
|
336 |
actual_date: data.date,
|
337 |
fallback_used: data.fallback_used,
|
338 |
-
cards_count: data.cards?.length
|
|
|
339 |
});
|
340 |
|
341 |
-
//
|
342 |
if (data.date && data.requested_date && data.date !== data.requested_date) {
|
343 |
-
console.log('
|
344 |
-
|
345 |
-
|
|
|
|
|
346 |
this.dateManager.updateDateDisplay();
|
347 |
|
348 |
-
// Show a notification about the
|
349 |
-
this.
|
|
|
|
|
|
|
350 |
}
|
351 |
|
352 |
// Show cache status if available
|
@@ -371,83 +815,173 @@ class PaperIndexApp {
|
|
371 |
</a>
|
372 |
</div>
|
373 |
`;
|
|
|
|
|
|
|
|
|
374 |
}
|
375 |
}
|
376 |
|
377 |
-
showFallbackNotification
|
378 |
-
|
379 |
-
|
380 |
-
|
381 |
-
|
382 |
-
|
383 |
-
|
384 |
-
|
385 |
-
|
386 |
-
|
387 |
-
|
388 |
-
box-shadow: var(--shadow-lg);
|
389 |
-
z-index: 1000;
|
390 |
-
max-width: 300px;
|
391 |
-
color: var(--text-primary);
|
392 |
-
`;
|
393 |
-
|
394 |
-
notification.innerHTML = `
|
395 |
-
<div style="display: flex; align-items: center; gap: 8px; margin-bottom: 8px;">
|
396 |
-
<i class="fas fa-info-circle" style="color: var(--accent-primary);"></i>
|
397 |
-
<strong>Date Updated</strong>
|
398 |
-
</div>
|
399 |
-
<p style="margin: 0; font-size: 14px; color: var(--text-secondary);">
|
400 |
-
Papers for ${requestedDate} not available. Showing latest available: ${actualDate}
|
401 |
-
</p>
|
402 |
-
`;
|
403 |
|
404 |
-
|
405 |
-
|
406 |
-
|
407 |
-
setTimeout(() => {
|
408 |
if (notification.parentNode) {
|
409 |
notification.parentNode.removeChild(notification);
|
410 |
}
|
411 |
-
}
|
412 |
-
|
413 |
-
|
414 |
-
showCacheNotification(cachedAt) {
|
415 |
-
// Create a temporary notification
|
416 |
const notification = document.createElement('div');
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
417 |
notification.style.cssText = `
|
418 |
position: fixed;
|
419 |
top: 20px;
|
420 |
right: 20px;
|
421 |
background: var(--bg-primary);
|
422 |
-
border: 1px solid
|
423 |
border-radius: 8px;
|
424 |
padding: 16px;
|
425 |
box-shadow: var(--shadow-lg);
|
426 |
z-index: 1000;
|
427 |
-
max-width:
|
428 |
color: var(--text-primary);
|
|
|
429 |
`;
|
430 |
|
431 |
-
const cacheTime = new Date(cachedAt).toLocaleString();
|
432 |
-
|
433 |
notification.innerHTML = `
|
434 |
-
<div style="display: flex; align-items:
|
435 |
-
<i class="
|
436 |
-
<
|
|
|
|
|
|
|
437 |
</div>
|
438 |
-
<p style="margin: 0; font-size: 14px; color: var(--text-secondary);">
|
439 |
-
Showing cached data from ${cacheTime}
|
440 |
-
</p>
|
441 |
`;
|
442 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
443 |
document.body.appendChild(notification);
|
444 |
|
445 |
-
// Remove notification after
|
446 |
-
|
447 |
-
|
448 |
-
notification.parentNode
|
449 |
-
|
450 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
451 |
}
|
452 |
}
|
453 |
|
|
|
36 |
constructor() {
|
37 |
// Start with today's date, but it will be updated when we get the actual available date
|
38 |
this.currentDate = new Date();
|
39 |
+
this.app = null; // Reference to the main app
|
40 |
this.init();
|
41 |
}
|
42 |
|
|
|
45 |
this.bindEvents();
|
46 |
}
|
47 |
|
48 |
+
setApp(app) {
|
49 |
+
this.app = app;
|
50 |
+
}
|
51 |
+
|
52 |
formatDate(date) {
|
53 |
const options = {
|
54 |
year: 'numeric',
|
|
|
58 |
return date.toLocaleDateString('en-US', options);
|
59 |
}
|
60 |
|
61 |
+
async updateDateDisplay() {
|
62 |
const dateDisplay = document.getElementById('dateDisplay');
|
63 |
+
if (dateDisplay) {
|
64 |
+
dateDisplay.textContent = this.formatDate(this.currentDate);
|
65 |
+
}
|
66 |
+
|
67 |
+
// Update button states based on available dates
|
68 |
+
await this.updateButtonStates();
|
69 |
}
|
70 |
|
71 |
+
async updateButtonStates() {
|
72 |
+
try {
|
73 |
+
// Check if current date is in the future
|
74 |
+
const today = new Date();
|
75 |
+
today.setHours(23, 59, 59, 999);
|
76 |
+
|
77 |
+
if (this.currentDate > today) {
|
78 |
+
this.setButtonState('nextDate', false);
|
79 |
+
this.setButtonState('prevDate', true);
|
80 |
+
return;
|
81 |
+
}
|
82 |
+
|
83 |
+
// For previous button, always allow going back (unless it's too far in the past)
|
84 |
+
const minDate = new Date('2020-01-01'); // Reasonable minimum date
|
85 |
+
this.setButtonState('prevDate', this.currentDate > minDate);
|
86 |
+
|
87 |
+
// For next button, only disable if it's today or in the future
|
88 |
+
this.setButtonState('nextDate', this.currentDate < today);
|
89 |
+
|
90 |
+
} catch (error) {
|
91 |
+
console.error('Error updating button states:', error);
|
92 |
+
}
|
93 |
+
}
|
94 |
+
|
95 |
+
setButtonState(buttonId, enabled) {
|
96 |
+
const button = document.getElementById(buttonId);
|
97 |
+
if (button) {
|
98 |
+
button.disabled = !enabled;
|
99 |
+
button.style.opacity = enabled ? '1' : '0.5';
|
100 |
+
button.style.cursor = enabled ? 'pointer' : 'not-allowed';
|
101 |
+
}
|
102 |
+
}
|
103 |
+
|
104 |
+
async navigateDate(direction) {
|
105 |
+
try {
|
106 |
+
// Calculate target date first
|
107 |
+
const newDate = new Date(this.currentDate);
|
108 |
+
newDate.setDate(newDate.getDate() + direction);
|
109 |
+
|
110 |
+
// Check if the new date is in the future
|
111 |
+
const today = new Date();
|
112 |
+
today.setHours(23, 59, 59, 999); // End of today
|
113 |
+
|
114 |
+
if (newDate > today) {
|
115 |
+
this.showDateLimitNotification('Cannot navigate to future dates');
|
116 |
+
return;
|
117 |
+
}
|
118 |
+
|
119 |
+
// Update current date
|
120 |
+
this.currentDate = newDate;
|
121 |
+
this.updateDateDisplay();
|
122 |
+
|
123 |
+
// Show loading animation
|
124 |
+
const dateStr = this.formatDate(this.currentDate);
|
125 |
+
const direction_str = direction > 0 ? "next" : "prev";
|
126 |
+
this.showLoading(`Loading papers for ${dateStr}...`, `Navigating ${direction_str} from Hugging Face`);
|
127 |
+
|
128 |
+
// Try to load the target date with direction
|
129 |
+
if (this.app && this.app.loadDaily) {
|
130 |
+
await this.app.loadDaily(direction_str);
|
131 |
+
}
|
132 |
+
|
133 |
+
} catch (error) {
|
134 |
+
console.error('Error navigating date:', error);
|
135 |
+
this.showDateLimitNotification('Error loading date');
|
136 |
+
}
|
137 |
+
}
|
138 |
+
|
139 |
+
// Removed old notification functions - now using unified notification system
|
140 |
+
|
141 |
+
showLoading(message = 'Loading papers...', submessage = 'Fetching data from Hugging Face') {
|
142 |
+
const loadingOverlay = document.getElementById('loadingOverlay');
|
143 |
+
if (loadingOverlay) {
|
144 |
+
const loadingText = loadingOverlay.querySelector('.loading-text');
|
145 |
+
const loadingSubtext = loadingOverlay.querySelector('.loading-subtext');
|
146 |
+
|
147 |
+
if (loadingText) loadingText.textContent = message;
|
148 |
+
if (loadingSubtext) loadingSubtext.textContent = submessage;
|
149 |
+
|
150 |
+
loadingOverlay.classList.add('show');
|
151 |
+
}
|
152 |
+
}
|
153 |
+
|
154 |
+
hideLoading() {
|
155 |
+
const loadingOverlay = document.getElementById('loadingOverlay');
|
156 |
+
if (loadingOverlay) {
|
157 |
+
loadingOverlay.classList.remove('show');
|
158 |
+
}
|
159 |
}
|
160 |
|
161 |
bindEvents() {
|
162 |
+
const prevBtn = document.getElementById('prevDate');
|
163 |
+
const nextBtn = document.getElementById('nextDate');
|
164 |
+
|
165 |
+
if (prevBtn) {
|
166 |
+
prevBtn.addEventListener('click', async () => {
|
167 |
+
await this.navigateDate(-1);
|
168 |
+
});
|
169 |
+
}
|
170 |
|
171 |
+
if (nextBtn) {
|
172 |
+
nextBtn.addEventListener('click', async () => {
|
173 |
+
await this.navigateDate(1);
|
174 |
+
});
|
175 |
+
}
|
176 |
}
|
177 |
|
178 |
getDateString() {
|
|
|
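The direction hint produced by `DateManager.navigateDate()` ultimately reaches the backend through `loadDaily()`, whose changes appear further down in this file. A minimal sketch of that request follows; the URL shape and response fields are taken from the `loadDaily()` changes below, while `fetchDaily` itself is an illustrative helper and not part of this diff:

```javascript
// Illustrative helper (not in the codebase): builds the same /api/daily request
// that loadDaily() issues, including the optional 'prev'/'next' direction hint.
async function fetchDaily(dateStr, direction = null) {
  let url = `/api/daily?date_str=${encodeURIComponent(dateStr)}`;
  if (direction) {
    url += `&direction=${direction}`; // 'prev' or 'next', set by navigateDate()
  }
  const response = await fetch(url);
  if (!response.ok) {
    throw new Error('Failed to load daily papers');
  }
  return response.json(); // { date, requested_date, fallback_used, cards, ... }
}
```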
357 |
|
358 |
${paper.arxiv_id ? `
|
359 |
<div class="card-actions">
|
360 |
+
<button class="eval-button" data-arxiv-id="${paper.arxiv_id}" data-paper-title="${encodeURIComponent(title)}">
|
361 |
+
<i class="fas fa-spinner fa-spin" style="display: none;"></i>
|
362 |
+
<i class="fas fa-chart-line eval-icon"></i>
|
363 |
+
<span class="eval-text">Checking...</span>
|
364 |
+
</button>
|
365 |
</div>
|
366 |
` : ''}
|
367 |
`;
|
368 |
|
369 |
+
// Check evaluation status for this paper
|
370 |
+
if (paper.arxiv_id) {
|
371 |
+
this.checkEvaluationStatus(card, paper.arxiv_id);
|
372 |
+
|
373 |
+
// Store paper data in card for score checking
|
374 |
+
card.setAttribute('data-paper-data', JSON.stringify(paper));
|
375 |
+
this.checkPaperScore(card, paper.arxiv_id);
|
376 |
+
}
|
377 |
+
|
378 |
return card;
|
379 |
}
|
380 |
|
|
|
397 |
this.cardsContainer.appendChild(card);
|
398 |
});
|
399 |
}
|
400 |
+
|
401 |
+
async checkEvaluationStatus(card, arxivId) {
|
402 |
+
const button = card.querySelector('.eval-button');
|
403 |
+
const spinner = button.querySelector('.fa-spinner');
|
404 |
+
const evalIcon = button.querySelector('.eval-icon');
|
405 |
+
const evalText = button.querySelector('.eval-text');
|
406 |
+
|
407 |
+
try {
|
408 |
+
const response = await fetch(`/api/has-eval/${encodeURIComponent(arxivId)}`);
|
409 |
+
const data = await response.json();
|
410 |
+
|
411 |
+
if (data.exists) {
|
412 |
+
// Paper has evaluation - show evaluation button
|
413 |
+
evalIcon.className = 'fas fa-chart-line eval-icon';
|
414 |
+
evalText.textContent = 'Evaluation';
|
415 |
+
button.className = 'eval-button evaluation-state';
|
416 |
+
button.onclick = () => {
|
417 |
+
window.location.href = `/paper.html?id=${encodeURIComponent(arxivId)}`;
|
418 |
+
};
|
419 |
+
} else {
|
420 |
+
// Paper doesn't have evaluation - show evaluate button
|
421 |
+
evalIcon.className = 'fas fa-play eval-icon';
|
422 |
+
evalText.textContent = 'Evaluate';
|
423 |
+
button.className = 'eval-button evaluate-state';
|
424 |
+
button.onclick = () => {
|
425 |
+
this.evaluatePaper(button, arxivId);
|
426 |
+
};
|
427 |
+
}
|
428 |
+
} catch (error) {
|
429 |
+
console.error('Error checking evaluation status:', error);
|
430 |
+
evalIcon.className = 'fas fa-exclamation-triangle eval-icon';
|
431 |
+
evalText.textContent = 'Error';
|
432 |
+
button.className = 'eval-button error-state';
|
433 |
+
}
|
434 |
+
}
|
435 |
+
|
436 |
+
async checkPaperScore(card, arxivId) {
|
437 |
+
try {
|
438 |
+
// First check if the card already has score data from the API response
|
439 |
+
const cardData = card.getAttribute('data-paper-data');
|
440 |
+
if (cardData) {
|
441 |
+
const paperData = JSON.parse(cardData);
|
442 |
+
if (paperData.overall_score !== null && paperData.overall_score !== undefined) {
|
443 |
+
this.displayScoreBadge(card, paperData.overall_score, arxivId);
|
444 |
+
return;
|
445 |
+
}
|
446 |
+
}
|
447 |
+
|
448 |
+
// Fallback to API call if no score data in card
|
449 |
+
const response = await fetch(`/api/paper-score/${encodeURIComponent(arxivId)}`);
|
450 |
+
const data = await response.json();
|
451 |
+
|
452 |
+
console.log(`Paper score data for ${arxivId}:`, data);
|
453 |
+
|
454 |
+
if (data.has_score && data.score !== null) {
|
455 |
+
this.displayScoreBadge(card, data.score, arxivId);
|
456 |
+
}
|
457 |
+
} catch (error) {
|
458 |
+
console.error('Error checking paper score:', error);
|
459 |
+
}
|
460 |
+
}
|
461 |
+
|
462 |
+
displayScoreBadge(card, score, arxivId) {
|
463 |
+
// Create score badge
|
464 |
+
const scoreBadge = document.createElement('div');
|
465 |
+
scoreBadge.className = 'score-badge';
|
466 |
+
const formattedScore = parseFloat(score).toFixed(1);
|
467 |
+
|
468 |
+
// Determine score color based on value (0-4 scale)
|
469 |
+
const scoreValue = parseFloat(score);
|
470 |
+
let scoreColor = 'var(--accent-primary)';
|
471 |
+
if (scoreValue >= 3.0) {
|
472 |
+
scoreColor = 'var(--accent-success)';
|
473 |
+
} else if (scoreValue >= 2.0) {
|
474 |
+
scoreColor = 'var(--accent-warning)';
|
475 |
+
} else if (scoreValue < 1.0) {
|
476 |
+
scoreColor = 'var(--accent-danger)';
|
477 |
+
}
|
478 |
+
|
479 |
+
scoreBadge.style.background = `linear-gradient(135deg, ${scoreColor}, ${scoreColor}dd)`;
|
480 |
+
scoreBadge.innerHTML = `
|
481 |
+
<span class="score-number">${formattedScore}</span>
|
482 |
+
<span class="score-label">Overall</span>
|
483 |
+
`;
|
484 |
+
|
485 |
+
// Add click handler to navigate to evaluation page
|
486 |
+
scoreBadge.onclick = () => {
|
487 |
+
window.location.href = `/paper.html?id=${encodeURIComponent(arxivId)}`;
|
488 |
+
};
|
489 |
+
|
490 |
+
// Add to card with animation
|
491 |
+
card.appendChild(scoreBadge);
|
492 |
+
scoreBadge.style.opacity = '0';
|
493 |
+
scoreBadge.style.transform = 'scale(0.8) translateY(10px)';
|
494 |
+
|
495 |
+
// Animate in
|
496 |
+
setTimeout(() => {
|
497 |
+
scoreBadge.style.transition = 'all 0.3s ease';
|
498 |
+
scoreBadge.style.opacity = '1';
|
499 |
+
scoreBadge.style.transform = 'scale(1) translateY(0)';
|
500 |
+
}, 100);
|
501 |
+
}
|
502 |
+
|
503 |
+
async evaluatePaper(button, arxivId) {
|
504 |
+
const spinner = button.querySelector('.fa-spinner');
|
505 |
+
const evalIcon = button.querySelector('.eval-icon');
|
506 |
+
const evalText = button.querySelector('.eval-text');
|
507 |
+
const paperTitle = button.getAttribute('data-paper-title');
|
508 |
+
|
509 |
+
// Show loading state
|
510 |
+
spinner.style.display = 'inline-block';
|
511 |
+
evalIcon.style.display = 'none';
|
512 |
+
evalText.textContent = 'Evaluating...';
|
513 |
+
button.className = 'eval-button evaluating-state';
|
514 |
+
button.disabled = true;
|
515 |
+
|
516 |
+
try {
|
517 |
+
// First, check if paper exists in database, if not, insert it
|
518 |
+
const paperData = {
|
519 |
+
arxiv_id: arxivId,
|
520 |
+
title: decodeURIComponent(paperTitle),
|
521 |
+
authors: "Unknown Authors", // We don't have authors in the card data
|
522 |
+
abstract: "No abstract available",
|
523 |
+
categories: "Unknown",
|
524 |
+
published_date: new Date().toISOString().split('T')[0]
|
525 |
+
};
|
526 |
+
|
527 |
+
// Try to insert the paper (this will work even if it already exists)
|
528 |
+
await fetch('/api/papers/insert', {
|
529 |
+
method: 'POST',
|
530 |
+
headers: {
|
531 |
+
'Content-Type': 'application/json',
|
532 |
+
},
|
533 |
+
body: JSON.stringify(paperData)
|
534 |
+
});
|
535 |
+
|
536 |
+
// Start evaluation
|
537 |
+
const response = await fetch(`/api/papers/evaluate/${encodeURIComponent(arxivId)}`, {
|
538 |
+
method: 'POST'
|
539 |
+
});
|
540 |
+
|
541 |
+
if (response.ok) {
|
542 |
+
const result = await response.json();
|
543 |
+
|
544 |
+
if (result.status === 'already_evaluated') {
|
545 |
+
// Paper was already evaluated, redirect to evaluation page
|
546 |
+
window.location.href = `/paper.html?id=${encodeURIComponent(arxivId)}`;
|
547 |
+
} else {
|
548 |
+
// Evaluation started, show progress and poll for status
|
549 |
+
evalText.textContent = 'Started...';
|
550 |
+
button.className = 'eval-button started-state';
|
551 |
+
|
552 |
+
// Start polling for status
|
553 |
+
this.pollEvaluationStatus(button, arxivId);
|
554 |
+
}
|
555 |
+
} else {
|
556 |
+
throw new Error('Failed to start evaluation');
|
557 |
+
}
|
558 |
+
} catch (error) {
|
559 |
+
console.error('Error evaluating paper:', error);
|
560 |
+
evalIcon.className = 'fas fa-exclamation-triangle eval-icon';
|
561 |
+
evalText.textContent = 'Error';
|
562 |
+
button.className = 'eval-button error-state';
|
563 |
+
button.disabled = false;
|
564 |
+
} finally {
|
565 |
+
spinner.style.display = 'none';
|
566 |
+
evalIcon.style.display = 'inline-block';
|
567 |
+
}
|
568 |
+
}
|
569 |
+
|
570 |
+
async pollEvaluationStatus(button, arxivId) {
|
571 |
+
const evalIcon = button.querySelector('.eval-icon');
|
572 |
+
const evalText = button.querySelector('.eval-text');
|
573 |
+
let pollCount = 0;
|
574 |
+
const maxPolls = 60; // Poll for up to 5 minutes (5s intervals)
|
575 |
+
|
576 |
+
// Show log message
|
577 |
+
this.showLogMessage(`Started evaluation for paper ${arxivId}`, 'info');
|
578 |
+
|
579 |
+
const poll = async () => {
|
580 |
+
try {
|
581 |
+
const response = await fetch(`/api/papers/evaluate/${encodeURIComponent(arxivId)}/status`);
|
582 |
+
if (response.ok) {
|
583 |
+
const status = await response.json();
|
584 |
+
|
585 |
+
switch (status.status) {
|
586 |
+
case 'evaluating':
|
587 |
+
evalText.textContent = `Evaluating... (${pollCount * 5}s)`;
|
588 |
+
evalIcon.className = 'fas fa-spinner fa-spin eval-icon';
|
589 |
+
button.className = 'eval-button evaluating-state';
|
590 |
+
this.showLogMessage(`Evaluating paper ${arxivId}... (${pollCount * 5}s)`, 'info');
|
591 |
+
break;
|
592 |
+
|
593 |
+
case 'completed':
|
594 |
+
evalIcon.className = 'fas fa-check eval-icon';
|
595 |
+
evalText.textContent = 'Completed';
|
596 |
+
button.className = 'eval-button evaluation-state';
|
597 |
+
button.onclick = () => {
|
598 |
+
window.location.href = `/paper.html?id=${encodeURIComponent(arxivId)}`;
|
599 |
+
};
|
600 |
+
this.showLogMessage(`Evaluation completed for paper ${arxivId}`, 'success');
|
601 |
+
|
602 |
+
// Add score badge after completion
|
603 |
+
this.checkPaperScore(button.closest('.hf-paper-card'), arxivId);
|
604 |
+
|
605 |
+
return; // Stop polling
|
606 |
+
|
607 |
+
case 'failed':
|
608 |
+
evalIcon.className = 'fas fa-exclamation-triangle eval-icon';
|
609 |
+
evalText.textContent = 'Failed';
|
610 |
+
button.className = 'eval-button error-state';
|
611 |
+
button.disabled = false;
|
612 |
+
this.showLogMessage(`Evaluation failed for paper ${arxivId}`, 'error');
|
613 |
+
return; // Stop polling
|
614 |
+
|
615 |
+
default:
|
616 |
+
evalText.textContent = `Processing... (${pollCount * 5}s)`;
|
617 |
+
button.className = 'eval-button processing-state';
|
618 |
+
}
|
619 |
+
}
|
620 |
+
} catch (error) {
|
621 |
+
console.error('Error polling status:', error);
|
622 |
+
this.showLogMessage(`Error checking status for paper ${arxivId}`, 'error');
|
623 |
+
}
|
624 |
+
|
625 |
+
pollCount++;
|
626 |
+
if (pollCount < maxPolls) {
|
627 |
+
setTimeout(poll, 5000); // Poll every 5 seconds
|
628 |
+
} else {
|
629 |
+
// Timeout
|
630 |
+
evalIcon.className = 'fas fa-clock eval-icon';
|
631 |
+
evalText.textContent = 'Timeout';
|
632 |
+
button.className = 'eval-button error-state';
|
633 |
+
button.disabled = false;
|
634 |
+
this.showLogMessage(`Evaluation timeout for paper ${arxivId}`, 'warning');
|
635 |
+
}
|
636 |
+
};
|
637 |
+
|
638 |
+
// Start polling
|
639 |
+
setTimeout(poll, 5000); // First poll after 5 seconds
|
640 |
+
}
|
641 |
+
|
642 |
+
showLogMessage(message, type = 'info') {
|
643 |
+
// Create or get log container
|
644 |
+
let logContainer = document.getElementById('evaluation-log');
|
645 |
+
if (!logContainer) {
|
646 |
+
logContainer = document.createElement('div');
|
647 |
+
logContainer.id = 'evaluation-log';
|
648 |
+
logContainer.className = 'evaluation-log';
|
649 |
+
logContainer.style.cssText = `
|
650 |
+
position: fixed;
|
651 |
+
bottom: 20px;
|
652 |
+
right: 20px;
|
653 |
+
max-width: 400px;
|
654 |
+
max-height: 300px;
|
655 |
+
overflow-y: auto;
|
656 |
+
background: var(--bg-primary);
|
657 |
+
border: 1px solid var(--border-medium);
|
658 |
+
border-radius: 8px;
|
659 |
+
padding: 12px;
|
660 |
+
box-shadow: var(--shadow-lg);
|
661 |
+
z-index: 1000;
|
662 |
+
font-size: 12px;
|
663 |
+
`;
|
664 |
+
document.body.appendChild(logContainer);
|
665 |
+
}
|
666 |
+
|
667 |
+
// Create log entry
|
668 |
+
const logEntry = document.createElement('div');
|
669 |
+
logEntry.className = `log-entry log-${type}`;
|
670 |
+
logEntry.style.cssText = `
|
671 |
+
margin-bottom: 8px;
|
672 |
+
padding: 8px;
|
673 |
+
border-radius: 4px;
|
674 |
+
border-left: 3px solid;
|
675 |
+
`;
|
676 |
+
|
677 |
+
// Set color based on type
|
678 |
+
switch (type) {
|
679 |
+
case 'success':
|
680 |
+
logEntry.style.borderLeftColor = 'var(--accent-success)';
|
681 |
+
logEntry.style.backgroundColor = 'rgba(16, 185, 129, 0.1)';
|
682 |
+
break;
|
683 |
+
case 'error':
|
684 |
+
logEntry.style.borderLeftColor = 'var(--accent-danger)';
|
685 |
+
logEntry.style.backgroundColor = 'rgba(239, 68, 68, 0.1)';
|
686 |
+
break;
|
687 |
+
case 'warning':
|
688 |
+
logEntry.style.borderLeftColor = 'var(--accent-warning)';
|
689 |
+
logEntry.style.backgroundColor = 'rgba(245, 158, 11, 0.1)';
|
690 |
+
break;
|
691 |
+
default:
|
692 |
+
logEntry.style.borderLeftColor = 'var(--accent-primary)';
|
693 |
+
logEntry.style.backgroundColor = 'rgba(59, 130, 246, 0.1)';
|
694 |
+
}
|
695 |
+
|
696 |
+
const timestamp = new Date().toLocaleTimeString();
|
697 |
+
logEntry.innerHTML = `
|
698 |
+
<div style="font-weight: 500; margin-bottom: 2px;">${timestamp}</div>
|
699 |
+
<div>${message}</div>
|
700 |
+
`;
|
701 |
+
|
702 |
+
logContainer.appendChild(logEntry);
|
703 |
+
logContainer.scrollTop = logContainer.scrollHeight;
|
704 |
+
|
705 |
+
// Auto-remove old entries (keep last 10)
|
706 |
+
const entries = logContainer.querySelectorAll('.log-entry');
|
707 |
+
if (entries.length > 10) {
|
708 |
+
entries[0].remove();
|
709 |
+
}
|
710 |
+
|
711 |
+
// Auto-hide success messages after 5 seconds
|
712 |
+
if (type === 'success') {
|
713 |
+
setTimeout(() => {
|
714 |
+
if (logEntry.parentNode) {
|
715 |
+
logEntry.style.opacity = '0.5';
|
716 |
+
}
|
717 |
+
}, 5000);
|
718 |
+
}
|
719 |
+
}
|
720 |
}
|
721 |
|
722 |
// Main Application
|
|
|
724 |
constructor() {
|
725 |
this.themeManager = new ThemeManager();
|
726 |
this.dateManager = new DateManager();
|
727 |
+
this.dateManager.setApp(this); // Pass app reference to date manager
|
728 |
this.searchManager = new SearchManager();
|
729 |
this.cardRenderer = new PaperCardRenderer();
|
730 |
this.init();
|
|
|
732 |
|
733 |
init() {
|
734 |
this.bindEvents();
|
735 |
+
this.dateManager.showLoading('Loading papers...', 'Initializing application');
|
736 |
this.loadDaily();
|
737 |
}
|
738 |
|
|
|
751 |
});
|
752 |
}
|
753 |
|
754 |
+
async loadDaily(direction = null) {
|
755 |
const dateStr = this.dateManager.getDateString();
|
756 |
|
757 |
try {
|
758 |
+
// Build URL with direction parameter if provided
|
759 |
+
let url = `/api/daily?date_str=${encodeURIComponent(dateStr)}`;
|
760 |
+
if (direction) {
|
761 |
+
url += `&direction=${direction}`;
|
762 |
+
}
|
763 |
+
|
764 |
+
const response = await fetch(url);
|
765 |
|
766 |
if (!response.ok) {
|
767 |
throw new Error('Failed to load daily papers');
|
|
|
773 |
requested_date: data.requested_date,
|
774 |
actual_date: data.date,
|
775 |
fallback_used: data.fallback_used,
|
776 |
+
cards_count: data.cards?.length,
|
777 |
+
direction: direction
|
778 |
});
|
779 |
|
780 |
+
// Handle fallback cases - if we got redirected to a different date
|
781 |
if (data.date && data.requested_date && data.date !== data.requested_date) {
|
782 |
+
console.log('Redirected from', data.requested_date, 'to', data.date);
|
783 |
+
|
784 |
+
// Update to the actual date that was found
|
785 |
+
const actualDate = new Date(data.date);
|
786 |
+
this.dateManager.currentDate = actualDate;
|
787 |
this.dateManager.updateDateDisplay();
|
788 |
|
789 |
+
// Show a notification about the redirect
|
790 |
+
this.showRedirectNotification(data.requested_date, data.date);
|
791 |
+
} else if (data.cards && data.cards.length === 0) {
|
792 |
+
// No papers found for the requested date
|
793 |
+
this.showNoPapersNotification(data.requested_date);
|
794 |
}
|
795 |
|
796 |
// Show cache status if available
|
|
|
815 |
</a>
|
816 |
</div>
|
817 |
`;
|
818 |
+
} finally {
|
819 |
+
// Hide loading animation and update button states
|
820 |
+
this.dateManager.hideLoading();
|
821 |
+
await this.dateManager.updateDateDisplay();
|
822 |
}
|
823 |
}
|
824 |
|
825 |
+
// Removed showFallbackNotification - now using unified notification system
|
826 |
+
|
827 |
+
// Unified notification system
|
828 |
+
showNotification(options) {
|
829 |
+
const {
|
830 |
+
type = 'info', // 'info', 'success', 'warning', 'error'
|
831 |
+
title = '',
|
832 |
+
message = '',
|
833 |
+
duration = 4000,
|
834 |
+
icon = null
|
835 |
+
} = options;
|
|
|
|
|
|
|
|
|
|
836 |
|
837 |
+
// Remove existing notifications
|
838 |
+
const existingNotifications = document.querySelectorAll('.notification');
|
839 |
+
existingNotifications.forEach(notification => {
|
|
|
840 |
if (notification.parentNode) {
|
841 |
notification.parentNode.removeChild(notification);
|
842 |
}
|
843 |
+
});
|
844 |
+
|
845 |
+
// Create notification element
|
|
|
|
|
846 |
const notification = document.createElement('div');
|
847 |
+
notification.className = 'notification';
|
848 |
+
|
849 |
+
// Set icon based on type if not provided
|
850 |
+
let iconClass = icon;
|
851 |
+
if (!iconClass) {
|
852 |
+
switch (type) {
|
853 |
+
case 'success':
|
854 |
+
iconClass = 'fas fa-check-circle';
|
855 |
+
break;
|
856 |
+
case 'warning':
|
857 |
+
iconClass = 'fas fa-exclamation-triangle';
|
858 |
+
break;
|
859 |
+
case 'error':
|
860 |
+
iconClass = 'fas fa-times-circle';
|
861 |
+
break;
|
862 |
+
case 'info':
|
863 |
+
default:
|
864 |
+
iconClass = 'fas fa-info-circle';
|
865 |
+
break;
|
866 |
+
}
|
867 |
+
}
|
868 |
+
|
869 |
+
// Set colors based on type
|
870 |
+
let borderColor = 'var(--accent-info)';
|
871 |
+
let iconColor = 'var(--accent-info)';
|
872 |
+
|
873 |
+
switch (type) {
|
874 |
+
case 'success':
|
875 |
+
borderColor = 'var(--accent-success)';
|
876 |
+
iconColor = 'var(--accent-success)';
|
877 |
+
break;
|
878 |
+
case 'warning':
|
879 |
+
borderColor = 'var(--accent-warning)';
|
880 |
+
iconColor = 'var(--accent-warning)';
|
881 |
+
break;
|
882 |
+
case 'error':
|
883 |
+
borderColor = 'var(--accent-danger)';
|
884 |
+
iconColor = 'var(--accent-danger)';
|
885 |
+
break;
|
886 |
+
}
|
887 |
+
|
888 |
notification.style.cssText = `
|
889 |
position: fixed;
|
890 |
top: 20px;
|
891 |
right: 20px;
|
892 |
background: var(--bg-primary);
|
893 |
+
border: 1px solid ${borderColor};
|
894 |
border-radius: 8px;
|
895 |
padding: 16px;
|
896 |
box-shadow: var(--shadow-lg);
|
897 |
z-index: 1000;
|
898 |
+
max-width: 350px;
|
899 |
color: var(--text-primary);
|
900 |
+
animation: slideInRight 0.3s ease;
|
901 |
`;
|
902 |
|
|
|
|
|
903 |
notification.innerHTML = `
|
904 |
+
<div style="display: flex; align-items: flex-start; gap: 12px;">
|
905 |
+
<i class="${iconClass}" style="color: ${iconColor}; font-size: 18px; margin-top: 2px; flex-shrink: 0;"></i>
|
906 |
+
<div style="flex: 1; min-width: 0;">
|
907 |
+
${title ? `<div style="font-weight: 600; margin-bottom: 4px; color: var(--text-primary);">${title}</div>` : ''}
|
908 |
+
${message ? `<div style="font-size: 14px; color: var(--text-secondary); line-height: 1.4;">${message}</div>` : ''}
|
909 |
+
</div>
|
910 |
</div>
|
|
|
|
|
|
|
911 |
`;
|
912 |
|
913 |
+
// Add CSS animation
|
914 |
+
const style = document.createElement('style');
|
915 |
+
style.textContent = `
|
916 |
+
@keyframes slideInRight {
|
917 |
+
from {
|
918 |
+
transform: translateX(100%);
|
919 |
+
opacity: 0;
|
920 |
+
}
|
921 |
+
to {
|
922 |
+
transform: translateX(0);
|
923 |
+
opacity: 1;
|
924 |
+
}
|
925 |
+
}
|
926 |
+
`;
|
927 |
+
document.head.appendChild(style);
|
928 |
+
|
929 |
document.body.appendChild(notification);
|
930 |
|
931 |
+
// Remove notification after duration
|
932 |
+
if (duration > 0) {
|
933 |
+
setTimeout(() => {
|
934 |
+
if (notification.parentNode) {
|
935 |
+
notification.style.animation = 'slideInRight 0.3s ease reverse';
|
936 |
+
setTimeout(() => {
|
937 |
+
if (notification.parentNode) {
|
938 |
+
notification.parentNode.removeChild(notification);
|
939 |
+
}
|
940 |
+
}, 300);
|
941 |
+
}
|
942 |
+
}, duration);
|
943 |
+
}
|
944 |
+
|
945 |
+
return notification;
|
946 |
+
}
|
947 |
+
|
948 |
+
// Convenience methods for different notification types
|
949 |
+
showDateLimitNotification(message) {
|
950 |
+
this.showNotification({
|
951 |
+
type: 'warning',
|
952 |
+
title: 'Date Limit',
|
953 |
+
message: message,
|
954 |
+
icon: 'fas fa-calendar-times'
|
955 |
+
});
|
956 |
+
}
|
957 |
+
|
958 |
+
showNoPapersNotification(date) {
|
959 |
+
this.showNotification({
|
960 |
+
type: 'info',
|
961 |
+
title: 'No Papers Found',
|
962 |
+
message: `No papers available for ${date}. Try a different date.`,
|
963 |
+
icon: 'fas fa-search'
|
964 |
+
});
|
965 |
+
}
|
966 |
+
|
967 |
+
showRedirectNotification(requestedDate, actualDate) {
|
968 |
+
this.showNotification({
|
969 |
+
type: 'info',
|
970 |
+
title: 'Date Redirected',
|
971 |
+
message: `Papers for ${requestedDate} not available. Showing papers for ${actualDate}.`,
|
972 |
+
icon: 'fas fa-arrow-right'
|
973 |
+
});
|
974 |
+
}
|
975 |
+
|
976 |
+
showCacheNotification(cachedAt) {
|
977 |
+
const cacheTime = new Date(cachedAt).toLocaleTimeString();
|
978 |
+
this.showNotification({
|
979 |
+
type: 'info',
|
980 |
+
title: 'Cached Data',
|
981 |
+
message: `Showing cached data from ${cacheTime}`,
|
982 |
+
icon: 'fas fa-database',
|
983 |
+
duration: 3000
|
984 |
+
});
|
985 |
}
|
986 |
}
|
987 |
|
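Taken together, `evaluatePaper()` and `pollEvaluationStatus()` above imply the following backend interaction. This is a condensed sketch for reference only: the endpoint paths and the status values (`already_evaluated`, `evaluating`, `completed`, `failed`) are read from the frontend code in this diff, and anything beyond that is an assumption.

```javascript
// Sketch of the evaluation flow driven by the new card buttons (illustrative only).
async function evaluateAndWait(arxivId, paper) {
  // Ensure the paper row exists; the insert endpoint tolerates already-stored papers.
  await fetch('/api/papers/insert', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify(paper), // { arxiv_id, title, authors, abstract, categories, published_date }
  });

  // Kick off the evaluation.
  const start = await fetch(`/api/papers/evaluate/${encodeURIComponent(arxivId)}`, { method: 'POST' });
  const started = await start.json();
  if (started.status === 'already_evaluated') return 'completed';

  // Poll the status endpoint every 5 seconds, matching pollEvaluationStatus() above.
  for (let attempt = 0; attempt < 60; attempt++) {
    await new Promise(resolve => setTimeout(resolve, 5000));
    const res = await fetch(`/api/papers/evaluate/${encodeURIComponent(arxivId)}/status`);
    const { status } = await res.json(); // 'evaluating' | 'completed' | 'failed'
    if (status === 'completed' || status === 'failed') return status;
  }
  return 'timeout';
}
```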
frontend/paper.html
CHANGED
@@ -81,7 +81,7 @@
81 |     </div>
82 | </main>
83 |
84 | -   <script src="/paper.js"></script>
85 | </body>
86 | </html>
87 |
81 |     </div>
82 | </main>
83 |
84 | +   <script src="/paper.js?v=2"></script>
85 | </body>
86 | </html>
87 |
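The only change here is the `?v=2` query string on the script tag: a simple cache-busting step so browsers re-download the updated `paper.js` instead of serving a stale cached copy. The version number needs to be bumped again whenever `paper.js` changes in a way that must reach clients immediately.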
frontend/paper.js
CHANGED
@@ -77,7 +77,7 @@ class PaperEvaluationRenderer {
 77 | }
 78 | }
 79 |
 80 | -  renderMetaGrid(meta) {
 81 |     const metaGrid = document.getElementById('metaGrid');
 82 |     if (!metaGrid) return;
 83 |
@@ -85,6 +85,7 @@ class PaperEvaluationRenderer {
 85 |     { label: 'Assessed At', value: meta.assessed_at || '-', icon: 'fas fa-calendar' },
 86 |     { label: 'Model', value: meta.model || '-', icon: 'fas fa-robot' },
 87 |     { label: 'Version', value: meta.version || '-', icon: 'fas fa-tag' },
 88 |     { label: 'Paper Path', value: meta.paper_path || '-', icon: 'fas fa-file-pdf', isLink: true }
 89 | ];
 90 |
@@ -136,7 +137,7 @@ class PaperEvaluationRenderer {
136 | `;
137 | }
138 |
139 | -  renderContent(json) {
140 |     const contentEl = document.getElementById('content');
141 |     const titleEl = document.getElementById('title');
142 |     if (!contentEl || !titleEl) return;
@@ -144,11 +145,30 @@ class PaperEvaluationRenderer {
144 |     const meta = json.metadata || {};
145 |     const paperId = getParam('id');
146 |
147 | -      //
148 | -
149 |
150 | -
151 | -
152 |
153 |     // Executive Summary - styled like Hugging Face abstract
154 |     const execSummary = json.executive_summary ? `
@@ -494,8 +514,8 @@ class PaperEvaluationRenderer {
494 | }
495 | }
496 |
497 | -  render(json) {
498 | -      this.renderContent(json);
499 |     this.updateRadarChart(json);
500 | }
501 | }
@@ -548,7 +568,7 @@ class PaperEvaluationApp {
548 | }
549 |
550 | console.log('Rendering evaluation...');
551 | -  this.renderer.render(json);
552 | console.log('Evaluation rendered successfully');
553 |
554 | } catch (error) {
|
77 |
}
|
78 |
}
|
79 |
|
80 |
+
renderMetaGrid(meta, paperAuthors = '') {
|
81 |
const metaGrid = document.getElementById('metaGrid');
|
82 |
if (!metaGrid) return;
|
83 |
|
|
|
85 |
{ label: 'Assessed At', value: meta.assessed_at || '-', icon: 'fas fa-calendar' },
|
86 |
{ label: 'Model', value: meta.model || '-', icon: 'fas fa-robot' },
|
87 |
{ label: 'Version', value: meta.version || '-', icon: 'fas fa-tag' },
|
88 |
+
{ label: 'Authors', value: paperAuthors || '-', icon: 'fas fa-users' },
|
89 |
{ label: 'Paper Path', value: meta.paper_path || '-', icon: 'fas fa-file-pdf', isLink: true }
|
90 |
];
|
91 |
|
|
|
137 |
`;
|
138 |
}
|
139 |
|
140 |
+
async renderContent(json) {
|
141 |
const contentEl = document.getElementById('content');
|
142 |
const titleEl = document.getElementById('title');
|
143 |
if (!contentEl || !titleEl) return;
|
|
|
145 |
const meta = json.metadata || {};
|
146 |
const paperId = getParam('id');
|
147 |
|
148 |
+
// Fetch paper details from database
|
149 |
+
let paperTitle = `Paper Evaluation - ${paperId}`;
|
150 |
+
let paperAuthors = '';
|
151 |
+
let paperAbstract = '';
|
152 |
|
153 |
+
try {
|
154 |
+
const response = await fetch(`/api/paper/${encodeURIComponent(paperId)}`);
|
155 |
+
if (response.ok) {
|
156 |
+
const paperData = await response.json();
|
157 |
+
if (paperData.title) {
|
158 |
+
paperTitle = paperData.title;
|
159 |
+
paperAuthors = paperData.authors || '';
|
160 |
+
paperAbstract = paperData.abstract || '';
|
161 |
+
}
|
162 |
+
}
|
163 |
+
} catch (error) {
|
164 |
+
console.error('Error fetching paper details:', error);
|
165 |
+
}
|
166 |
+
|
167 |
+
// Update title with actual paper title
|
168 |
+
titleEl.textContent = paperTitle;
|
169 |
+
|
170 |
+
// Render meta grid with paper info
|
171 |
+
this.renderMetaGrid(meta, paperAuthors);
|
172 |
|
173 |
// Executive Summary - styled like Hugging Face abstract
|
174 |
const execSummary = json.executive_summary ? `
|
|
|
514 |
}
|
515 |
}
|
516 |
|
517 |
+
async render(json) {
|
518 |
+
await this.renderContent(json);
|
519 |
this.updateRadarChart(json);
|
520 |
}
|
521 |
}
|
|
|
568 |
}
|
569 |
|
570 |
console.log('Rendering evaluation...');
|
571 |
+
await this.renderer.render(json);
|
572 |
console.log('Evaluation rendered successfully');
|
573 |
|
574 |
} catch (error) {
|
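`renderContent()` now enriches the page with data from `/api/paper/{id}`. A small sketch of that lookup follows, using only the fields the diff above actually reads (`title`, `authors`, `abstract`); other response fields are not assumed, and `loadPaperDetails` is an illustrative helper rather than code from the diff.

```javascript
// Illustrative helper mirroring the fetch added to renderContent() above.
async function loadPaperDetails(paperId) {
  try {
    const response = await fetch(`/api/paper/${encodeURIComponent(paperId)}`);
    if (!response.ok) return null;
    const paper = await response.json();
    return {
      title: paper.title || `Paper Evaluation - ${paperId}`,
      authors: paper.authors || '',
      abstract: paper.abstract || '',
    };
  } catch (error) {
    console.error('Error fetching paper details:', error);
    return null;
  }
}
```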
frontend/styles.css
CHANGED
@@ -14,6 +14,7 @@
|
|
14 |
--accent-success: #10b981;
|
15 |
--accent-warning: #f59e0b;
|
16 |
--accent-danger: #ef4444;
|
|
|
17 |
--shadow-sm: 0 1px 2px 0 rgb(0 0 0 / 0.05);
|
18 |
--shadow-md: 0 4px 6px -1px rgb(0 0 0 / 0.1), 0 2px 4px -2px rgb(0 0 0 / 0.1);
|
19 |
--shadow-lg: 0 10px 15px -3px rgb(0 0 0 / 0.1), 0 4px 6px -4px rgb(0 0 0 / 0.1);
|
@@ -35,6 +36,7 @@
|
|
35 |
--accent-success: #34d399;
|
36 |
--accent-warning: #fbbf24;
|
37 |
--accent-danger: #f87171;
|
|
|
38 |
--shadow-sm: 0 1px 2px 0 rgb(0 0 0 / 0.3);
|
39 |
--shadow-md: 0 4px 6px -1px rgb(0 0 0 / 0.3), 0 2px 4px -2px rgb(0 0 0 / 0.3);
|
40 |
--shadow-lg: 0 10px 15px -3px rgb(0 0 0 / 0.3), 0 4px 6px -4px rgb(0 0 0 / 0.3);
|
@@ -328,6 +330,7 @@ body {
|
|
328 |
display: grid;
|
329 |
grid-template-columns: repeat(auto-fill, minmax(350px, 1fr));
|
330 |
gap: 24px;
|
|
|
331 |
}
|
332 |
|
333 |
/* Hugging Face Style Paper Cards */
|
@@ -340,6 +343,7 @@ body {
|
|
340 |
border: 1px solid var(--border-light);
|
341 |
background-color: var(--bg-primary);
|
342 |
transition: all 0.2s ease;
|
|
|
343 |
}
|
344 |
|
345 |
.hf-paper-card:hover {
|
@@ -398,6 +402,46 @@ body {
|
|
398 |
color: var(--text-secondary);
|
399 |
}
|
400 |
|
|
|
|
|
|
|
|
|
401 |
.submitter-avatar-img {
|
402 |
width: 10px;
|
403 |
height: 10px;
|
@@ -429,6 +473,7 @@ body {
|
|
429 |
display: flex;
|
430 |
padding: 32px 24px 24px 24px;
|
431 |
gap: 24px;
|
|
|
432 |
}
|
433 |
|
434 |
/* Upvote Section */
|
@@ -489,6 +534,9 @@ body {
|
|
489 |
/* Paper Info */
|
490 |
.paper-info {
|
491 |
width: 100%;
|
|
|
|
|
|
|
492 |
}
|
493 |
|
494 |
.paper-title {
|
@@ -519,6 +567,7 @@ body {
|
|
519 |
align-items: center;
|
520 |
justify-content: space-between;
|
521 |
gap: 8px;
|
|
|
522 |
}
|
523 |
|
524 |
.authors-section {
|
@@ -615,6 +664,7 @@ body {
|
|
615 |
padding: 0 24px 24px 24px;
|
616 |
display: flex;
|
617 |
gap: 8px;
|
|
|
618 |
}
|
619 |
|
620 |
.eval-button {
|
@@ -631,6 +681,8 @@ body {
|
|
631 |
text-decoration: none;
|
632 |
cursor: pointer;
|
633 |
transition: all 0.2s ease;
|
|
|
|
|
634 |
}
|
635 |
|
636 |
.eval-button:hover {
|
@@ -639,6 +691,148 @@ body {
|
|
639 |
border-color: var(--border-medium);
|
640 |
}
|
641 |
|
|
|
|
|
|
|
|
|
|
|
642 |
.badge {
|
643 |
display: inline-flex;
|
644 |
align-items: center;
|
@@ -652,6 +846,62 @@ body {
|
|
652 |
font-weight: 500;
|
653 |
}
|
654 |
|
|
|
|
|
|
|
|
|
|
|
655 |
/* Responsive Design */
|
656 |
@media (max-width: 1024px) {
|
657 |
.header-container {
|
@@ -665,6 +915,7 @@ body {
|
|
665 |
|
666 |
.cards-grid {
|
667 |
grid-template-columns: repeat(auto-fill, minmax(350px, 1fr));
|
|
|
668 |
}
|
669 |
}
|
670 |
|
@@ -687,6 +938,7 @@ body {
|
|
687 |
|
688 |
.cards-grid {
|
689 |
grid-template-columns: 1fr;
|
|
|
690 |
}
|
691 |
|
692 |
.paper-card {
|
@@ -763,54 +1015,132 @@ body {
 763 | }
 764 |
 765 | .paper-header {
 766 | -   background
 767 |     border: 1px solid var(--border-light);
 768 | -   border-radius:
 769 | -   padding:
 770 | -   margin-bottom:
 771 | -   box-shadow: var(--shadow-
 772 | }
 773 |
 774 | .paper-meta h1 {
 775 | -   font-size:
 776 | -   font-weight:
 777 | -
 778 | -
 779 | }
 780 |
 781 | .meta-grid {
 782 |     display: grid;
 783 | -   grid-template-columns: repeat(auto-fit, minmax(
 784 | -   gap:
 785 | }
 786 |
 787 | .meta-item {
 788 |     display: flex;
 789 |     flex-direction: column;
 790 | -   gap:
 791 | }
 792 |
 793 | .meta-label {
 794 | -   font-size:
 795 | -   font-weight:
 796 |     color: var(--text-muted);
 797 |     text-transform: uppercase;
 798 | -   letter-spacing: 0.
 799 | }
 800 |
 801 | .meta-value {
 802 | -   font-size:
 803 |     color: var(--text-primary);
 804 | -   font-weight:
 805 | }
 806 |
 807 | .meta-value a {
 808 |     color: var(--accent-primary);
 809 |     text-decoration: none;
 810 | }
 811 |
 812 | .meta-value a:hover {
 813 | -
 814 | }
 815 |
 816 | .content-layout {
@@ -1372,4 +1702,55 @@ body {
1372 | }
1373 | }
1374 |
1375 |
|
|
|
14 |
--accent-success: #10b981;
|
15 |
--accent-warning: #f59e0b;
|
16 |
--accent-danger: #ef4444;
|
17 |
+
--accent-info: #3b82f6;
|
18 |
--shadow-sm: 0 1px 2px 0 rgb(0 0 0 / 0.05);
|
19 |
--shadow-md: 0 4px 6px -1px rgb(0 0 0 / 0.1), 0 2px 4px -2px rgb(0 0 0 / 0.1);
|
20 |
--shadow-lg: 0 10px 15px -3px rgb(0 0 0 / 0.1), 0 4px 6px -4px rgb(0 0 0 / 0.1);
|
|
|
36 |
--accent-success: #34d399;
|
37 |
--accent-warning: #fbbf24;
|
38 |
--accent-danger: #f87171;
|
39 |
+
--accent-info: #60a5fa;
|
40 |
--shadow-sm: 0 1px 2px 0 rgb(0 0 0 / 0.3);
|
41 |
--shadow-md: 0 4px 6px -1px rgb(0 0 0 / 0.3), 0 2px 4px -2px rgb(0 0 0 / 0.3);
|
42 |
--shadow-lg: 0 10px 15px -3px rgb(0 0 0 / 0.3), 0 4px 6px -4px rgb(0 0 0 / 0.3);
|
|
|
330 |
display: grid;
|
331 |
grid-template-columns: repeat(auto-fill, minmax(350px, 1fr));
|
332 |
gap: 24px;
|
333 |
+
align-items: stretch;
|
334 |
}
|
335 |
|
336 |
/* Hugging Face Style Paper Cards */
|
|
|
343 |
border: 1px solid var(--border-light);
|
344 |
background-color: var(--bg-primary);
|
345 |
transition: all 0.2s ease;
|
346 |
+
height: 100%;
|
347 |
}
|
348 |
|
349 |
.hf-paper-card:hover {
|
|
|
402 |
color: var(--text-secondary);
|
403 |
}
|
404 |
|
405 |
+
/* Score badge */
|
406 |
+
.score-badge {
|
407 |
+
position: absolute;
|
408 |
+
right: 16px;
|
409 |
+
bottom: 16px;
|
410 |
+
display: flex;
|
411 |
+
flex-direction: column;
|
412 |
+
align-items: center;
|
413 |
+
justify-content: center;
|
414 |
+
border-radius: 12px;
|
415 |
+
background: linear-gradient(135deg, var(--accent-primary), var(--accent-secondary));
|
416 |
+
color: white;
|
417 |
+
padding: 8px 12px;
|
418 |
+
min-width: 60px;
|
419 |
+
box-shadow: var(--shadow-lg);
|
420 |
+
z-index: 20;
|
421 |
+
transition: all 0.2s ease;
|
422 |
+
cursor: pointer;
|
423 |
+
}
|
424 |
+
|
425 |
+
.score-badge:hover {
|
426 |
+
transform: translateY(-2px);
|
427 |
+
box-shadow: var(--shadow-xl);
|
428 |
+
}
|
429 |
+
|
430 |
+
.score-badge .score-number {
|
431 |
+
font-size: 20px;
|
432 |
+
font-weight: 800;
|
433 |
+
line-height: 1;
|
434 |
+
margin-bottom: 2px;
|
435 |
+
}
|
436 |
+
|
437 |
+
.score-badge .score-label {
|
438 |
+
font-size: 10px;
|
439 |
+
font-weight: 600;
|
440 |
+
opacity: 0.9;
|
441 |
+
text-transform: uppercase;
|
442 |
+
letter-spacing: 0.5px;
|
443 |
+
}
|
444 |
+
|
445 |
.submitter-avatar-img {
|
446 |
width: 10px;
|
447 |
height: 10px;
|
|
|
473 |
display: flex;
|
474 |
padding: 32px 24px 24px 24px;
|
475 |
gap: 24px;
|
476 |
+
flex: 1;
|
477 |
}
|
478 |
|
479 |
/* Upvote Section */
|
|
|
534 |
/* Paper Info */
|
535 |
.paper-info {
|
536 |
width: 100%;
|
537 |
+
display: flex;
|
538 |
+
flex-direction: column;
|
539 |
+
flex: 1;
|
540 |
}
|
541 |
|
542 |
.paper-title {
|
|
|
567 |
align-items: center;
|
568 |
justify-content: space-between;
|
569 |
gap: 8px;
|
570 |
+
margin-top: auto;
|
571 |
}
|
572 |
|
573 |
.authors-section {
|
|
|
664 |
padding: 0 24px 24px 24px;
|
665 |
display: flex;
|
666 |
gap: 8px;
|
667 |
+
margin-top: auto;
|
668 |
}
|
669 |
|
670 |
.eval-button {
|
|
|
681 |
text-decoration: none;
|
682 |
cursor: pointer;
|
683 |
transition: all 0.2s ease;
|
684 |
+
min-width: 100px;
|
685 |
+
justify-content: center;
|
686 |
}
|
687 |
|
688 |
.eval-button:hover {
|
|
|
691 |
border-color: var(--border-medium);
|
692 |
}
|
693 |
|
694 |
+
.eval-button:disabled {
|
695 |
+
opacity: 0.6;
|
696 |
+
cursor: not-allowed;
|
697 |
+
}
|
698 |
+
|
699 |
+
/* Evaluate state (green) */
|
700 |
+
.eval-button.evaluate-state {
|
701 |
+
color: var(--accent-success);
|
702 |
+
border-color: var(--accent-success);
|
703 |
+
}
|
704 |
+
|
705 |
+
.eval-button.evaluate-state:hover {
|
706 |
+
background-color: var(--accent-success);
|
707 |
+
color: white;
|
708 |
+
}
|
709 |
+
|
710 |
+
/* Evaluation state (blue) */
|
711 |
+
.eval-button.evaluation-state {
|
712 |
+
color: var(--accent-primary);
|
713 |
+
border-color: var(--accent-primary);
|
714 |
+
}
|
715 |
+
|
716 |
+
.eval-button.evaluation-state:hover {
|
717 |
+
background-color: var(--accent-primary);
|
718 |
+
color: white;
|
719 |
+
}
|
720 |
+
|
721 |
+
/* Evaluating state (orange) */
|
722 |
+
.eval-button.evaluating-state {
|
723 |
+
color: var(--accent-warning);
|
724 |
+
border-color: var(--accent-warning);
|
725 |
+
position: relative;
|
726 |
+
}
|
727 |
+
|
728 |
+
.eval-button.evaluating-state::after {
|
729 |
+
content: '';
|
730 |
+
position: absolute;
|
731 |
+
top: 50%;
|
732 |
+
right: 8px;
|
733 |
+
width: 12px;
|
734 |
+
height: 12px;
|
735 |
+
border: 2px solid transparent;
|
736 |
+
border-top: 2px solid var(--accent-warning);
|
737 |
+
border-radius: 50%;
|
738 |
+
transform: translateY(-50%);
|
739 |
+
animation: spin 1s linear infinite;
|
740 |
+
}
|
741 |
+
|
742 |
+
@keyframes spin {
|
743 |
+
0% { transform: translateY(-50%) rotate(0deg); }
|
744 |
+
100% { transform: translateY(-50%) rotate(360deg); }
|
745 |
+
}
|
746 |
+
|
747 |
+
/* Started state (green) */
|
748 |
+
.eval-button.started-state {
|
749 |
+
color: var(--accent-success);
|
750 |
+
border-color: var(--accent-success);
|
751 |
+
position: relative;
|
752 |
+
}
|
753 |
+
|
754 |
+
.eval-button.started-state::after {
|
755 |
+
content: '';
|
756 |
+
position: absolute;
|
757 |
+
top: 50%;
|
758 |
+
right: 8px;
|
759 |
+
width: 8px;
|
760 |
+
height: 8px;
|
761 |
+
background-color: var(--accent-success);
|
762 |
+
border-radius: 50%;
|
763 |
+
transform: translateY(-50%);
|
764 |
+
animation: pulse 1.5s ease-in-out infinite;
|
765 |
+
}
|
766 |
+
|
767 |
+
@keyframes pulse {
|
768 |
+
0% {
|
769 |
+
opacity: 1;
|
770 |
+
transform: translateY(-50%) scale(1);
|
771 |
+
}
|
772 |
+
50% {
|
773 |
+
opacity: 0.5;
|
774 |
+
transform: translateY(-50%) scale(1.2);
|
775 |
+
}
|
776 |
+
100% {
|
777 |
+
opacity: 1;
|
778 |
+
transform: translateY(-50%) scale(1);
|
779 |
+
}
|
780 |
+
}
|
781 |
+
|
782 |
+
/* Processing state (blue) */
|
783 |
+
.eval-button.processing-state {
|
784 |
+
color: var(--accent-primary);
|
785 |
+
border-color: var(--accent-primary);
|
786 |
+
position: relative;
|
787 |
+
}
|
788 |
+
|
789 |
+
.eval-button.processing-state::after {
|
790 |
+
content: '';
|
791 |
+
position: absolute;
|
792 |
+
top: 50%;
|
793 |
+
right: 8px;
|
794 |
+
width: 10px;
|
795 |
+
height: 10px;
|
796 |
+
background-color: var(--accent-primary);
|
797 |
+
border-radius: 50%;
|
798 |
+
transform: translateY(-50%);
|
799 |
+
animation: bounce 1s ease-in-out infinite;
|
800 |
+
}
|
801 |
+
|
802 |
+
@keyframes bounce {
|
803 |
+
0%, 20%, 50%, 80%, 100% {
|
804 |
+
transform: translateY(-50%) scale(1);
|
805 |
+
}
|
806 |
+
40% {
|
807 |
+
transform: translateY(-50%) scale(1.1);
|
808 |
+
}
|
809 |
+
60% {
|
810 |
+
transform: translateY(-50%) scale(0.9);
|
811 |
+
}
|
812 |
+
}
|
813 |
+
|
814 |
+
/* Error state (red) */
|
815 |
+
.eval-button.error-state {
|
816 |
+
color: var(--accent-danger);
|
817 |
+
border-color: var(--accent-danger);
|
818 |
+
}
|
819 |
+
|
820 |
+
/* Timeout state (gray) */
|
821 |
+
.eval-button.timeout-state {
|
822 |
+
color: var(--text-muted);
|
823 |
+
border-color: var(--text-muted);
|
824 |
+
}
|
825 |
+
|
826 |
+
/* Spinner animation */
|
827 |
+
.eval-button .fa-spinner {
|
828 |
+
animation: spin 1s linear infinite;
|
829 |
+
}
|
830 |
+
|
831 |
+
@keyframes spin {
|
832 |
+
from { transform: rotate(0deg); }
|
833 |
+
to { transform: rotate(360deg); }
|
834 |
+
}
|
835 |
+
|
836 |
.badge {
|
837 |
display: inline-flex;
|
838 |
align-items: center;
|
|
|
846 |
font-weight: 500;
|
847 |
}
|
848 |
|
849 |
+
/* Loading Animation */
|
850 |
+
.loading-overlay {
|
851 |
+
position: fixed;
|
852 |
+
top: 0;
|
853 |
+
left: 0;
|
854 |
+
width: 100%;
|
855 |
+
height: 100%;
|
856 |
+
background-color: rgba(255, 255, 255, 0.8);
|
857 |
+
backdrop-filter: blur(4px);
|
858 |
+
display: flex;
|
859 |
+
align-items: center;
|
860 |
+
justify-content: center;
|
861 |
+
z-index: 9999;
|
862 |
+
opacity: 0;
|
863 |
+
visibility: hidden;
|
864 |
+
transition: all 0.3s ease;
|
865 |
+
}
|
866 |
+
|
867 |
+
[data-theme="dark"] .loading-overlay {
|
868 |
+
background-color: rgba(15, 23, 42, 0.8);
|
869 |
+
}
|
870 |
+
|
871 |
+
.loading-overlay.show {
|
872 |
+
opacity: 1;
|
873 |
+
visibility: visible;
|
874 |
+
}
|
875 |
+
|
876 |
+
.loading-spinner {
|
877 |
+
display: flex;
|
878 |
+
flex-direction: column;
|
879 |
+
align-items: center;
|
880 |
+
gap: 16px;
|
881 |
+
}
|
882 |
+
|
883 |
+
.spinner {
|
884 |
+
width: 48px;
|
885 |
+
height: 48px;
|
886 |
+
border: 4px solid var(--border-light);
|
887 |
+
border-top: 4px solid var(--accent-primary);
|
888 |
+
border-radius: 50%;
|
889 |
+
animation: spin 1s linear infinite;
|
890 |
+
}
|
891 |
+
|
892 |
+
.loading-text {
|
893 |
+
font-size: 16px;
|
894 |
+
font-weight: 500;
|
895 |
+
color: var(--text-primary);
|
896 |
+
text-align: center;
|
897 |
+
}
|
898 |
+
|
899 |
+
.loading-subtext {
|
900 |
+
font-size: 14px;
|
901 |
+
color: var(--text-secondary);
|
902 |
+
text-align: center;
|
903 |
+
}
|
904 |
+
|
905 |
/* Responsive Design */
|
906 |
@media (max-width: 1024px) {
|
907 |
.header-container {
|
|
|
915 |
|
916 |
.cards-grid {
|
917 |
grid-template-columns: repeat(auto-fill, minmax(350px, 1fr));
|
918 |
+
align-items: stretch;
|
919 |
}
|
920 |
}
|
921 |
|
|
|
938 |
|
939 |
.cards-grid {
|
940 |
grid-template-columns: 1fr;
|
941 |
+
align-items: stretch;
|
942 |
}
|
943 |
|
944 |
.paper-card {
|
|
|
1015 |
}
|
1016 |
|
1017 |
.paper-header {
|
1018 |
+
background: linear-gradient(135deg,
|
1019 |
+
var(--bg-primary) 0%,
|
1020 |
+
var(--bg-secondary) 50%,
|
1021 |
+
var(--bg-tertiary) 100%);
|
1022 |
border: 1px solid var(--border-light);
|
1023 |
+
border-radius: 20px;
|
1024 |
+
padding: 40px;
|
1025 |
+
margin-bottom: 32px;
|
1026 |
+
box-shadow: var(--shadow-lg);
|
1027 |
+
position: relative;
|
1028 |
+
overflow: hidden;
|
1029 |
+
}
|
1030 |
+
|
1031 |
+
.paper-header::before {
|
1032 |
+
content: '';
|
1033 |
+
position: absolute;
|
1034 |
+
top: 0;
|
1035 |
+
left: 0;
|
1036 |
+
right: 0;
|
1037 |
+
height: 4px;
|
1038 |
+
background: linear-gradient(90deg,
|
1039 |
+
var(--accent-primary) 0%,
|
1040 |
+
var(--accent-secondary) 50%,
|
1041 |
+
var(--accent-success) 100%);
|
1042 |
+
}
|
1043 |
+
|
1044 |
+
.paper-header::after {
|
1045 |
+
content: '';
|
1046 |
+
position: absolute;
|
1047 |
+
top: -50%;
|
1048 |
+
right: -50%;
|
1049 |
+
width: 200%;
|
1050 |
+
height: 200%;
|
1051 |
+
background: radial-gradient(circle,
|
1052 |
+
rgba(59, 130, 246, 0.03) 0%,
|
1053 |
+
rgba(6, 182, 212, 0.02) 50%,
|
1054 |
+
transparent 100%);
|
1055 |
+
pointer-events: none;
|
1056 |
}
|
1057 |
|
1058 |
.paper-meta h1 {
|
1059 |
+
font-size: 32px;
|
1060 |
+
font-weight: 900;
|
1061 |
+
background: linear-gradient(135deg,
|
1062 |
+
var(--text-primary) 0%,
|
1063 |
+
var(--accent-primary) 50%,
|
1064 |
+
var(--accent-secondary) 100%);
|
1065 |
+
-webkit-background-clip: text;
|
1066 |
+
-webkit-text-fill-color: transparent;
|
1067 |
+
background-clip: text;
|
1068 |
+
margin-bottom: 32px;
|
1069 |
+
position: relative;
|
1070 |
+
z-index: 1;
|
1071 |
+
letter-spacing: -0.5px;
|
1072 |
}
|
1073 |
|
1074 |
.meta-grid {
|
1075 |
display: grid;
|
1076 |
+
grid-template-columns: repeat(auto-fit, minmax(280px, 1fr));
|
1077 |
+
gap: 20px;
|
1078 |
+
position: relative;
|
1079 |
+
z-index: 1;
|
1080 |
}
|
1081 |
|
1082 |
.meta-item {
|
1083 |
+
background: rgba(255, 255, 255, 0.7);
|
1084 |
+
backdrop-filter: blur(10px);
|
1085 |
+
border: 1px solid rgba(255, 255, 255, 0.2);
|
1086 |
+
border-radius: 12px;
|
1087 |
+
padding: 16px;
|
1088 |
display: flex;
|
1089 |
flex-direction: column;
|
1090 |
+
gap: 8px;
|
1091 |
+
transition: all 0.3s ease;
|
1092 |
+
box-shadow: var(--shadow-sm);
|
1093 |
+
}
|
1094 |
+
|
1095 |
+
[data-theme="dark"] .meta-item {
|
1096 |
+
background: rgba(30, 41, 59, 0.7);
|
1097 |
+
border: 1px solid rgba(255, 255, 255, 0.1);
|
1098 |
+
}
|
1099 |
+
|
1100 |
+
.meta-item:hover {
|
1101 |
+
transform: translateY(-2px);
|
1102 |
+
box-shadow: var(--shadow-md);
|
1103 |
+
border-color: var(--accent-primary);
|
1104 |
}
|
1105 |
|
1106 |
.meta-label {
|
1107 |
+
font-size: 11px;
|
1108 |
+
font-weight: 700;
|
1109 |
color: var(--text-muted);
|
1110 |
text-transform: uppercase;
|
1111 |
+
letter-spacing: 0.8px;
|
1112 |
+
display: flex;
|
1113 |
+
align-items: center;
|
1114 |
+
gap: 6px;
|
1115 |
+
}
|
1116 |
+
|
1117 |
+
.meta-label i {
|
1118 |
+
color: var(--accent-primary);
|
1119 |
+
font-size: 12px;
|
1120 |
}
|
1121 |
|
1122 |
.meta-value {
|
1123 |
+
font-size: 15px;
|
1124 |
color: var(--text-primary);
|
1125 |
+
font-weight: 600;
|
1126 |
+
line-height: 1.4;
|
1127 |
}
|
1128 |
|
1129 |
.meta-value a {
|
1130 |
color: var(--accent-primary);
|
1131 |
text-decoration: none;
|
1132 |
+
font-weight: 600;
|
1133 |
+
transition: all 0.3s ease;
|
1134 |
+
padding: 4px 8px;
|
1135 |
+
border-radius: 6px;
|
1136 |
+
background: rgba(59, 130, 246, 0.1);
|
1137 |
+
display: inline-block;
|
1138 |
}
|
1139 |
|
1140 |
.meta-value a:hover {
|
1141 |
+
background: rgba(59, 130, 246, 0.2);
|
1142 |
+
transform: translateY(-1px);
|
1143 |
+
box-shadow: var(--shadow-sm);
|
1144 |
}
|
1145 |
|
1146 |
.content-layout {
|
|
|
1702 |
}
|
1703 |
}
|
1704 |
|
1705 |
+
/* Responsive Design for Paper Header */
|
1706 |
+
@media (max-width: 768px) {
|
1707 |
+
.paper-header {
|
1708 |
+
padding: 24px;
|
1709 |
+
margin-bottom: 24px;
|
1710 |
+
border-radius: 16px;
|
1711 |
+
}
|
1712 |
+
|
1713 |
+
.paper-meta h1 {
|
1714 |
+
font-size: 24px;
|
1715 |
+
margin-bottom: 24px;
|
1716 |
+
}
|
1717 |
+
|
1718 |
+
.meta-grid {
|
1719 |
+
grid-template-columns: 1fr;
|
1720 |
+
gap: 16px;
|
1721 |
+
}
|
1722 |
+
|
1723 |
+
.meta-item {
|
1724 |
+
padding: 12px;
|
1725 |
+
}
|
1726 |
+
|
1727 |
+
.meta-value {
|
1728 |
+
font-size: 14px;
|
1729 |
+
}
|
1730 |
+
}
|
1731 |
+
|
1732 |
+
@media (max-width: 480px) {
|
1733 |
+
.paper-header {
|
1734 |
+
padding: 20px;
|
1735 |
+
margin-bottom: 20px;
|
1736 |
+
}
|
1737 |
+
|
1738 |
+
.paper-meta h1 {
|
1739 |
+
font-size: 20px;
|
1740 |
+
margin-bottom: 20px;
|
1741 |
+
}
|
1742 |
+
|
1743 |
+
.meta-item {
|
1744 |
+
padding: 10px;
|
1745 |
+
}
|
1746 |
+
|
1747 |
+
.meta-label {
|
1748 |
+
font-size: 10px;
|
1749 |
+
}
|
1750 |
+
|
1751 |
+
.meta-value {
|
1752 |
+
font-size: 13px;
|
1753 |
+
}
|
1754 |
+
}
|
1755 |
+
|
1756 |
|
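The new `.loading-overlay` rules pair with the `showLoading()`/`hideLoading()` methods added to `DateManager` in `frontend/main.js`. A minimal sketch of the markup those selectors assume is shown below; the real element lives in `index.html`, which is not part of this excerpt, so treat the exact structure as an assumption.

```javascript
// Builds the overlay structure that the new CSS targets (illustrative only).
const overlay = document.createElement('div');
overlay.id = 'loadingOverlay';
overlay.className = 'loading-overlay';
overlay.innerHTML = `
  <div class="loading-spinner">
    <div class="spinner"></div>
    <div class="loading-text">Loading papers...</div>
    <div class="loading-subtext">Fetching data from Hugging Face</div>
  </div>
`;
document.body.appendChild(overlay);

// Visibility is a single class toggle; .loading-overlay.show fades the overlay in.
overlay.classList.add('show');    // show
overlay.classList.remove('show'); // hide
```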
server.py
DELETED
@@ -1,731 +0,0 @@
|
|
1 |
-
import os
|
2 |
-
import re
|
3 |
-
import glob
|
4 |
-
import json
|
5 |
-
import sqlite3
|
6 |
-
from datetime import date, datetime, timedelta
|
7 |
-
from typing import Any, Dict, List, Optional
|
8 |
-
from contextlib import contextmanager
|
9 |
-
|
10 |
-
from fastapi import FastAPI, HTTPException
|
11 |
-
from fastapi.middleware.cors import CORSMiddleware
|
12 |
-
from fastapi.responses import FileResponse
|
13 |
-
from fastapi.staticfiles import StaticFiles
|
14 |
-
from dotenv import load_dotenv
|
15 |
-
import httpx
|
16 |
-
from bs4 import BeautifulSoup
|
17 |
-
|
18 |
-
|
19 |
-
# Load environment variables
|
20 |
-
load_dotenv()
|
21 |
-
|
22 |
-
# Get API key from HF Spaces secrets
|
23 |
-
def get_anthropic_api_key() -> Optional[str]:
|
24 |
-
"""Get Anthropic API key, prioritize HF Spaces secrets"""
|
25 |
-
# First try to get from HF Spaces secrets
|
26 |
-
hf_secret = os.getenv("HF_SECRET_ANTHROPIC_API_KEY")
|
27 |
-
if hf_secret:
|
28 |
-
return hf_secret
|
29 |
-
|
30 |
-
# Then try to get from environment variables
|
31 |
-
env_key = os.getenv("ANTHROPIC_API_KEY")
|
32 |
-
if env_key:
|
33 |
-
return env_key
|
34 |
-
|
35 |
-
return None
|
36 |
-
|
37 |
-
|
38 |
-
def get_project_root() -> str:
|
39 |
-
return os.path.dirname(os.path.abspath(__file__))
|
40 |
-
|
41 |
-
|
42 |
-
PROJECT_ROOT = get_project_root()
|
43 |
-
DEFAULT_WORKDIR = os.getenv("WORKDIR", os.path.join(PROJECT_ROOT, "workdir"))
|
44 |
-
DB_PATH = os.path.join(PROJECT_ROOT, "papers_cache.db")
|
45 |
-
|
46 |
-
# Database management
|
47 |
-
class PapersDatabase:
|
48 |
-
def __init__(self, db_path: str):
|
49 |
-
self.db_path = db_path
|
50 |
-
self.init_database()
|
51 |
-
|
52 |
-
def init_database(self):
|
53 |
-
"""Initialize the database with required tables"""
|
54 |
-
with self.get_connection() as conn:
|
55 |
-
cursor = conn.cursor()
|
56 |
-
|
57 |
-
# Create papers cache table
|
58 |
-
cursor.execute('''
|
59 |
-
CREATE TABLE IF NOT EXISTS papers_cache (
|
60 |
-
date_str TEXT PRIMARY KEY,
|
61 |
-
html_content TEXT NOT NULL,
|
62 |
-
parsed_cards TEXT NOT NULL,
|
63 |
-
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
|
64 |
-
updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
|
65 |
-
)
|
66 |
-
''')
|
67 |
-
|
68 |
-
# Create latest_date table to track the most recent available date
|
69 |
-
cursor.execute('''
|
70 |
-
CREATE TABLE IF NOT EXISTS latest_date (
|
71 |
-
id INTEGER PRIMARY KEY CHECK (id = 1),
|
72 |
-
date_str TEXT NOT NULL,
|
73 |
-
updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
|
74 |
-
)
|
75 |
-
''')
|
76 |
-
|
77 |
-
# Insert default latest_date record if it doesn't exist
|
78 |
-
cursor.execute('''
|
79 |
-
INSERT OR IGNORE INTO latest_date (id, date_str)
|
80 |
-
VALUES (1, ?)
|
81 |
-
''', (date.today().isoformat(),))
|
82 |
-
|
83 |
-
conn.commit()
|
84 |
-
|
85 |
-
@contextmanager
|
86 |
-
def get_connection(self):
|
87 |
-
"""Context manager for database connections"""
|
88 |
-
conn = sqlite3.connect(self.db_path)
|
89 |
-
conn.row_factory = sqlite3.Row # Enable dict-like access
|
90 |
-
try:
|
91 |
-
yield conn
|
92 |
-
finally:
|
93 |
-
conn.close()
|
94 |
-
|
95 |
-
def get_cached_papers(self, date_str: str) -> Optional[Dict[str, Any]]:
|
96 |
-
"""Get cached papers for a specific date"""
|
97 |
-
with self.get_connection() as conn:
|
98 |
-
cursor = conn.cursor()
|
99 |
-
cursor.execute('''
|
100 |
-
SELECT parsed_cards, created_at
|
101 |
-
FROM papers_cache
|
102 |
-
WHERE date_str = ?
|
103 |
-
''', (date_str,))
|
104 |
-
|
105 |
-
row = cursor.fetchone()
|
106 |
-
if row:
|
107 |
-
return {
|
108 |
-
'cards': json.loads(row['parsed_cards']),
|
109 |
-
'cached_at': row['created_at']
|
110 |
-
}
|
111 |
-
return None
|
112 |
-
|
113 |
-
def cache_papers(self, date_str: str, html_content: str, parsed_cards: List[Dict[str, Any]]):
|
114 |
-
"""Cache papers for a specific date"""
|
115 |
-
with self.get_connection() as conn:
|
116 |
-
cursor = conn.cursor()
|
117 |
-
cursor.execute('''
|
118 |
-
INSERT OR REPLACE INTO papers_cache
|
119 |
-
(date_str, html_content, parsed_cards, updated_at)
|
120 |
-
VALUES (?, ?, ?, CURRENT_TIMESTAMP)
|
121 |
-
''', (date_str, html_content, json.dumps(parsed_cards)))
|
122 |
-
conn.commit()
|
123 |
-
|
124 |
-
def get_latest_cached_date(self) -> Optional[str]:
|
125 |
-
"""Get the latest cached date"""
|
126 |
-
with self.get_connection() as conn:
|
127 |
-
cursor = conn.cursor()
|
128 |
-
cursor.execute('SELECT date_str FROM latest_date WHERE id = 1')
|
129 |
-
row = cursor.fetchone()
|
130 |
-
return row['date_str'] if row else None
|
131 |
-
|
132 |
-
def update_latest_date(self, date_str: str):
|
133 |
-
"""Update the latest available date"""
|
134 |
-
with self.get_connection() as conn:
|
135 |
-
cursor = conn.cursor()
|
136 |
-
cursor.execute('''
|
137 |
-
UPDATE latest_date
|
138 |
-
SET date_str = ?, updated_at = CURRENT_TIMESTAMP
|
139 |
-
WHERE id = 1
|
140 |
-
''', (date_str,))
|
141 |
-
conn.commit()
|
142 |
-
|
143 |
-
def is_cache_fresh(self, date_str: str, max_age_hours: int = 24) -> bool:
|
144 |
-
"""Check if cache is fresh (within max_age_hours)"""
|
145 |
-
with self.get_connection() as conn:
|
146 |
-
cursor = conn.cursor()
|
147 |
-
cursor.execute('''
|
148 |
-
SELECT updated_at
|
149 |
-
FROM papers_cache
|
150 |
-
WHERE date_str = ?
|
151 |
-
''', (date_str,))
|
152 |
-
|
153 |
-
row = cursor.fetchone()
|
154 |
-
if not row:
|
155 |
-
return False
|
156 |
-
|
157 |
-
cached_time = datetime.fromisoformat(row['updated_at'].replace('Z', '+00:00'))
|
158 |
-
age = datetime.now(cached_time.tzinfo) - cached_time
|
159 |
-
return age.total_seconds() < max_age_hours * 3600
|
160 |
-
|
161 |
-
def cleanup_old_cache(self, days_to_keep: int = 7):
|
162 |
-
"""Clean up old cache entries"""
|
163 |
-
cutoff_date = (datetime.now() - timedelta(days=days_to_keep)).isoformat()
|
164 |
-
with self.get_connection() as conn:
|
165 |
-
cursor = conn.cursor()
|
166 |
-
cursor.execute('''
|
167 |
-
DELETE FROM papers_cache
|
168 |
-
WHERE updated_at < ?
|
169 |
-
''', (cutoff_date,))
|
170 |
-
conn.commit()
|
171 |
-
|
172 |
-
|
173 |
-
# Initialize database
|
174 |
-
db = PapersDatabase(DB_PATH)
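
For reference, the removed `PapersDatabase` helpers above were consumed in a check-cache-then-fetch pattern by the daily-papers endpoint further down in this file; a minimal sketch, assuming the module-level `db`, `fetch_daily_html` and `parse_daily_cards` names shown in this listing:

```python
from datetime import date
from typing import Optional

async def get_papers_for(date_str: Optional[str] = None):
    """Sketch of the cache-or-fetch flow used by the daily-papers endpoint below."""
    target = date_str or date.today().isoformat()
    cached = db.get_cached_papers(target)              # {'cards': [...], 'cached_at': ...} or None
    if cached and db.is_cache_fresh(target, max_age_hours=24):
        return {"date": target, "cards": cached["cards"], "cached": True}
    actual_date, html = await fetch_daily_html(target)  # may fall back to an earlier date
    cards = parse_daily_cards(html)
    db.cache_papers(actual_date, html, cards)            # overwrite stale entry for that date
    db.update_latest_date(actual_date)
    return {"date": actual_date, "cards": cards, "cached": False}
```
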
|
175 |
-
|
176 |
-
|
177 |
-
app = FastAPI(title="PaperIndex Web")
|
178 |
-
|
179 |
-
# Local development: allow same-origin and localhost
|
180 |
-
app.add_middleware(
|
181 |
-
CORSMiddleware,
|
182 |
-
allow_origins=["*"],
|
183 |
-
allow_credentials=True,
|
184 |
-
allow_methods=["*"],
|
185 |
-
allow_headers=["*"],
|
186 |
-
)
|
187 |
-
|
188 |
-
|
189 |
-
# --- Utility functions ---
|
190 |
-
|
191 |
-
def ensure_workdir() -> str:
|
192 |
-
os.makedirs(DEFAULT_WORKDIR, exist_ok=True)
|
193 |
-
return DEFAULT_WORKDIR
|
194 |
-
|
195 |
-
|
196 |
-
def extract_arxiv_id(url: str) -> Optional[str]:
|
197 |
-
if not url:
|
198 |
-
return None
|
199 |
-
# Matches /abs/2508.05629, /pdf/2508.05629.pdf
|
200 |
-
m = re.search(r"arxiv\.org/(abs|pdf)/([0-9]{4}\.\d{4,5})(?:\.pdf)?", url)
|
201 |
-
if m:
|
202 |
-
return m.group(2)
|
203 |
-
return None
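
The regex accepts both abstract and PDF arXiv URLs and ignores everything else; a few illustrative cases:

```python
assert extract_arxiv_id("https://arxiv.org/abs/2508.05629") == "2508.05629"
assert extract_arxiv_id("https://arxiv.org/pdf/2508.05629.pdf") == "2508.05629"
assert extract_arxiv_id("https://huggingface.co/papers/2508.05629") is None  # only arxiv.org URLs match
```
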
|
204 |
-
|
205 |
-
def extract_json_data(html: str) -> Dict[str, Any]:
|
206 |
-
"""Extract JSON data from the HTML page to get GitHub stars and other metadata."""
|
207 |
-
try:
|
208 |
-
soup = BeautifulSoup(html, "lxml")
|
209 |
-
|
210 |
-
# Look for GitHub stars in the HTML structure
|
211 |
-
# Based on the user's description, GitHub stars are displayed with SVG icons
|
212 |
-
# Look for SVG elements that might represent GitHub stars
|
213 |
-
svg_elements = soup.find_all("svg")
|
214 |
-
|
215 |
-
github_stars_map = {}
|
216 |
-
|
217 |
-
for svg in svg_elements:
|
218 |
-
# Look for GitHub-related SVG (usually has specific viewBox or path)
|
219 |
-
svg_html = str(svg)
|
220 |
-
if "github" in svg_html.lower() or "256 250" in svg_html: # GitHub icon viewBox
|
221 |
-
# Look for the star count near this SVG
|
222 |
-
parent = svg.parent
|
223 |
-
if parent:
|
224 |
-
# Look for numbers that might be star counts
|
225 |
-
text_content = parent.get_text()
|
226 |
-
numbers = re.findall(r'\b(\d+)\b', text_content)
|
227 |
-
if numbers:
|
228 |
-
# The number near a GitHub SVG is likely the star count
|
229 |
-
star_count = int(numbers[0])
|
230 |
-
# Try to find the paper title or ID to associate with
|
231 |
-
# Look for the closest article or card container
|
232 |
-
article = svg.find_parent("article")
|
233 |
-
if article:
|
234 |
-
title_elem = article.find("h3")
|
235 |
-
if title_elem:
|
236 |
-
paper_title = title_elem.get_text(strip=True)
|
237 |
-
github_stars_map[paper_title] = star_count
|
238 |
-
|
239 |
-
# Also look for any elements with GitHub-related text
|
240 |
-
github_text_elements = soup.find_all(string=lambda text: text and "github" in text.lower())
|
241 |
-
for text_elem in github_text_elements:
|
242 |
-
parent = text_elem.parent
|
243 |
-
if parent:
|
244 |
-
text_content = parent.get_text()
|
245 |
-
numbers = re.findall(r'\b(\d+)\b', text_content)
|
246 |
-
if numbers:
|
247 |
-
star_count = int(numbers[0])
|
248 |
-
# Try to find the paper title
|
249 |
-
article = parent.find_parent("article")
|
250 |
-
if article:
|
251 |
-
title_elem = article.find("h3")
|
252 |
-
if title_elem:
|
253 |
-
paper_title = title_elem.get_text(strip=True)
|
254 |
-
if paper_title not in github_stars_map:
|
255 |
-
github_stars_map[paper_title] = star_count
|
256 |
-
|
257 |
-
return {"github_stars_map": github_stars_map}
|
258 |
-
|
259 |
-
except Exception as e:
|
260 |
-
print(f"Error extracting JSON data: {e}")
|
261 |
-
|
262 |
-
return {"github_stars_map": {}}
|
263 |
-
|
264 |
-
|
265 |
-
def find_eval_file_for_id(arxiv_id: str) -> Optional[str]:
|
266 |
-
workdir = ensure_workdir()
|
267 |
-
# Look for any json containing the id substring
|
268 |
-
candidates = glob.glob(os.path.join(workdir, f"**/*{arxiv_id}*.json"), recursive=True)
|
269 |
-
return candidates[0] if candidates else None
|
270 |
-
|
271 |
-
|
272 |
-
async def fetch_daily_html(target_date: str) -> tuple[str, str]:
|
273 |
-
"""Fetch daily papers HTML, with fallback to find the latest available date"""
|
274 |
-
async with httpx.AsyncClient(timeout=20, follow_redirects=False) as client:
|
275 |
-
# First try the requested date
|
276 |
-
url = f"https://huggingface.co/papers/date/{target_date}"
|
277 |
-
try:
|
278 |
-
r = await client.get(url)
|
279 |
-
|
280 |
-
# Check if we got redirected
|
281 |
-
if r.status_code in [301, 302, 303, 307, 308]:
|
282 |
-
# We got redirected, extract the actual date from the redirect location
|
283 |
-
location = r.headers.get('location', '')
|
284 |
-
print(f"Got redirect to: {location}")
|
285 |
-
|
286 |
-
# Extract date from redirect URL (e.g., /papers/date/2025-08-08)
|
287 |
-
import re
|
288 |
-
date_match = re.search(r'/papers/date/(\d{4}-\d{2}-\d{2})', location)
|
289 |
-
if date_match:
|
290 |
-
actual_date = date_match.group(1)
|
291 |
-
print(f"Redirected from {target_date} to {actual_date}")
|
292 |
-
|
293 |
-
# Fetch the actual page
|
294 |
-
actual_url = f"https://huggingface.co{location}"
|
295 |
-
r = await client.get(actual_url)
|
296 |
-
if r.status_code == 200:
|
297 |
-
return actual_date, r.text
|
298 |
-
else:
|
299 |
-
raise Exception(f"Failed to fetch redirected page: {r.status_code}")
|
300 |
-
else:
|
301 |
-
# Couldn't extract date from redirect, use fallback
|
302 |
-
raise Exception("Could not extract date from redirect")
|
303 |
-
|
304 |
-
elif r.status_code == 200:
|
305 |
-
# Direct success, check if the page actually contains the requested date
|
306 |
-
if target_date in r.text or "Daily Papers" in r.text:
|
307 |
-
return target_date, r.text
|
308 |
-
else:
|
309 |
-
# Page exists but doesn't contain expected content
|
310 |
-
raise Exception("Page exists but doesn't contain expected content")
|
311 |
-
else:
|
312 |
-
# Other error status
|
313 |
-
raise Exception(f"Status code {r.status_code}")
|
314 |
-
|
315 |
-
except Exception as e:
|
316 |
-
print(f"Failed to fetch {target_date}: {e}")
|
317 |
-
# If the requested date fails, try to find the latest available date
|
318 |
-
actual_date, html = await find_latest_available_date(client)
|
319 |
-
return actual_date, html
|
320 |
-
|
321 |
-
async def find_latest_available_date(client: httpx.AsyncClient) -> tuple[str, str]:
|
322 |
-
"""Find the latest available date by checking recent dates"""
|
323 |
-
|
324 |
-
# Start from today and go backwards up to 30 days
|
325 |
-
today = datetime.now()
|
326 |
-
for i in range(30):
|
327 |
-
check_date = today - timedelta(days=i)
|
328 |
-
date_str = check_date.strftime("%Y-%m-%d")
|
329 |
-
url = f"https://huggingface.co/papers/date/{date_str}"
|
330 |
-
|
331 |
-
try:
|
332 |
-
r = await client.get(url)
|
333 |
-
if r.status_code == 200:
|
334 |
-
# Check if the page actually has content (not just a 404 or empty page)
|
335 |
-
if "Daily Papers" in r.text and len(r.text) > 1000:
|
336 |
-
print(f"Found latest available date: {date_str}")
|
337 |
-
return date_str, r.text
|
338 |
-
except Exception:
|
339 |
-
continue
|
340 |
-
|
341 |
-
# If no date found, return a default page or raise an error
|
342 |
-
raise Exception("No available daily papers found in the last 30 days")
|
343 |
-
|
344 |
-
|
345 |
-
def parse_daily_cards(html: str) -> List[Dict[str, Any]]:
|
346 |
-
soup = BeautifulSoup(html, "lxml")
|
347 |
-
|
348 |
-
# First, extract JSON data from the page to get GitHub stars
|
349 |
-
json_data = extract_json_data(html)
|
350 |
-
|
351 |
-
# Find all article elements that contain paper cards
|
352 |
-
cards: List[Dict[str, Any]] = []
|
353 |
-
|
354 |
-
# Look for article elements with the specific class structure from Hugging Face
|
355 |
-
for article in soup.select("article.relative.flex.flex-col.overflow-hidden.rounded-xl.border"):
|
356 |
-
try:
|
357 |
-
card_data = {}
|
358 |
-
|
359 |
-
# Extract title and link
|
360 |
-
title_link = article.select_one("h3 a")
|
361 |
-
if title_link:
|
362 |
-
card_data["title"] = title_link.get_text(strip=True)
|
363 |
-
href = title_link.get("href")
|
364 |
-
if href:
|
365 |
-
if href.startswith("http"):
|
366 |
-
card_data["huggingface_url"] = href
|
367 |
-
else:
|
368 |
-
card_data["huggingface_url"] = f"https://huggingface.co{href}"
|
369 |
-
|
370 |
-
# Extract upvote count
|
371 |
-
upvote_div = article.select_one("div.shadow-alternate div.leading-none")
|
372 |
-
if upvote_div:
|
373 |
-
upvote_text = upvote_div.get_text(strip=True)
|
374 |
-
try:
|
375 |
-
card_data["upvotes"] = int(upvote_text)
|
376 |
-
except ValueError:
|
377 |
-
card_data["upvotes"] = 0
|
378 |
-
|
379 |
-
# Extract author count - look for the author count text
|
380 |
-
author_count_div = article.select_one("div.flex.truncate.text-sm")
|
381 |
-
if author_count_div:
|
382 |
-
author_text = author_count_div.get_text(strip=True)
|
383 |
-
# Extract number from "· 10 authors"
|
384 |
-
author_match = re.search(r'(\d+)\s*authors?', author_text)
|
385 |
-
if author_match:
|
386 |
-
card_data["author_count"] = int(author_match.group(1))
|
387 |
-
else:
|
388 |
-
card_data["author_count"] = 0
|
389 |
-
|
390 |
-
# Extract GitHub stars from JSON data in the page
|
391 |
-
# This will be handled later when we parse the JSON data
|
392 |
-
card_data["github_stars"] = 0 # Default value
|
393 |
-
|
394 |
-
# Extract comments count - look for comment icon and number
|
395 |
-
comment_links = article.select("a[href*='#community']")
|
396 |
-
for comment_link in comment_links:
|
397 |
-
comment_text = comment_link.get_text(strip=True)
|
398 |
-
try:
|
399 |
-
card_data["comments"] = int(comment_text)
|
400 |
-
break
|
401 |
-
except ValueError:
|
402 |
-
continue
|
403 |
-
|
404 |
-
# Extract submitter information
|
405 |
-
submitted_div = article.select_one("div.shadow-xs")
|
406 |
-
if submitted_div:
|
407 |
-
submitter_text = submitted_div.get_text(strip=True)
|
408 |
-
# Extract submitter name from "Submitted byLiang0223" (no space)
|
409 |
-
submitter_match = re.search(r'Submitted by(\S+)', submitter_text)
|
410 |
-
if submitter_match:
|
411 |
-
card_data["submitter"] = submitter_match.group(1)
|
412 |
-
|
413 |
-
# Extract arXiv ID from the URL
|
414 |
-
if card_data.get("huggingface_url"):
|
415 |
-
arxiv_id = extract_arxiv_id(card_data["huggingface_url"])
|
416 |
-
if arxiv_id:
|
417 |
-
card_data["arxiv_id"] = arxiv_id
|
418 |
-
|
419 |
-
# Try to get GitHub stars from the extracted data
|
420 |
-
# Look for GitHub stars by matching paper title
|
421 |
-
paper_title = card_data.get("title", "")
|
422 |
-
if paper_title in json_data.get("github_stars_map", {}):
|
423 |
-
card_data["github_stars"] = json_data["github_stars_map"][paper_title]
|
424 |
-
|
425 |
-
# Only add cards that have at least a title
|
426 |
-
if card_data.get("title"):
|
427 |
-
cards.append(card_data)
|
428 |
-
|
429 |
-
except Exception as e:
|
430 |
-
print(f"Error parsing card: {e}")
|
431 |
-
continue
|
432 |
-
|
433 |
-
# If the above method didn't work, fall back to the old method
|
434 |
-
if not cards:
|
435 |
-
print("Falling back to old parsing method")
|
436 |
-
for h3 in soup.select("h3"):
|
437 |
-
# Title and Hugging Face paper link (if present)
|
438 |
-
a = h3.find("a")
|
439 |
-
title = h3.get_text(strip=True)
|
440 |
-
hf_link = None
|
441 |
-
if a and a.get("href"):
|
442 |
-
href = a.get("href")
|
443 |
-
# Absolute URL to huggingface
|
444 |
-
if href.startswith("http"):
|
445 |
-
hf_link = href
|
446 |
-
else:
|
447 |
-
hf_link = f"https://huggingface.co{href}"
|
448 |
-
|
449 |
-
# Try to capture sibling info (authors, votes, etc.) as a small snippet
|
450 |
-
meta_text = None
|
451 |
-
parent = h3.parent
|
452 |
-
if parent:
|
453 |
-
# Join immediate text content following h3
|
454 |
-
collected: List[str] = []
|
455 |
-
for sib in parent.find_all(text=True, recursive=False):
|
456 |
-
t = (sib or "").strip()
|
457 |
-
if t:
|
458 |
-
collected.append(t)
|
459 |
-
if collected:
|
460 |
-
meta_text = " ".join(collected)
|
461 |
-
|
462 |
-
# Try to discover any arXiv link inside nearby anchors
|
463 |
-
arxiv_id: Optional[str] = None
|
464 |
-
container = parent if parent else h3
|
465 |
-
for link in container.find_all("a", href=True):
|
466 |
-
possible = extract_arxiv_id(link["href"])
|
467 |
-
if possible:
|
468 |
-
arxiv_id = possible
|
469 |
-
break
|
470 |
-
|
471 |
-
cards.append(
|
472 |
-
{
|
473 |
-
"title": title,
|
474 |
-
"huggingface_url": hf_link,
|
475 |
-
"meta": meta_text,
|
476 |
-
"arxiv_id": arxiv_id,
|
477 |
-
}
|
478 |
-
)
|
479 |
-
|
480 |
-
# Deduplicate by title
|
481 |
-
seen = set()
|
482 |
-
unique_cards: List[Dict[str, Any]] = []
|
483 |
-
for c in cards:
|
484 |
-
key = c.get("title") or ""
|
485 |
-
if key and key not in seen:
|
486 |
-
seen.add(key)
|
487 |
-
unique_cards.append(c)
|
488 |
-
|
489 |
-
print(f"Parsed {len(unique_cards)} cards")
|
490 |
-
return unique_cards
|
491 |
-
|
492 |
-
|
493 |
-
# --- API Routes ---
|
494 |
-
|
495 |
-
@app.get("/api/daily")
|
496 |
-
async def get_daily(date_str: Optional[str] = None) -> Dict[str, Any]:
|
497 |
-
target_date = date_str or date.today().isoformat()
|
498 |
-
|
499 |
-
# First, check if we have fresh cache for the requested date
|
500 |
-
cached_data = db.get_cached_papers(target_date)
|
501 |
-
if cached_data and db.is_cache_fresh(target_date):
|
502 |
-
print(f"Using cached data for {target_date}")
|
503 |
-
return {
|
504 |
-
"date": target_date,
|
505 |
-
"requested_date": target_date,
|
506 |
-
"cards": cached_data['cards'],
|
507 |
-
"fallback_used": False,
|
508 |
-
"cached": True,
|
509 |
-
"cached_at": cached_data['cached_at']
|
510 |
-
}
|
511 |
-
|
512 |
-
# If no cache or stale cache, try to fetch fresh data
|
513 |
-
try:
|
514 |
-
actual_date, html = await fetch_daily_html(target_date)
|
515 |
-
print(f"Fetched fresh data for {actual_date} (requested {target_date})")
|
516 |
-
|
517 |
-
# Check if we got redirected to a different date, and if that date has fresh cache
|
518 |
-
if actual_date != target_date:
|
519 |
-
cached_data = db.get_cached_papers(actual_date)
|
520 |
-
if cached_data and db.is_cache_fresh(actual_date):
|
521 |
-
print(f"Using cached data for redirected date {actual_date}")
|
522 |
-
return {
|
523 |
-
"date": actual_date,
|
524 |
-
"requested_date": target_date,
|
525 |
-
"cards": cached_data['cards'],
|
526 |
-
"fallback_used": True,
|
527 |
-
"cached": True,
|
528 |
-
"cached_at": cached_data['cached_at']
|
529 |
-
}
|
530 |
-
|
531 |
-
except Exception as e:
|
532 |
-
print(f"Failed to fetch {target_date}, trying to find latest available date")
|
533 |
-
# If the requested date fails, try to find the latest available date
|
534 |
-
async with httpx.AsyncClient(timeout=20) as client:
|
535 |
-
try:
|
536 |
-
actual_date, html = await find_latest_available_date(client)
|
537 |
-
print(f"Using fallback date: {actual_date}")
|
538 |
-
|
539 |
-
# Check if the fallback date has fresh cache
|
540 |
-
cached_data = db.get_cached_papers(actual_date)
|
541 |
-
if cached_data and db.is_cache_fresh(actual_date):
|
542 |
-
print(f"Using cached data for fallback date {actual_date}")
|
543 |
-
return {
|
544 |
-
"date": actual_date,
|
545 |
-
"requested_date": target_date,
|
546 |
-
"cards": cached_data['cards'],
|
547 |
-
"fallback_used": True,
|
548 |
-
"cached": True,
|
549 |
-
"cached_at": cached_data['cached_at']
|
550 |
-
}
|
551 |
-
|
552 |
-
except Exception as fallback_error:
|
553 |
-
print(f"Fallback also failed: {fallback_error}")
|
554 |
-
# If everything fails, return cached data if available
|
555 |
-
cached_data = db.get_cached_papers(target_date)
|
556 |
-
if cached_data:
|
557 |
-
return {
|
558 |
-
"date": target_date,
|
559 |
-
"requested_date": target_date,
|
560 |
-
"cards": cached_data['cards'],
|
561 |
-
"fallback_used": False,
|
562 |
-
"cached": True,
|
563 |
-
"cached_at": cached_data['cached_at']
|
564 |
-
}
|
565 |
-
# If no cache available, return error
|
566 |
-
raise HTTPException(status_code=503, detail="Unable to fetch papers and no cache available")
|
567 |
-
|
568 |
-
# Parse the HTML and process cards
|
569 |
-
cards = parse_daily_cards(html)
|
570 |
-
|
571 |
-
# Attempt to resolve missing arXiv ids by scraping the HF paper page
|
572 |
-
async with httpx.AsyncClient(timeout=15) as client:
|
573 |
-
async def resolve_card(card: Dict[str, Any]) -> None:
|
574 |
-
if card.get("arxiv_id") or not card.get("huggingface_url"):
|
575 |
-
return
|
576 |
-
try:
|
577 |
-
r = await client.get(card["huggingface_url"])
|
578 |
-
if r.status_code == 200:
|
579 |
-
soup = BeautifulSoup(r.text, "lxml")
|
580 |
-
for a in soup.select("a[href]"):
|
581 |
-
aid = extract_arxiv_id(a.get("href") or "")
|
582 |
-
if aid:
|
583 |
-
card["arxiv_id"] = aid
|
584 |
-
break
|
585 |
-
except Exception:
|
586 |
-
pass
|
587 |
-
|
588 |
-
# Resolve sequentially for compatibility/simplicity
|
589 |
-
for c in cards:
|
590 |
-
await resolve_card(c)
|
591 |
-
|
592 |
-
# Fallback HF link to the daily page when missing
|
593 |
-
for c in cards:
|
594 |
-
if not c.get("huggingface_url"):
|
595 |
-
c["huggingface_url"] = f"https://huggingface.co/papers/date/{actual_date}"
|
596 |
-
|
597 |
-
# Attach has_eval flag
|
598 |
-
for c in cards:
|
599 |
-
arxiv_id = c.get("arxiv_id")
|
600 |
-
c["has_eval"] = bool(arxiv_id and find_eval_file_for_id(arxiv_id))
|
601 |
-
|
602 |
-
# Cache the results
|
603 |
-
db.cache_papers(actual_date, html, cards)
|
604 |
-
db.update_latest_date(actual_date)
|
605 |
-
|
606 |
-
# Clean up old cache entries (run occasionally)
|
607 |
-
if datetime.now().hour == 2: # Run cleanup at 2 AM
|
608 |
-
db.cleanup_old_cache()
|
609 |
-
|
610 |
-
return {
|
611 |
-
"date": actual_date,
|
612 |
-
"requested_date": target_date,
|
613 |
-
"cards": cards,
|
614 |
-
"fallback_used": actual_date != target_date,
|
615 |
-
"cached": False
|
616 |
-
}
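
A minimal client-side sketch of consuming the `/api/daily` payload, using the field names from the return statements above (the base URL and port are assumptions; adjust to wherever the app is served):

```python
import asyncio
import httpx

async def load_daily(date_str=None):
    params = {"date_str": date_str} if date_str else {}
    async with httpx.AsyncClient(timeout=30) as client:
        r = await client.get("http://localhost:7860/api/daily", params=params)
        r.raise_for_status()
        data = r.json()
    if data.get("fallback_used"):
        print(f"Requested {data['requested_date']}, got {data['date']} instead")
    print(f"{len(data['cards'])} papers, cached={data.get('cached', False)}")
    return data["cards"]

cards = asyncio.run(load_daily("2025-08-08"))
```
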
|
617 |
-
|
618 |
-
|
619 |
-
@app.get("/api/evals")
|
620 |
-
def list_evals() -> Dict[str, Any]:
|
621 |
-
workdir = ensure_workdir()
|
622 |
-
files = sorted(glob.glob(os.path.join(workdir, "**/*.json"), recursive=True))
|
623 |
-
items: List[Dict[str, Any]] = []
|
624 |
-
for f in files:
|
625 |
-
base = os.path.basename(f)
|
626 |
-
# Extract arxiv-like id from filename if present
|
627 |
-
m = re.search(r"([0-9]{4}\.\d{4,5})", base)
|
628 |
-
arxiv_id = m.group(1) if m else None
|
629 |
-
items.append({"file": f, "name": base, "arxiv_id": arxiv_id})
|
630 |
-
return {"count": len(items), "items": items}
|
631 |
-
|
632 |
-
|
633 |
-
@app.get("/api/has-eval/{paper_id}")
|
634 |
-
def has_eval(paper_id: str) -> Dict[str, bool]:
|
635 |
-
exists = find_eval_file_for_id(paper_id) is not None
|
636 |
-
return {"exists": exists}
|
637 |
-
|
638 |
-
|
639 |
-
@app.get("/api/eval/{paper_id}")
|
640 |
-
def get_eval(paper_id: str) -> Any:
|
641 |
-
path = find_eval_file_for_id(paper_id)
|
642 |
-
if not path:
|
643 |
-
raise HTTPException(status_code=404, detail="Evaluation not found")
|
644 |
-
return FileResponse(path, media_type="application/json")
|
645 |
-
|
646 |
-
|
647 |
-
@app.get("/api/cache/status")
|
648 |
-
def get_cache_status() -> Dict[str, Any]:
|
649 |
-
"""Get cache status and statistics"""
|
650 |
-
with db.get_connection() as conn:
|
651 |
-
cursor = conn.cursor()
|
652 |
-
|
653 |
-
# Get total cached dates
|
654 |
-
cursor.execute('SELECT COUNT(*) as count FROM papers_cache')
|
655 |
-
total_cached = cursor.fetchone()['count']
|
656 |
-
|
657 |
-
# Get latest cached date
|
658 |
-
cursor.execute('SELECT date_str, updated_at FROM latest_date WHERE id = 1')
|
659 |
-
latest_info = cursor.fetchone()
|
660 |
-
|
661 |
-
# Get cache age distribution
|
662 |
-
cursor.execute('''
|
663 |
-
SELECT
|
664 |
-
CASE
|
665 |
-
WHEN updated_at > datetime('now', '-1 hour') THEN '1 hour'
|
666 |
-
WHEN updated_at > datetime('now', '-24 hours') THEN '24 hours'
|
667 |
-
WHEN updated_at > datetime('now', '-7 days') THEN '7 days'
|
668 |
-
ELSE 'older'
|
669 |
-
END as age_group,
|
670 |
-
COUNT(*) as count
|
671 |
-
FROM papers_cache
|
672 |
-
GROUP BY age_group
|
673 |
-
''')
|
674 |
-
age_distribution = {row['age_group']: row['count'] for row in cursor.fetchall()}
|
675 |
-
|
676 |
-
return {
|
677 |
-
"total_cached_dates": total_cached,
|
678 |
-
"latest_cached_date": latest_info['date_str'] if latest_info else None,
|
679 |
-
"latest_updated": latest_info['updated_at'] if latest_info else None,
|
680 |
-
"age_distribution": age_distribution
|
681 |
-
}
|
682 |
-
|
683 |
-
|
684 |
-
@app.post("/api/cache/clear")
|
685 |
-
def clear_cache() -> Dict[str, str]:
|
686 |
-
"""Clear all cached data"""
|
687 |
-
with db.get_connection() as conn:
|
688 |
-
cursor = conn.cursor()
|
689 |
-
cursor.execute('DELETE FROM papers_cache')
|
690 |
-
conn.commit()
|
691 |
-
return {"message": "Cache cleared successfully"}
|
692 |
-
|
693 |
-
|
694 |
-
@app.post("/api/cache/refresh/{date_str}")
|
695 |
-
async def refresh_cache(date_str: str) -> Dict[str, Any]:
|
696 |
-
"""Force refresh cache for a specific date"""
|
697 |
-
try:
|
698 |
-
# Force fetch fresh data
|
699 |
-
html = await fetch_daily_html(date_str)
|
700 |
-
cards = parse_daily_cards(html)
|
701 |
-
|
702 |
-
# Cache the results
|
703 |
-
db.cache_papers(date_str, html, cards)
|
704 |
-
|
705 |
-
return {
|
706 |
-
"message": f"Cache refreshed for {date_str}",
|
707 |
-
"cards_count": len(cards)
|
708 |
-
}
|
709 |
-
except Exception as e:
|
710 |
-
raise HTTPException(status_code=500, detail=f"Failed to refresh cache: {str(e)}")
|
711 |
-
|
712 |
-
|
713 |
-
@app.get("/styles.css")
|
714 |
-
async def get_styles():
|
715 |
-
"""Serve CSS with no-cache headers to prevent caching issues during development"""
|
716 |
-
response = FileResponse("frontend/styles.css", media_type="text/css")
|
717 |
-
response.headers["Cache-Control"] = "no-cache, no-store, must-revalidate"
|
718 |
-
response.headers["Pragma"] = "no-cache"
|
719 |
-
response.headers["Expires"] = "0"
|
720 |
-
return response
|
721 |
-
|
722 |
-
# --- Static Frontend ---
|
723 |
-
|
724 |
-
FRONTEND_DIR = os.path.join(PROJECT_ROOT, "frontend")
|
725 |
-
os.makedirs(FRONTEND_DIR, exist_ok=True)
|
726 |
-
app.mount("/", StaticFiles(directory=FRONTEND_DIR, html=True), name="static")
|
727 |
-
|
728 |
-
# Initialize database
|
729 |
-
db = PapersDatabase(DB_PATH)
|
730 |
-
|
731 |
-
|
|
src/__init__.py
ADDED
@@ -0,0 +1 @@
+# PaperIndex - A beautiful web application for browsing and evaluating daily papers
|
src/agents/__init__.py
ADDED
@@ -0,0 +1,3 @@
+# AI agents for paper evaluation
+
+
|
{agents → src/agents}/evaluator.py
RENAMED
@@ -1,5 +1,6 @@
 from __future__ import annotations
-
+import os
+import sys
 import base64
 import os
 import json
@@ -9,14 +10,15 @@ from pathlib import Path
 from datetime import datetime
 
 from anthropic import Anthropic
+from anthropic.types import ToolUseBlock
 from langgraph.graph import END, StateGraph
 from pydantic import BaseModel, Field
-from agents.prompt import REVIEWER_SYSTEM_PROMPT, EVALUATION_PROMPT_TEMPLATE, TOOLS, TOOL_CHOICE
 
-
-import
-
-from
+
+from src.agents.prompt import REVIEWER_SYSTEM_PROMPT, EVALUATION_PROMPT_TEMPLATE, TOOLS, TOOL_CHOICE
+from src.database import db
+from src.config import config
+from src.logger import logger
 
 
 class ConversationState(BaseModel):
@@ -24,6 +26,9 @@ class ConversationState(BaseModel):
     messages: List[Dict[str, Any]] = Field(default_factory=list)
     response_text: str = ""
     tool_result: Optional[Dict[str, Any]] = None
+    arxiv_id: Optional[str] = None
+    pdf_path: Optional[str] = None
+    output_file: Optional[str] = None
 
 
 def _load_pdf_as_content(pdf_path: str) -> Dict[str, Any]:
@@ -51,7 +56,7 @@ def _load_pdf_as_content(pdf_path: str) -> Dict[str, Any]:
 
 class Evaluator:
     def __init__(self, api_key: Optional[str] = None):
-        api_key = api_key or
+        api_key = api_key or os.getenv("ANTHROPIC_API_KEY")
         if not api_key:
             raise ValueError("Anthropic API key is required. Please set HF_SECRET_ANTHROPIC_API_KEY in Hugging Face Spaces secrets or ANTHROPIC_API_KEY environment variable.")
         self.client = Anthropic(api_key=api_key)
@@ -61,9 +66,24 @@ class Evaluator:
     async def __call__(self, state: ConversationState) -> ConversationState:
         """Evaluate the paper using the conversation state"""
         # Prepare messages for the API call
-        messages = [
+        messages = []
         messages.extend(state.messages)
 
+        # Load PDF content if pdf_path is provided
+        if state.pdf_path:
+            try:
+                pdf_content = _load_pdf_as_content(state.pdf_path)
+                messages.append({
+                    "role": "user",
+                    "content": [
+                        {"type": "text", "text": "Please evaluate this academic paper:"},
+                        pdf_content
+                    ]
+                })
+            except Exception as e:
+                state.response_text = f"Error loading PDF: {str(e)}"
+                return state
+
         # Add the evaluation prompt
         messages.append({
             "role": "user",
@@ -72,32 +92,49 @@
 
         try:
             # Call Anthropic API with tools
-            response =
-                model=
+            response = self.client.messages.create(
+                model=config.model_id,
                 max_tokens=4000,
+                system=self.system_prompt,
                 messages=messages,
                 tools=TOOLS,
                 tool_choice=TOOL_CHOICE
             )
 
             # Process the response
-
-
-
-
-
-            # Extract tool result if present
-            tool_result = None
-            if response.content and hasattr(response.content[0], 'tool_use'):
-                tool_use = response.content[0].tool_use
+            # Check if response is a tool use or text
+            if response.content and isinstance(response.content[0], ToolUseBlock):
+                # This is a tool use response
+                tool_use = response.content[0]
                 if tool_use:
-                    tool_result =
-
-
-
-
+                    tool_result = tool_use.input
+
+                    # set metadata
+                    tool_result['metadata'] = {
+                        'assessed_at': datetime.now().strftime("%Y-%m-%d %H:%M:%S"),
+                        'model': config.model_id,
+                        'version': config.version,
+                        'paper_path': state.pdf_path
+                    }
+
+                    state.tool_result = tool_result
+                    state.response_text = json.dumps(tool_result, ensure_ascii=False, indent=4)
+
+                    # Add tool use to messages
+                    state.messages.append({
+                        "role": "assistant",
+                        "content": f"Tool use: {tool_use.name}"
+                    })
+                else:
+                    state.response_text = "Error: Tool use response but no tool_use found"
             else:
-
+                # This is a text response
+                text_content = response.content[0].text if response.content else ""
+                state.messages.append({
+                    "role": "assistant",
+                    "content": text_content
+                })
+                state.response_text = text_content
 
         except Exception as e:
             state.response_text = f"Error during evaluation: {str(e)}"
@@ -106,29 +143,85 @@
 
 
 async def save_node(state: ConversationState) -> ConversationState:
-    """Save the evaluation result to
+    """Save the evaluation result to database"""
     try:
-
-
-
+        if not state.arxiv_id:
+            state.response_text += f"\n\nError: No arxiv_id provided for database save"
+            return state
+
+        # Parse the evaluation result
+        evaluation_content = state.response_text
+        evaluation_score = None
+        overall_score = None
+        evaluation_tags = None
 
-        #
-
-
-
+        # Try to extract score and tags from tool_result if available
+        if state.tool_result:
+            try:
+                # Extract overall automatability score from scorecard
+                if 'scorecard' in state.tool_result and 'overall_automatability' in state.tool_result['scorecard']:
+                    evaluation_score = state.tool_result['scorecard']['overall_automatability']
+
+                # Extract overall score from scorecard
+                if 'scorecard' in state.tool_result and 'overall_automatability' in state.tool_result['scorecard']:
+                    overall_score = state.tool_result['scorecard']['overall_automatability']
+
+                # Create tags from key dimensions in scorecard
+                tags = []
+                if 'scorecard' in state.tool_result:
+                    scorecard = state.tool_result['scorecard']
+                    if 'three_year_feasibility_pct' in scorecard:
+                        tags.append(f"3yr_feasibility:{scorecard['three_year_feasibility_pct']}%")
+                    if 'task_formalization' in scorecard:
+                        tags.append(f"task_formalization:{scorecard['task_formalization']}/4")
+                    if 'data_resource_availability' in scorecard:
+                        tags.append(f"data_availability:{scorecard['data_resource_availability']}/4")
+
+                evaluation_tags = ",".join(tags) if tags else None
+
+            except Exception as e:
+                logger.warning(f"Warning: Could not extract structured data from tool_result: {e}")
+        else:
+            # Try to parse evaluation_content as JSON to extract structured data
+            try:
+                evaluation_json = json.loads(evaluation_content)
+                # Extract overall automatability score from scorecard
+                if 'scorecard' in evaluation_json and 'overall_automatability' in evaluation_json['scorecard']:
+                    evaluation_score = evaluation_json['scorecard']['overall_automatability']
+
+                # Extract overall score from scorecard
+                if 'scorecard' in evaluation_json and 'overall_automatability' in evaluation_json['scorecard']:
+                    overall_score = evaluation_json['scorecard']['overall_automatability']
+
+                # Create tags from key dimensions in scorecard
+                tags = []
+                if 'scorecard' in evaluation_json:
+                    scorecard = evaluation_json['scorecard']
+                    if 'three_year_feasibility_pct' in scorecard:
+                        tags.append(f"3yr_feasibility:{scorecard['three_year_feasibility_pct']}%")
+                    if 'task_formalization' in scorecard:
+                        tags.append(f"task_formalization:{scorecard['task_formalization']}/4")
+                    if 'data_resource_availability' in scorecard:
+                        tags.append(f"data_availability:{scorecard['data_resource_availability']}/4")
+
+                evaluation_tags = ",".join(tags) if tags else None
+
+            except Exception as e:
+                logger.warning(f"Warning: Could not parse evaluation_content as JSON: {e}")
 
-        # Save
-
-
-
-
-
-
+        # Save to database
+        db.update_paper_evaluation(
+            arxiv_id=state.arxiv_id,
+            evaluation_content=evaluation_content,
+            evaluation_score=evaluation_score,
+            overall_score=overall_score,
+            evaluation_tags=evaluation_tags
+        )
 
-        state.response_text += f"\n\nEvaluation saved to: {
+        state.response_text += f"\n\nEvaluation saved to database for paper: {state.arxiv_id}"
 
     except Exception as e:
-        state.response_text += f"\n\nError saving evaluation: {str(e)}"
+        state.response_text += f"\n\nError saving evaluation to database: {str(e)}"
 
     return state
 
@@ -148,11 +241,11 @@ def build_graph(api_key: Optional[str] = None):
     return graph.compile()
 
 
-def run_evaluation(pdf_path: str, output_file: Optional[str] = None, api_key: Optional[str] = None) -> str:
+async def run_evaluation(pdf_path: str, arxiv_id: Optional[str] = None, output_file: Optional[str] = None, api_key: Optional[str] = None) -> str:
     app = build_graph(api_key=api_key)
-    initial = ConversationState(pdf_path=pdf_path, output_file=output_file)
+    initial = ConversationState(pdf_path=pdf_path, arxiv_id=arxiv_id, output_file=output_file)
     # Ensure compatibility with LangGraph's dict-based state
-    final_state = app.
+    final_state = await app.ainvoke(initial.model_dump())
     if isinstance(final_state, dict):
         return str(final_state.get("response_text", ""))
     if isinstance(final_state, ConversationState):
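
With the new async signature, the evaluator can be driven directly from Python; a minimal sketch (the PDF URL and arXiv ID below are placeholders, and `ANTHROPIC_API_KEY` must be set):

```python
import asyncio
from src.agents.evaluator import run_evaluation

async def evaluate_one():
    # When arxiv_id is given, save_node persists the structured result to the papers table.
    summary = await run_evaluation(
        pdf_path="https://arxiv.org/pdf/2508.05629",
        arxiv_id="2508.05629",
    )
    print(summary)

if __name__ == "__main__":
    asyncio.run(evaluate_one())
```
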
|
{agents → src/agents}/prompt.py
RENAMED
@@ -248,16 +248,6 @@ TOOLS = [
         "input_schema": {
             "type": "object",
             "properties": {
-                "metadata": {
-                    "type": "object",
-                    "properties": {
-                        "assessed_at": {"type": "string"},
-                        "model": {"type": "string"},
-                        "version": {"type": "string"},
-                        "paper_path": {"type": "string"},
-                    },
-                    "required": ["assessed_at", "model", "version", "paper_path"],
-                },
                 "executive_summary": {"type": "string"},
                 "dimensions": {
                     "type": "object",
@@ -382,7 +372,6 @@ TOOLS = [
                 "limitations_uncertainties": {"type": "array", "items": {"type": "string"}},
             },
             "required": [
-                "metadata",
                 "executive_summary",
                 "dimensions",
                 "scorecard",
|
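
With `metadata` removed from the schema (it is now injected by the evaluator after the tool call), the tool definition reduces to the content fields. An illustrative shape only; the real TOOLS entry in src/agents/prompt.py carries the full property schemas, and the tool name and abbreviated required list here are assumptions:

```python
EXAMPLE_TOOL = {
    "name": "submit_evaluation",  # assumption: the actual name is defined in prompt.py
    "description": "Return the structured automation-feasibility assessment of the paper.",
    "input_schema": {
        "type": "object",
        "properties": {
            "executive_summary": {"type": "string"},
            "dimensions": {"type": "object"},
            "scorecard": {"type": "object"},
        },
        "required": ["executive_summary", "dimensions", "scorecard"],
    },
}
# Forcing the model to call this tool (standard Anthropic tool_choice format):
EXAMPLE_TOOL_CHOICE = {"type": "tool", "name": "submit_evaluation"}
```
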
src/cli/__init__.py
ADDED
@@ -0,0 +1 @@
+# Command line interface tools
|
src/cli/cli.py
ADDED
@@ -0,0 +1,80 @@
+import argparse
+import os
+import sys
+import asyncio
+from typing import Optional
+from dotenv import load_dotenv
+load_dotenv()
+
+from rich.console import Console
+from rich.panel import Panel
+
+import os
+import sys
+sys.path.append(os.path.dirname(os.path.dirname(os.path.abspath(__file__))))
+from agents.evaluator import run_evaluation
+
+
+console = Console()
+
+
+def build_parser() -> argparse.ArgumentParser:
+    parser = argparse.ArgumentParser(
+        description="AI Automation Evaluator (LangGraph) — evaluate a paper PDF or arXiv URL",
+        epilog="Example: python cli.py https://arxiv.org/pdf/2507.14683 --arxiv-id 2507.14683 -o /abs/path/save_dir/eval_2507_14683",
+    )
+    parser.add_argument("pdf", help="Local PDF absolute path or URL (e.g., https://arxiv.org/pdf/xxxx)")
+    parser.add_argument(
+        "--arxiv-id",
+        dest="arxiv_id",
+        help="arXiv ID for the paper (e.g., 2507.14683)",
+    )
+    parser.add_argument(
+        "-o",
+        "--output-prefix",
+        dest="output_prefix",
+        help="Output file prefix (if provided, will save as <prefix>_YYYYMMDD_HHMMSS.md)",
+    )
+    parser.add_argument(
+        "--api-key",
+        dest="api_key",
+        default=os.getenv("ANTHROPIC_API_KEY"),
+        help="Anthropic API key (overrides ANTHROPIC_API_KEY env)",
+    )
+    return parser
+
+
+async def main(argv: Optional[list[str]] = None):
+    parser = build_parser()
+    args = parser.parse_args(argv)
+
+    pdf_path: str = args.pdf
+    arxiv_id: Optional[str] = args.arxiv_id
+    output_prefix: Optional[str] = args.output_prefix
+    api_key: Optional[str] = args.api_key or os.getenv("ANTHROPIC_API_KEY")
+
+    if not api_key:
+        console.print("[yellow]Warning:[/yellow] ANTHROPIC_API_KEY not set and --api-key not provided.", highlight=False)
+
+    console.print(Panel.fit(f"Evaluating: {pdf_path}"))
+    if arxiv_id:
+        console.print(f"arXiv ID: {arxiv_id}")
+
+    try:
+        result = await run_evaluation(pdf_path=pdf_path, arxiv_id=arxiv_id, output_file=output_prefix, api_key=api_key)
+        console.print("\n[bold green]Done.[/bold green]\n")
+        if output_prefix:
+            console.print(f"Saved to prefix: {output_prefix}_<timestamp>.md")
+        elif arxiv_id:
+            console.print(f"Evaluation saved to database for paper: {arxiv_id}")
+        else:
+            console.print(result)
+    except Exception as e:
+        console.print(f"[bold red]Error:[/bold red] {e}")
+        sys.exit(2)
+
+
+if __name__ == "__main__":
+    asyncio.run(main())
+
+
|
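
Because `main()` accepts an argv list, the CLI can also be exercised programmatically; a small sketch using the example values from the parser's epilog (the import path assumes the package layout introduced in this commit):

```python
import asyncio
from src.cli.cli import main

# Requires ANTHROPIC_API_KEY to be set; saves the evaluation to the database via --arxiv-id.
asyncio.run(main([
    "https://arxiv.org/pdf/2507.14683",
    "--arxiv-id", "2507.14683",
]))
```
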
src/config/config.py
CHANGED
@@ -2,45 +2,19 @@ import os
 from mmengine import Config as MMConfig
 from argparse import Namespace
 
-from
-load_dotenv(verbose=True)
-
-from finworld.utils import assemble_project_path, get_tag_name, Singleton, set_seed
-
-def check_level(level: str) -> bool:
-    """
-    Check if the level is valid.
-    """
-    valid_levels = ['1day', '1min', '5min', '15min', '30min', '1hour', '4hour']
-    if level not in valid_levels:
-        return False
-    return True
+from src.utils import assemble_project_path, Singleton
 
 def process_general(config: MMConfig) -> MMConfig:
 
     config.exp_path = assemble_project_path(os.path.join(config.workdir, config.tag))
     os.makedirs(config.exp_path, exist_ok=True)
 
-    config.log_path = os.path.join(config.exp_path, getattr(config, 'log_path', '
-
-    config.checkpoint_path = os.path.join(config.exp_path, getattr(config, 'checkpoint_path', 'checkpoint'))
-    os.makedirs(config.checkpoint_path, exist_ok=True)
-
-    if "plot_path" in config:
-        config.plot_path = os.path.join(config.exp_path, getattr(config, 'plot_path', 'plot'))
-        os.makedirs(config.plot_path, exist_ok=True)
-
-    if "tracker" in config:
-        for key, value in config.tracker.items():
-            config.tracker[key]['logging_dir'] = os.path.join(config.exp_path, value['logging_dir'])
-
-    if "seed" in config:
-        set_seed(config.seed)
+    config.log_path = os.path.join(config.exp_path, getattr(config, 'log_path', 'paper_agent.log'))
+    config.db_path = os.path.join(config.exp_path, getattr(config, 'db_path', 'papers_cache.db'))
+    config.frontend_path = assemble_project_path(getattr(config, 'frontend_path', 'frontend'))
 
     return config
 
-
 class Config(MMConfig, metaclass=Singleton):
     def __init__(self):
         super(Config, self).__init__()
@@ -57,30 +31,9 @@ class Config(MMConfig, metaclass=Singleton):
                 cfg_options[item] = args.__dict__[item]
         mmconfig.merge_from_dict(cfg_options)
 
-        tag = get_tag_name(
-            tag=getattr(mmconfig, 'tag', None),
-            assets_name=getattr(mmconfig, 'assets_name', None),
-            source=getattr(mmconfig, 'source', None),
-            data_type= getattr(mmconfig, 'data_type', None),
-            level= getattr(mmconfig, 'level', None),
-        )
-        mmconfig.tag = tag
-
         # Process general configuration
         mmconfig = process_general(mmconfig)
 
-        # Initialize the price downloader configuration
-        if 'downloader' in mmconfig:
-            if "assets_path" in mmconfig.downloader:
-                mmconfig.downloader.assets_path = assemble_project_path(mmconfig.downloader.assets_path)
-            assert check_level(mmconfig.downloader.level), f"Invalid level: {mmconfig.downloader.level}. Valid levels are: ['1day', '1min', '5min', '15min', '30min', '1hour', '4hour']"
-
-        if 'processor' in mmconfig:
-            if "assets_path" in mmconfig.processor:
-                mmconfig.processor.assets_path = assemble_project_path(mmconfig.processor.assets_path)
-            mmconfig.processor.repo_id = f"{os.getenv('HF_REPO_NAME')}/{mmconfig.processor.repo_id}"
-            mmconfig.processor.repo_type = mmconfig.processor.repo_type if 'repo_type' in mmconfig.processor else 'dataset'
-
         self.__dict__.update(mmconfig.__dict__)
 
 config = Config()
|
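
After this change `process_general` only derives the agent's working paths, so a config file mainly needs `workdir` and `tag` plus any overrides. A hypothetical sketch consistent with the fields read above (the concrete values, including the model id, are illustrative assumptions, not the contents of the real configs/paper_agent.py):

```python
# Hypothetical paper_agent config — field names follow process_general() and the evaluator.
workdir = "workdir"
tag = "paper_agent"                     # exp_path becomes workdir/paper_agent
log_path = "paper_agent.log"            # resolved to workdir/paper_agent/paper_agent.log
db_path = "papers_cache.db"             # resolved to workdir/paper_agent/papers_cache.db
frontend_path = "frontend"              # resolved relative to the project root
model_id = "claude-sonnet-4-20250514"   # assumption: model id read via config.model_id
version = "0.1.0"                       # assumption: written into evaluation metadata
```
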
src/crawl/__init__.py
ADDED
@@ -0,0 +1,5 @@
+# Crawl module for web scraping and data extraction
+
+from .huggingface_daily import HuggingFaceDailyPapers
+
+__all__ = ['HuggingFaceDailyPapers']
|
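
A minimal usage sketch for the exported crawler class (defined in huggingface_daily.py below); the date is just an example and the crawler falls back to the latest available day when it is missing:

```python
import asyncio
from src.crawl import HuggingFaceDailyPapers

async def crawl_daily():
    crawler = HuggingFaceDailyPapers()
    actual_date, html = await crawler.fetch_daily_html("2025-08-08")
    cards = crawler.parse_daily_cards(html)
    for card in cards[:5]:
        print(card.get("arxiv_id"), card.get("upvotes", 0), card.get("title"))
    return actual_date, cards

asyncio.run(crawl_daily())
```
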
src/crawl/huggingface_daily.py
ADDED
@@ -0,0 +1,309 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
from typing import List, Dict, Any, Optional
|
2 |
+
import re
|
3 |
+
import httpx
|
4 |
+
from datetime import datetime, timedelta
|
5 |
+
from bs4 import BeautifulSoup
|
6 |
+
|
7 |
+
from src.logger import logger
|
8 |
+
|
9 |
+
|
10 |
+
class HuggingFaceDailyPapers:
|
11 |
+
"""Class for crawling and parsing Hugging Face daily papers"""
|
12 |
+
|
13 |
+
def __init__(self):
|
14 |
+
self.base_url = "https://huggingface.co/papers/date"
|
15 |
+
self.timeout = 20
|
16 |
+
|
17 |
+
def extract_arxiv_id(self, url: str) -> Optional[str]:
|
18 |
+
"""Extract arXiv ID from a URL"""
|
19 |
+
if not url:
|
20 |
+
return None
|
21 |
+
# Matches /abs/2508.05629, /pdf/2508.05629.pdf
|
22 |
+
m = re.search(r"arxiv\.org/(abs|pdf)/([0-9]{4}\.\d{4,5})(?:\.pdf)?", url)
|
23 |
+
if m:
|
24 |
+
return m.group(2)
|
25 |
+
return None
|
26 |
+
|
27 |
+
def extract_json_data(self, html: str) -> Dict[str, Any]:
|
28 |
+
"""Extract JSON data from the HTML page to get GitHub stars and other metadata."""
|
29 |
+
try:
|
30 |
+
soup = BeautifulSoup(html, "lxml")
|
31 |
+
|
32 |
+
# Look for GitHub stars in the HTML structure
|
33 |
+
# Based on the user's description, GitHub stars are displayed with SVG icons
|
34 |
+
# Look for SVG elements that might represent GitHub stars
|
35 |
+
svg_elements = soup.find_all("svg")
|
36 |
+
|
37 |
+
github_stars_map = {}
|
38 |
+
|
39 |
+
for svg in svg_elements:
|
40 |
+
# Look for GitHub-related SVG (usually has specific viewBox or path)
|
41 |
+
svg_html = str(svg)
|
42 |
+
if "github" in svg_html.lower() or "256 250" in svg_html: # GitHub icon viewBox
|
43 |
+
# Look for the star count near this SVG
|
44 |
+
parent = svg.parent
|
45 |
+
if parent:
|
46 |
+
# Look for numbers that might be star counts
|
47 |
+
text_content = parent.get_text()
|
48 |
+
numbers = re.findall(r'\b(\d+)\b', text_content)
|
49 |
+
if numbers:
|
50 |
+
# The number near a GitHub SVG is likely the star count
|
51 |
+
star_count = int(numbers[0])
|
52 |
+
# Try to find the paper title or ID to associate with
|
53 |
+
# Look for the closest article or card container
|
54 |
+
article = svg.find_parent("article")
|
55 |
+
if article:
|
56 |
+
title_elem = article.find("h3")
|
57 |
+
if title_elem:
|
58 |
+
paper_title = title_elem.get_text(strip=True)
|
59 |
+
github_stars_map[paper_title] = star_count
|
60 |
+
|
61 |
+
# Also look for any elements with GitHub-related text
|
62 |
+
github_text_elements = soup.find_all(string=lambda text: text and "github" in text.lower())
|
63 |
+
for text_elem in github_text_elements:
|
64 |
+
parent = text_elem.parent
|
65 |
+
if parent:
|
66 |
+
text_content = parent.get_text()
|
67 |
+
numbers = re.findall(r'\b(\d+)\b', text_content)
|
68 |
+
if numbers:
|
69 |
+
star_count = int(numbers[0])
|
70 |
+
# Try to find the paper title
|
71 |
+
article = parent.find_parent("article")
|
72 |
+
if article:
|
73 |
+
title_elem = article.find("h3")
|
74 |
+
if title_elem:
|
75 |
+
paper_title = title_elem.get_text(strip=True)
|
76 |
+
if paper_title not in github_stars_map:
|
77 |
+
github_stars_map[paper_title] = star_count
|
78 |
+
|
79 |
+
return {"github_stars_map": github_stars_map}
|
80 |
+
|
81 |
+
except Exception as e:
|
82 |
+
logger.error(f"Error extracting JSON data: {e}")
|
83 |
+
|
84 |
+
return {"github_stars_map": {}}
|
85 |
+
|
86 |
+
async def fetch_daily_html(self, target_date: str) -> tuple[str, str]:
|
87 |
+
"""Fetch daily papers HTML, with fallback to find the latest available date"""
|
88 |
+
async with httpx.AsyncClient(timeout=self.timeout, follow_redirects=False) as client:
|
89 |
+
# First try the requested date
|
90 |
+
url = f"{self.base_url}/{target_date}"
|
91 |
+
try:
|
92 |
+
r = await client.get(url)
|
93 |
+
|
94 |
+
# Check if we got redirected
|
95 |
+
if r.status_code in [301, 302, 303, 307, 308]:
|
96 |
+
# We got redirected, extract the actual date from the redirect location
|
97 |
+
```python
# src/crawl/huggingface_daily.py (added file, continued from line 97)
                location = r.headers.get('location', '')
                logger.info(f"Got redirect to: {location}")

                # Extract date from redirect URL (e.g., /papers/date/2025-08-08)
                date_match = re.search(r'/papers/date/(\d{4}-\d{2}-\d{2})', location)
                if date_match:
                    actual_date = date_match.group(1)
                    logger.info(f"Redirected from {target_date} to {actual_date}")

                    # Fetch the actual page
                    actual_url = f"https://huggingface.co{location}"
                    r = await client.get(actual_url)
                    if r.status_code == 200:
                        return actual_date, r.text
                    else:
                        raise Exception(f"Failed to fetch redirected page: {r.status_code}")
                else:
                    # Couldn't extract date from redirect, use fallback
                    raise Exception("Could not extract date from redirect")

            elif r.status_code == 200:
                # Direct success, check if the page actually contains the requested date
                if target_date in r.text or "Daily Papers" in r.text:
                    return target_date, r.text
                else:
                    # Page exists but doesn't contain expected content
                    raise Exception("Page exists but doesn't contain expected content")
            else:
                # Other error status
                raise Exception(f"Status code {r.status_code}")

        except Exception as e:
            logger.error(f"Failed to fetch {target_date}: {e}")
            # If the requested date fails, try to find the latest available date
            actual_date, html = await self.find_latest_available_date(client)
            return actual_date, html

    async def find_latest_available_date(self, client: httpx.AsyncClient) -> tuple[str, str]:
        """Find the latest available date by checking recent dates"""

        # Start from today and go backwards up to 30 days
        today = datetime.now()
        for i in range(30):
            check_date = today - timedelta(days=i)
            date_str = check_date.strftime("%Y-%m-%d")
            url = f"{self.base_url}/{date_str}"

            try:
                r = await client.get(url)
                if r.status_code == 200:
                    # Check if the page actually has content (not just a 404 or empty page)
                    if "Daily Papers" in r.text and len(r.text) > 1000:
                        logger.info(f"Found latest available date: {date_str}")
                        return date_str, r.text
            except Exception:
                continue

        # If no date found, return a default page or raise an error
        raise Exception("No available daily papers found in the last 30 days")

    def parse_daily_cards(self, html: str) -> List[Dict[str, Any]]:
        """Parse daily papers HTML and extract paper cards"""
        soup = BeautifulSoup(html, "lxml")

        # First, extract JSON data from the page to get GitHub stars
        json_data = self.extract_json_data(html)

        # Find all article elements that contain paper cards
        cards: List[Dict[str, Any]] = []

        # Look for article elements with the specific class structure from Hugging Face
        for article in soup.select("article.relative.flex.flex-col.overflow-hidden.rounded-xl.border"):
            try:
                card_data = {}

                # Extract title and link
                title_link = article.select_one("h3 a")
                if title_link:
                    card_data["title"] = title_link.get_text(strip=True)
                    href = title_link.get("href")
                    if href:
                        if href.startswith("http"):
                            card_data["huggingface_url"] = href
                        else:
                            card_data["huggingface_url"] = f"https://huggingface.co{href}"

                # Extract upvote count
                upvote_div = article.select_one("div.shadow-alternate div.leading-none")
                if upvote_div:
                    upvote_text = upvote_div.get_text(strip=True)
                    try:
                        card_data["upvotes"] = int(upvote_text)
                    except ValueError:
                        card_data["upvotes"] = 0

                # Extract author count - look for the author count text
                author_count_div = article.select_one("div.flex.truncate.text-sm")
                if author_count_div:
                    author_text = author_count_div.get_text(strip=True)
                    # Extract number from "· 10 authors"
                    author_match = re.search(r'(\d+)\s*authors?', author_text)
                    if author_match:
                        card_data["author_count"] = int(author_match.group(1))
                    else:
                        card_data["author_count"] = 0

                # Extract GitHub stars from JSON data in the page
                # This will be handled later when we parse the JSON data
                card_data["github_stars"] = 0  # Default value

                # Extract comments count - look for comment icon and number
                comment_links = article.select("a[href*='#community']")
                for comment_link in comment_links:
                    comment_text = comment_link.get_text(strip=True)
                    try:
                        card_data["comments"] = int(comment_text)
                        break
                    except ValueError:
                        continue

                # Extract submitter information
                submitted_div = article.select_one("div.shadow-xs")
                if submitted_div:
                    submitter_text = submitted_div.get_text(strip=True)
                    # Extract submitter name from "Submitted byLiang0223" (no space)
                    submitter_match = re.search(r'Submitted by(\S+)', submitter_text)
                    if submitter_match:
                        card_data["submitter"] = submitter_match.group(1)

                # Extract arXiv ID from the URL
                if card_data.get("huggingface_url"):
                    arxiv_id = self.extract_arxiv_id(card_data["huggingface_url"])
                    if arxiv_id:
                        card_data["arxiv_id"] = arxiv_id

                # Try to get GitHub stars from the extracted data
                # Look for GitHub stars by matching paper title
                paper_title = card_data.get("title", "")
                if paper_title in json_data.get("github_stars_map", {}):
                    card_data["github_stars"] = json_data["github_stars_map"][paper_title]

                # Only add cards that have at least a title
                if card_data.get("title"):
                    cards.append(card_data)

            except Exception as e:
                logger.error(f"Error parsing card: {e}")
                continue

        # If the above method didn't work, fall back to the old method
        if not cards:
            logger.info("Falling back to old parsing method")
            for h3 in soup.select("h3"):
                # Title and Hugging Face paper link (if present)
                a = h3.find("a")
                title = h3.get_text(strip=True)
                hf_link = None
                if a and a.get("href"):
                    href = a.get("href")
                    # Absolute URL to huggingface
                    if href.startswith("http"):
                        hf_link = href
                    else:
                        hf_link = f"https://huggingface.co{href}"

                # Try to capture sibling info (authors, votes, etc.) as a small snippet
                meta_text = None
                parent = h3.parent
                if parent:
                    # Join immediate text content following h3
                    collected: List[str] = []
                    for sib in parent.find_all(text=True, recursive=False):
                        t = (sib or "").strip()
                        if t:
                            collected.append(t)
                    if collected:
                        meta_text = " ".join(collected)

                # Try to discover any arXiv link inside nearby anchors
                arxiv_id: Optional[str] = None
                container = parent if parent else h3
                for link in container.find_all("a", href=True):
                    possible = self.extract_arxiv_id(link["href"])
                    if possible:
                        arxiv_id = possible
                        break

                cards.append(
                    {
                        "title": title,
                        "huggingface_url": hf_link,
                        "meta": meta_text,
                        "arxiv_id": arxiv_id,
                    }
                )

        # Deduplicate by title
        seen = set()
        unique_cards: List[Dict[str, Any]] = []
        for c in cards:
            key = c.get("title") or ""
            if key and key not in seen:
                seen.add(key)
                unique_cards.append(c)

        logger.info(f"Parsed {len(unique_cards)} cards")
        return unique_cards

    async def get_daily_papers(self, target_date: str) -> tuple[str, List[Dict[str, Any]]]:
        """Get daily papers for a specific date"""
        date_str, html = await self.fetch_daily_html(target_date)
        cards = self.parse_daily_cards(html)
        return date_str, cards
```
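For orientation, here is a minimal sketch of how the crawler methods above might be driven end to end. The class name `HuggingFaceDailyCrawler` and its no-argument constructor are assumptions, since the class header and `__init__` are not part of this excerpt; only `get_daily_papers` and the card fields come from the diff itself.

```python
# Hypothetical driver for the crawler added in this commit.
# Assumption: the class is named HuggingFaceDailyCrawler and can be
# constructed without arguments; only get_daily_papers() is confirmed above.
import asyncio

async def main():
    crawler = HuggingFaceDailyCrawler()
    # Falls back to the latest available date if this one cannot be fetched.
    date_str, cards = await crawler.get_daily_papers("2025-08-08")
    for card in cards:
        print(date_str, card.get("arxiv_id"), card.get("title"), card.get("upvotes"))

asyncio.run(main())
```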
src/database/db.py
CHANGED
```diff
@@ -30,6 +30,27 @@ class PapersDatabase():
             )
         ''')
 
+        # Create papers table for individual arXiv papers
+        cursor.execute('''
+            CREATE TABLE IF NOT EXISTS papers (
+                arxiv_id TEXT PRIMARY KEY,
+                title TEXT NOT NULL,
+                authors TEXT NOT NULL,
+                abstract TEXT,
+                categories TEXT,
+                published_date TEXT,
+                evaluation_content TEXT,
+                evaluation_score REAL,
+                overall_score REAL,
+                evaluation_tags TEXT,
+                evaluation_status TEXT DEFAULT 'not_started',
+                is_evaluated BOOLEAN DEFAULT FALSE,
+                evaluation_date TIMESTAMP,
+                created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
+                updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
+            )
+        ''')
+
         # Create latest_date table to track the most recent available date
         cursor.execute('''
             CREATE TABLE IF NOT EXISTS latest_date (
@@ -59,7 +80,7 @@ class PapersDatabase():
 
     def get_cached_papers(self, date_str: str) -> Optional[Dict[str, Any]]:
         """Get cached papers for a specific date"""
-        with self.get_connection(
+        with self.get_connection() as conn:
             cursor = conn.cursor()
             cursor.execute('''
                 SELECT parsed_cards, created_at
@@ -134,6 +155,122 @@ class PapersDatabase():
            ''', (cutoff_date,))
            conn.commit()
 
+    # Papers table methods
+    def insert_paper(self, arxiv_id: str, title: str, authors: str, abstract: str = None,
+                     categories: str = None, published_date: str = None):
+        """Insert a new paper into the papers table"""
+        with self.get_connection() as conn:
+            cursor = conn.cursor()
+            cursor.execute('''
+                INSERT OR REPLACE INTO papers
+                (arxiv_id, title, authors, abstract, categories, published_date, updated_at)
+                VALUES (?, ?, ?, ?, ?, ?, CURRENT_TIMESTAMP)
+            ''', (arxiv_id, title, authors, abstract, categories, published_date))
+            conn.commit()
+
+    def get_paper(self, arxiv_id: str) -> Optional[Dict[str, Any]]:
+        """Get a paper by arxiv_id"""
+        with self.get_connection() as conn:
+            cursor = conn.cursor()
+            cursor.execute('''
+                SELECT * FROM papers WHERE arxiv_id = ?
+            ''', (arxiv_id,))
+
+            row = cursor.fetchone()
+            if row:
+                return dict(row)
+            return None
+
+    def get_papers_by_evaluation_status(self, is_evaluated: bool = None) -> List[Dict[str, Any]]:
+        """Get papers by evaluation status"""
+        with self.get_connection() as conn:
+            cursor = conn.cursor()
+            if is_evaluated is None:
+                cursor.execute('SELECT * FROM papers ORDER BY created_at DESC')
+            else:
+                cursor.execute('''
+                    SELECT * FROM papers
+                    WHERE is_evaluated = ?
+                    ORDER BY created_at DESC
+                ''', (is_evaluated,))
+
+            return [dict(row) for row in cursor.fetchall()]
+
+    def update_paper_evaluation(self, arxiv_id: str, evaluation_content: str,
+                                evaluation_score: float = None, overall_score: float = None, evaluation_tags: str = None):
+        """Update paper with evaluation content"""
+        with self.get_connection() as conn:
+            cursor = conn.cursor()
+            cursor.execute('''
+                UPDATE papers
+                SET evaluation_content = ?,
+                    evaluation_score = ?,
+                    overall_score = ?,
+                    evaluation_tags = ?,
+                    is_evaluated = TRUE,
+                    evaluation_status = 'completed',
+                    evaluation_date = CURRENT_TIMESTAMP,
+                    updated_at = CURRENT_TIMESTAMP
+                WHERE arxiv_id = ?
+            ''', (evaluation_content, evaluation_score, overall_score, evaluation_tags, arxiv_id))
+            conn.commit()
+
+    def update_paper_status(self, arxiv_id: str, status: str):
+        """Update paper evaluation status"""
+        with self.get_connection() as conn:
+            cursor = conn.cursor()
+            cursor.execute('''
+                UPDATE papers
+                SET evaluation_status = ?,
+                    updated_at = CURRENT_TIMESTAMP
+                WHERE arxiv_id = ?
+            ''', (status, arxiv_id))
+            conn.commit()
+
+    def get_unevaluated_papers(self) -> List[Dict[str, Any]]:
+        """Get all papers that haven't been evaluated yet"""
+        return self.get_papers_by_evaluation_status(is_evaluated=False)
+
+    def get_evaluated_papers(self) -> List[Dict[str, Any]]:
+        """Get all papers that have been evaluated"""
+        return self.get_papers_by_evaluation_status(is_evaluated=True)
+
+    def search_papers(self, query: str) -> List[Dict[str, Any]]:
+        """Search papers by title, authors, or abstract"""
+        with self.get_connection() as conn:
+            cursor = conn.cursor()
+            search_pattern = f'%{query}%'
+            cursor.execute('''
+                SELECT * FROM papers
+                WHERE title LIKE ? OR authors LIKE ? OR abstract LIKE ?
+                ORDER BY created_at DESC
+            ''', (search_pattern, search_pattern, search_pattern))
+
+            return [dict(row) for row in cursor.fetchall()]
+
+    def delete_paper(self, arxiv_id: str):
+        """Delete a paper from the database"""
+        with self.get_connection() as conn:
+            cursor = conn.cursor()
+            cursor.execute('DELETE FROM papers WHERE arxiv_id = ?', (arxiv_id,))
+            conn.commit()
+
+    def get_papers_count(self) -> Dict[str, int]:
+        """Get count of papers by evaluation status"""
+        with self.get_connection() as conn:
+            cursor = conn.cursor()
+            cursor.execute('SELECT COUNT(*) as total FROM papers')
+            total = cursor.fetchone()['total']
+
+            cursor.execute('SELECT COUNT(*) as evaluated FROM papers WHERE is_evaluated = TRUE')
+            evaluated = cursor.fetchone()['evaluated']
+
+            return {
+                'total': total,
+                'evaluated': evaluated,
+                'unevaluated': total - evaluated
+            }
+
     def __str__(self):
         return f"PapersDatabase(db_path={self.db_path})"
```
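As a rough usage sketch of the new papers-table API: the method names and signatures come from the diff above, while the surrounding setup is assumed (the module-level `db` instance must already have been initialized with `db.init_db(config=...)`, as the test script later in this commit does), and the status string and tag values are illustrative.

```python
# Hypothetical usage of the papers-table methods added to PapersDatabase.
# Assumption: db has already been initialized via db.init_db(config=...).
from src.database import db

db.insert_paper(
    arxiv_id="2508.05629",
    title="Example Paper",
    authors="A. Author, B. Author",
    abstract="Short abstract for illustration.",
    categories="cs.LG",
    published_date="2025-08-08",
)
db.update_paper_status("2508.05629", "in_progress")   # status value is illustrative
db.update_paper_evaluation(
    arxiv_id="2508.05629",
    evaluation_content='{"scorecard": {"overall_automatability": 3}}',
    evaluation_score=3.0,
    overall_score=3.0,
    evaluation_tags="sft,rl",
)
print(db.get_paper("2508.05629")["evaluation_status"])  # 'completed'
print(db.get_papers_count())  # {'total': ..., 'evaluated': ..., 'unevaluated': ...}
```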
src/logger/__init__.py
CHANGED
```diff
@@ -1,10 +1,7 @@
-from .
-from .monitor import Monitor, Timing, TokenUsage
+from .log import logger, LogLevel, Logger, YELLOW_HEX
 
 __all__ = ["logger",
            "LogLevel",
-           "
-           "Monitor",
+           "Logger",
            "YELLOW_HEX",
-
-           "TokenUsage"]
+           ]
```
src/logger/log.py
ADDED
@@ -0,0 +1,136 @@
```python
import logging
from enum import IntEnum
from typing import Any, Optional

from rich.console import Console, Group
from rich.panel import Panel
from rich.rule import Rule
from rich.syntax import Syntax
from rich.table import Table
from rich.tree import Tree
from rich.logging import RichHandler

from src.utils import Singleton

YELLOW_HEX = "#d4b702"

class LogLevel(IntEnum):
    CRITICAL = logging.CRITICAL
    FATAL = logging.FATAL
    ERROR = logging.ERROR
    WARNING = logging.WARNING
    WARN = logging.WARN
    INFO = logging.INFO
    DEBUG = logging.DEBUG

class Logger(logging.Logger, metaclass=Singleton):
    def __init__(self, name="logger", level=logging.INFO):
        # Initialize the parent class
        super().__init__(name, level)

        # Define a formatter for log messages
        self.formatter = logging.Formatter(
            fmt="%(asctime)s - %(name)s:%(levelname)s - %(filename)s:%(lineno)s - %(message)s",
            datefmt="%Y-%m-%d %H:%M:%S",
        )

    def init_logger(self, config, level: int = LogLevel.INFO):
        """
        Initialize the logger with a file path and optional main process check.

        Args:
            log_path (str): The log file path.
            level (int, optional): The logging level. Defaults to logging.INFO.
            accelerator (Accelerator, optional): Accelerator instance to determine the main process.
        """

        log_path = config.log_path

        self.handlers.clear()

        self.console = Console(
            width=None,
            markup=True,
            color_system="truecolor",
            force_terminal=True
        )
        rich_handler = RichHandler(
            console=self.console,
            rich_tracebacks=True,
            show_time=False,
            show_level=False,
            show_path=False,
            markup=True,
            omit_repeated_times=False
        )
        rich_handler.setLevel(level)
        rich_handler.setFormatter(self.formatter)
        self.addHandler(rich_handler)

        self.file_console = Console(
            width=None,
            markup=True,
            color_system="truecolor",
            force_terminal=True,
            file=open(log_path, "a", encoding="utf-8")
        )
        rich_file_handler = RichHandler(
            console=self.file_console,
            rich_tracebacks=True,
            show_time=False,
            show_level=False,
            show_path=False,
            markup=True,
            omit_repeated_times=False,
        )
        rich_file_handler.setLevel(level)
        rich_file_handler.setFormatter(self.formatter)
        self.addHandler(rich_file_handler)

        self.propagate = False

    def info(self, msg, *args, **kwargs):
        """
        Only for string messages, not for rich objects.
        """
        kwargs.setdefault("stacklevel", 2)

        if "style" in kwargs:
            kwargs.pop("style")
        if "level" in kwargs:
            kwargs.pop("level")
        super().info(msg, *args, **kwargs)

    def warning(self, msg, *args, **kwargs):
        """
        Only for string messages, not for rich objects.
        """
        kwargs.setdefault("stacklevel", 2)
        super().warning(msg, *args, **kwargs)

    def error(self, msg, *args, **kwargs):
        kwargs.setdefault("stacklevel", 2)
        super().error(msg, *args, **kwargs)

    def critical(self, msg, *args, **kwargs):
        kwargs.setdefault("stacklevel", 2)
        super().critical(msg, *args, **kwargs)

    def debug(self, msg, *args, **kwargs):
        kwargs.setdefault("stacklevel", 2)
        super().debug(msg, *args, **kwargs)

    def log(self,
            msg: Optional[Any] = None,
            level: LogLevel = LogLevel.INFO,
            **kwargs):
        """
        Log a rich object or a string message to both console and file.
        """
        if isinstance(msg, str):
            self.info(msg, **kwargs)
        elif isinstance(msg, (Group, Panel, Rule, Syntax, Table, Tree)):
            self.console.print(msg, **kwargs)
            self.file_console.print(msg, **kwargs)

logger = Logger()
```
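A minimal sketch of driving the new `Logger` follows. Per `init_logger` above, only `config.log_path` is read, so any object exposing that attribute should work; the `SimpleNamespace` stand-in and the log file name here are illustrative, not part of the project's real config.

```python
# Hedged usage sketch of src/logger/log.py.
# Assumption: any object with a log_path attribute can stand in for config.
from types import SimpleNamespace

from rich.panel import Panel

from src.logger import logger, LogLevel

cfg = SimpleNamespace(log_path="agent.log")  # illustrative path
logger.init_logger(config=cfg, level=LogLevel.INFO)

logger.info("Plain string messages go through the RichHandler pipeline")
logger.log(Panel("Rich objects are printed to both console and log file"))
```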
src/logger/logger.py
DELETED
@@ -1,229 +0,0 @@
```python
import logging
import json
from enum import IntEnum

from rich import box
from rich.console import Console, Group
from rich.panel import Panel
from rich.rule import Rule
from rich.syntax import Syntax
from rich.table import Table
from rich.tree import Tree

from src.utils import (
    escape_code_brackets,
    Singleton
)

YELLOW_HEX = "#d4b702"

class LogLevel(IntEnum):
    OFF = -1  # No output
    ERROR = 0  # Only errors
    INFO = 1  # Normal output (default)
    DEBUG = 2  # Detailed output

class AgentLogger(logging.Logger, metaclass=Singleton):
    def __init__(self, name="logger", level=logging.INFO):
        # Initialize the parent class
        super().__init__(name, level)

        # Define a formatter for log messages
        self.formatter = logging.Formatter(
            fmt="\033[92m%(asctime)s - %(name)s:%(levelname)s\033[0m: %(filename)s:%(lineno)s - %(message)s",
            datefmt="%H:%M:%S",
        )

    def init_logger(self, log_path: str, level=logging.INFO):
        """
        Initialize the logger with a file path and optional main process check.

        Args:
            log_path (str): The log file path.
            level (int, optional): The logging level. Defaults to logging.INFO.
            accelerator (Accelerator, optional): Accelerator instance to determine the main process.
        """

        # Add a console handler for logging to the console
        console_handler = logging.StreamHandler()
        console_handler.setLevel(level)
        console_handler.setFormatter(self.formatter)
        self.addHandler(console_handler)

        # Add a file handler for logging to the file
        file_handler = logging.FileHandler(
            log_path, mode="a"
        )  # 'a' mode appends to the file
        file_handler.setLevel(level)
        file_handler.setFormatter(self.formatter)
        self.addHandler(file_handler)

        self.console = Console(width=100)
        self.file_console = Console(file=open(log_path, "a"), width=100)

        # Prevent duplicate logs from propagating to the root logger
        self.propagate = False

    def log(self, *args, level: int | str | LogLevel = LogLevel.INFO, **kwargs) -> None:
        """Logs a message to the console.

        Args:
            level (LogLevel, optional): Defaults to LogLevel.INFO.
        """
        if isinstance(level, str):
            level = LogLevel[level.upper()]
        if level <= self.level:
            self.info(*args, **kwargs)

    def info(self, msg, *args, **kwargs):
        """
        Overridden info method with stacklevel adjustment for correct log location.
        """
        if isinstance(msg, (Rule, Panel, Group, Tree, Table, Syntax)):
            self.console.print(msg)
            self.file_console.print(msg)
        else:
            kwargs.setdefault(
                "stacklevel", 2
            )  # Adjust stack level to show the actual caller
            if "style" in kwargs:
                kwargs.pop("style")
            if "level" in kwargs:
                kwargs.pop("level")
            super().info(msg, *args, **kwargs)

    def warning(self, msg, *args, **kwargs):
        kwargs.setdefault("stacklevel", 2)
        super().warning(msg, *args, **kwargs)

    def error(self, msg, *args, **kwargs):
        kwargs.setdefault("stacklevel", 2)
        super().error(msg, *args, **kwargs)

    def critical(self, msg, *args, **kwargs):
        kwargs.setdefault("stacklevel", 2)
        super().critical(msg, *args, **kwargs)

    def debug(self, msg, *args, **kwargs):
        kwargs.setdefault("stacklevel", 2)
        super().debug(msg, *args, **kwargs)

    def log_error(self, error_message: str) -> None:
        self.info(escape_code_brackets(error_message), style="bold red", level=LogLevel.ERROR)

    def log_markdown(self, content: str, title: str | None = None, level=LogLevel.INFO, style=YELLOW_HEX) -> None:
        markdown_content = Syntax(
            content,
            lexer="markdown",
            theme="github-dark",
            word_wrap=True,
        )
        if title:
            self.info(
                Group(
                    Rule(
                        "[bold italic]" + title,
                        align="left",
                        style=style,
                    ),
                    markdown_content,
                ),
                level=level,
            )
        else:
            self.info(markdown_content, level=level)

    def log_code(self, title: str, content: str, level: int = LogLevel.INFO) -> None:
        self.info(
            Panel(
                Syntax(
                    content,
                    lexer="python",
                    theme="monokai",
                    word_wrap=True,
                ),
                title="[bold]" + title,
                title_align="left",
                box=box.HORIZONTALS,
            ),
            level=level,
        )

    def log_rule(self, title: str, level: int = LogLevel.INFO) -> None:
        self.info(
            Rule(
                "[bold]" + title,
                characters="━",
                style=YELLOW_HEX,
            ),
            level=LogLevel.INFO,
        )

    def log_task(self, content: str, subtitle: str, title: str | None = None, level: LogLevel = LogLevel.INFO) -> None:
        self.info(
            Panel(
                f"\n[bold]{escape_code_brackets(content)}\n",
                title="[bold]New run" + (f" - {title}" if title else ""),
                subtitle=subtitle,
                border_style=YELLOW_HEX,
                subtitle_align="left",
            ),
            level=level,
        )

    def log_messages(self, messages: list[dict], level: LogLevel = LogLevel.DEBUG) -> None:
        messages_as_string = "\n".join([json.dumps(dict(message), indent=4, ensure_ascii=False) for message in messages])
        self.info(
            Syntax(
                messages_as_string,
                lexer="markdown",
                theme="github-dark",
                word_wrap=True,
            ),
            level=level,
        )

    def visualize_agent_tree(self, agent):
        def create_tools_section(tools_dict):
            table = Table(show_header=True, header_style="bold")
            table.add_column("Name", style="#1E90FF")
            table.add_column("Description")
            table.add_column("Arguments")

            for name, tool in tools_dict.items():
                args = [
                    f"{arg_name} (`{info.get('type', 'Any')}`{', optional' if info.get('optional') else ''}): {info.get('description', '')}"
                    for arg_name, info in getattr(tool, "inputs", {}).items()
                ]
                table.add_row(name, getattr(tool, "description", str(tool)), "\n".join(args))

            return Group("🛠️ [italic #1E90FF]Tools:[/italic #1E90FF]", table)

        def get_agent_headline(agent, name: str | None = None):
            name_headline = f"{name} | " if name else ""
            return f"[bold {YELLOW_HEX}]{name_headline}{agent.__class__.__name__} | {agent.model.model_id}"

        def build_agent_tree(parent_tree, agent_obj):
            """Recursively builds the agent tree."""
            parent_tree.add(create_tools_section(agent_obj.tools))

            if agent_obj.managed_agents:
                agents_branch = parent_tree.add("🤖 [italic #1E90FF]Managed agents:")
                for name, managed_agent in agent_obj.managed_agents.items():
                    agent_tree = agents_branch.add(get_agent_headline(managed_agent, name))
                    if managed_agent.__class__.__name__ == "CodeAgent":
                        agent_tree.add(
                            f"✅ [italic #1E90FF]Authorized imports:[/italic #1E90FF] {managed_agent.additional_authorized_imports}"
                        )
                    agent_tree.add(f"📝 [italic #1E90FF]Description:[/italic #1E90FF] {managed_agent.description}")
                    build_agent_tree(agent_tree, managed_agent)

        main_tree = Tree(get_agent_headline(agent))
        if agent.__class__.__name__ == "CodeAgent":
            main_tree.add(
                f"✅ [italic #1E90FF]Authorized imports:[/italic #1E90FF] {agent.additional_authorized_imports}"
            )
        build_agent_tree(main_tree, agent)
        self.console.print(main_tree)

logger = AgentLogger()
```
src/utils/__init__.py
CHANGED
```diff
@@ -4,5 +4,5 @@ from .singleton import Singleton
 __all__ = [
     "get_project_root",
     "assemble_project_path",
-    "Singleton"
+    "Singleton",
 ]
```
src/utils/hf_utils.py
DELETED
File without changes
test_evaluation.py
ADDED
@@ -0,0 +1,181 @@
```python
#!/usr/bin/env python3
"""
Test script: Verify that the run_evaluation function works correctly
"""

import asyncio
import os
import sys
from pathlib import Path
from dotenv import load_dotenv
import argparse
from mmengine import DictAction

# Load environment variables
load_dotenv(verbose=True)

# Set the project root path
root = str(Path(__file__).parent)
sys.path.append(root)

from src.database import db
from src.logger import logger
from src.config import config
from src.agents.evaluator import run_evaluation


def parse_args():
    """Parse command line arguments"""
    parser = argparse.ArgumentParser(description='main')
    parser.add_argument("--config", default=os.path.join(root, "configs", "paper_agent.py"), help="config file path")

    parser.add_argument(
        '--cfg-options',
        nargs='+',
        action=DictAction,
        help='override some settings in the used config, the key-value pair '
             'in xxx=yyy format will be merged into config file. If the value to '
             'be overwritten is a list, it should be like key="[a,b]" or key=a,b '
             'It also allows nested list/tuple values, e.g. key="[(a,b),(c,d)]" '
             'Note that the quotation marks are necessary and that no white space '
             'is allowed.')
    args = parser.parse_args()
    return args


async def test_evaluation():
    """Test evaluation functionality"""
    print("=== Starting Evaluation Test ===")

    # Test parameters
    test_arxiv_id = "2508.09889"  # Use existing paper in database
    test_pdf_url = f"https://arxiv.org/pdf/{test_arxiv_id}.pdf"

    print(f"Test paper ID: {test_arxiv_id}")
    print(f"PDF URL: {test_pdf_url}")

    # Check API key
    api_key = os.getenv("ANTHROPIC_API_KEY")
    if not api_key:
        print("❌ Error: ANTHROPIC_API_KEY environment variable not found")
        return False

    print(f"✅ API key is set: {api_key[:20]}...")

    try:
        # Check if paper exists in database
        paper = db.get_paper(test_arxiv_id)
        if paper:
            print(f"✅ Paper found in database: {paper['title']}")
        else:
            print("⚠️ Paper not in database, creating new record")
            # Insert test paper
            db.insert_paper(
                arxiv_id=test_arxiv_id,
                title="Test Paper for Evaluation",
                authors="Test Author",
                abstract="This is a test paper for evaluation.",
                categories="cs.AI",
                published_date="2024-08-01"
            )
            print("✅ Test paper inserted into database")

        print("\n=== Starting Evaluation ===")

        # Run evaluation
        result = await run_evaluation(
            pdf_path=test_pdf_url,
            arxiv_id=test_arxiv_id,
            api_key=api_key
        )

        print("\n=== Evaluation Results ===")
        print(f"Result length: {len(result)} characters")
        print(f"First 500 characters: {result[:500]}...")

        # Check if result contains expected content
        if "AI Automation Assessment" in result or "Executive Summary" in result:
            print("✅ Evaluation result contains expected content")
        else:
            print("⚠️ Evaluation result may be incomplete")

        # Check evaluation status in database
        updated_paper = db.get_paper(test_arxiv_id)
        if updated_paper and updated_paper.get('is_evaluated'):
            print("✅ Evaluation saved to database")
            print(f"Evaluation score: {updated_paper.get('evaluation_score')}")
            print(f"Evaluation tags: {updated_paper.get('evaluation_tags')}")
        else:
            print("❌ Evaluation not saved to database")

        return True

    except Exception as e:
        print(f"❌ Error during evaluation: {str(e)}")
        import traceback
        traceback.print_exc()
        return False


async def test_database_operations():
    """Test database operations"""
    print("\n=== Testing Database Operations ===")

    try:
        # Test getting paper
        paper = db.get_paper("2508.09889")
        if paper:
            print(f"✅ Database connection OK, found paper: {paper['title']}")
        else:
            print("⚠️ Test paper not found in database")

        # Test getting paper statistics
        stats = db.get_papers_count()
        print(f"✅ Paper statistics: Total={stats['total']}, Evaluated={stats['evaluated']}, Unevaluated={stats['unevaluated']}")

        return True

    except Exception as e:
        print(f"❌ Database operation error: {str(e)}")
        return False


async def main():
    """Main test function"""
    print("🚀 Starting Evaluation System Test")

    # Parse command line arguments
    args = parse_args()

    # Initialize configuration
    config.init_config(args.config, args)

    # Initialize logger
    logger.init_logger(config=config)
    logger.info(f"| Logger initialized at: {config.log_path}")
    logger.info(f"| Config:\n{config.pretty_text}")

    # Initialize database
    db.init_db(config=config)
    logger.info(f"| Database initialized at: {config.db_path}")

    print(f"✅ Database initialized: {config.db_path}")

    # Test database operations
    db_success = await test_database_operations()

    # Test evaluation functionality
    eval_success = await test_evaluation()

    print("\n=== Test Summary ===")
    print(f"Database operations: {'✅ Success' if db_success else '❌ Failed'}")
    print(f"Evaluation functionality: {'✅ Success' if eval_success else '❌ Failed'}")

    if db_success and eval_success:
        print("🎉 All tests passed!")
    else:
        print("⚠️ Some tests failed, please check error messages")


if __name__ == "__main__":
    asyncio.run(main())
```
workdir/2508.05629.json
DELETED
@@ -1,57 +0,0 @@
```json
{
"dimensions": "{\n \"task_formalization\": {\n \"score\": 3,\n \"analysis\": \"The research task is highly formalized with clear mathematical objectives. The authors present a mathematical framework for analyzing Supervised Fine-Tuning (SFT) through the lens of Reinforcement Learning (RL). They provide precise mathematical formulations for both SFT and RL objectives, establish a formal equivalence between SFT gradients and policy gradients, and derive their proposed Dynamic Fine-Tuning (DFT) approach with well-defined equations. The paper includes rigorous mathematical proofs and derivations, particularly in Section 3 where they rewrite SFT gradients as policy gradients via importance sampling. While the mathematical formulation is comprehensive, there are some minor implementation details and hyperparameter considerations that leave room for case-by-case adjustments, preventing a perfect score.\"\n },\n \"data_resource_availability\": {\n \"score\": 3,\n \"analysis\": \"The research relies on publicly available datasets and models for experimentation. The authors use established benchmarks including NuminaMath, Math500, Minerva Math, Olympiad Bench, AIME 2024, and AMC 2023. They experiment with multiple open-source models including Qwen2.5-Math, LLaMA-3.1/3.2, and DeepSeekMath. The paper mentions that code will be made publicly available on GitHub. The implementation builds upon existing frameworks (verl, ms-swift) that are accessible. The experimental setup is well-documented, allowing for reproducibility. The primary limitation is that some of the most challenging mathematical benchmarks may have limited sample sizes, and the authors acknowledge not yet testing on a broader range of domains beyond mathematics or with larger models (13B+).\"\n },\n \"input_output_complexity\": {\n \"score\": 2,\n \"analysis\": \"The input-output complexity is moderate. The research deals with complex mathematical reasoning tasks that require processing detailed problem statements and generating multi-step solutions through chain-of-thought reasoning. These outputs can be lengthy and must follow specific mathematical reasoning patterns. However, the structure of the inputs and outputs is relatively well-defined within the domain of mathematical problem-solving. The paper focuses on a specific modification to the training process (adding one line of code) that applies across different model architectures and data types. The implementation requires understanding of token-level probabilities and loss functions, which adds some complexity but is manageable within standard language model frameworks. The method itself is designed to handle complex reasoning tasks, but its implementation is streamlined.\"\n },\n \"real_world_interaction\": {\n \"score\": 4,\n \"analysis\": \"The approach requires minimal real-world interaction. The entire process can be conducted offline with existing datasets and models. Both training and evaluation are fully computational processes that don't require human feedback loops or environmental interaction. The proposed DFT method specifically targets improvements in the standard SFT setting without requiring reward models, preference data, or verification signals that might necessitate additional human feedback. Even in the 'offline RL setting' experiment, the authors use automatically generated samples and verification rather than interactive feedback. 
The method is designed to work with static datasets and can be deployed in a fully offline manner without any ongoing human or environmental interaction.\"\n },\n \"existing_ai_coverage\": {\n \"score\": 3,\n \"coverage_pct_estimate\": 75,\n \"analysis\": \"A significant portion of the research task is already covered by existing AI tools and models. The core components include mathematical analysis of training objectives, implementation of fine-tuning techniques, experimental evaluation, and visualization of results. Current frameworks like PyTorch, Hugging Face Transformers, and specialized fine-tuning libraries (mentioned verl and ms-swift) provide comprehensive support for implementing various fine-tuning approaches. The mathematical derivation requires human insight, but computational validation of these derivations can be assisted by AI. Existing LLMs can help with code implementation, experimental design, and literature review. The most novel aspect - the insight that SFT can be reframed as RL with an implicit reward structure - required human originality, but once identified, the implementation of the proposed solution (DFT) is straightforward. Most of the experimental pipeline, from data processing to evaluation metrics calculation, can be handled by existing AI tools.\",\n \"tools_models\": [\"PyTorch\", \"Hugging Face Transformers\", \"verl framework\", \"ms-swift framework\", \"Mathematical computation libraries\", \"Data visualization tools\", \"LLMs for code generation\", \"Experimental analysis tools\"]\n },\n \"automation_barriers\": {\n \"analysis\": \"Several barriers limit full automation of this research:\\n\\n1. Theoretical insight: The core insight of connecting SFT and RL through mathematical analysis required creative human reasoning. Identifying the problematic inverse probability weighting in SFT was a novel insight that current AI systems would struggle to generate independently.\\n\\n2. Research direction determination: Choosing to focus on improving SFT rather than developing yet another hybrid SFT-RL method required understanding of research gaps and strategic thinking about valuable contributions to the field.\\n\\n3. Interpretation of results: The analysis of token probability distributions and what they reveal about the learning dynamics of different methods requires domain expertise and causal reasoning that remains challenging for AI.\\n\\n4. Experimental design decisions: Selecting appropriate benchmarks, models, and evaluation methods to comprehensively test the hypothesis required research experience and domain knowledge.\\n\\n5. Limitations analysis: Identifying the boundaries of the approach and potential future work directions demands critical thinking about when and why the approach might fail.\\n\\n6. Interdisciplinary connection: Bridging supervised learning and reinforcement learning perspectives requires deep understanding of both fields and the ability to see non-obvious connections between different learning paradigms.\"\n },\n \"human_originality\": {\n \"score\": 3,\n \"analysis\": \"The research demonstrates clear novelty in its core contribution. The key insight - reinterpreting SFT gradients as policy gradients with an implicit, problematic reward structure - represents an original theoretical connection between two well-established paradigms (SFT and RL). The authors' proposed solution (DFT) is elegantly simple but non-obvious, requiring just one line of code change that produces significant empirical improvements. 
The mathematical derivation that leads to this insight shows creative thinking in how the authors connect supervised learning to reinforcement learning through importance sampling. The paper also presents a compelling analysis of why this approach works through token probability distribution analysis. While building on established foundations in both supervised learning and reinforcement learning, the specific connection identified and the proposed solution represent a meaningful advance rather than an incremental improvement. The approach inverts conventional wisdom by showing that multiplying the loss by the token probability (opposite of focal loss) improves generalization, which is a novel insight in the era of large language models.\"\n },\n \"safety_ethics\": {\n \"score\": 3,\n \"analysis\": \"The safety and ethical considerations for this research are generally manageable. The proposed method aims to improve the generalization capabilities of language models in mathematical reasoning tasks, which has minimal direct negative implications. The approach does not introduce new safety risks beyond those already present in language model fine-tuning. Failure cases would primarily result in incorrect mathematical reasoning rather than harmful outputs. The research does not involve sensitive data or privacy concerns, as it uses publicly available mathematical benchmarks. The method actually improves robustness and reduces overfitting, potentially making models more reliable. The authors acknowledge limitations of their work and the need for further evaluation across different domains. There is limited discussion of broader societal impacts, though the focus on mathematical reasoning makes immediate misuse scenarios less likely than for general-purpose language models. The method does not significantly increase computational requirements, avoiding major environmental concerns associated with more compute-intensive approaches.\"\n },\n \"societal_economic_impact\": {\n \"analysis\": \"The societal and economic implications of this research are predominantly positive:\\n\\n1. Research efficiency: The proposed DFT method offers a more efficient alternative to complex RL approaches, potentially reducing computational resources needed for effective model fine-tuning. This could democratize access to high-quality fine-tuning techniques for researchers with limited computational budgets.\\n\\n2. Educational applications: Improved mathematical reasoning capabilities in language models could enhance educational tools, making AI tutoring more effective and accessible for mathematics education.\\n\\n3. Scientific advancement: Better generalization in mathematical reasoning could accelerate scientific research that relies on mathematical problem-solving, benefiting fields from physics to economics.\\n\\n4. Resource optimization: The method's improved sample efficiency could reduce the energy consumption and carbon footprint associated with training large language models, contributing to more sustainable AI development.\\n\\n5. Algorithmic insights: The theoretical connections established between SFT and RL could inform future developments in machine learning algorithms beyond the specific application presented.\\n\\n6. 
Economic effects: While the method could potentially reduce the need for some specialized ML engineers focused on complex RL implementations, it would likely create more value through broader adoption of effective fine-tuning techniques.\\n\\nPotential negative impacts are limited but could include further automation of mathematical reasoning tasks currently performed by humans, though such displacement effects would likely be gradual and limited to narrow domains initially.\"\n },\n \"technical_maturity_needed\": {\n \"score\": 3,\n \"analysis\": \"The proposed DFT method is relatively close to practical implementation, requiring only incremental advances rather than fundamental breakthroughs. The core implementation is extremely simple - just one line of code change to the standard SFT loss function. The mathematical foundation is well-established, drawing on existing concepts from both supervised learning and reinforcement learning. The authors have already demonstrated the approach working across multiple model architectures (Qwen, LLaMA, DeepSeekMath) and various sizes (1.5B to 8B parameters). The primary technical developments needed are: (1) testing on a broader range of tasks beyond mathematical reasoning, (2) scaling to larger models (13B+), (3) validating on multimodal tasks, and (4) further analysis of when and why the method might underperform. None of these require fundamental breakthroughs, but rather systematic experimentation and engineering refinements. The authors already promise to release their code, further reducing implementation barriers. The simplicity of the approach makes it immediately applicable for practitioners with standard ML expertise.\"\n },\n \"three_year_feasibility\": {\n \"probability_pct\": 90,\n \"analysis\": \"The probability of full automation of this research within three years is very high (90%). Several factors support this assessment:\\n\\n1. Implementation simplicity: The core DFT method requires just one line of code change to standard SFT, making technical implementation straightforward.\\n\\n2. Mathematical foundation: The theoretical analysis connecting SFT and RL is now established, providing a framework future AI systems can leverage.\\n\\n3. Experimental pipeline: The entire experimental workflow - from data preparation to model training to evaluation - uses standard components that are already well-supported by existing frameworks.\\n\\n4. Limited domain expertise: While mathematical reasoning was the focus of this paper, the method itself is domain-agnostic and could be applied to various tasks without specialized knowledge.\\n\\n5. Current AI capabilities: Today's most advanced AI systems can already perform many components of this research, including implementing training procedures, running experiments, and analyzing results.\\n\\n6. Rapid progress in AI for science: The pace of advancement in AI for scientific discovery is accelerating, with systems becoming increasingly capable of identifying patterns and relationships in scientific data.\\n\\nThe main limiting factors are the initial creative insight connecting SFT and RL in this specific way, and the identification of the inverse probability weighting issue. However, with this insight now published, future AI systems could automate similar investigations across other training paradigms. 
Within three years, it's highly likely that AI systems will be able to propose, implement, and evaluate novel training approaches comparable to DFT.\"\n },\n \"overall_automatability\": {\n \"score\": 3,\n \"analysis\": \"The overall automatability of this research is high, though not yet complete. The paper presents a clear case where most components could be automated with current or near-future AI systems. The experimental implementation, evaluation, and analysis portions follow standard practices in machine learning research that are increasingly being automated. The mathematical derivations, while requiring some sophistication, involve manipulations that advanced reasoning systems could potentially perform. Where human contribution remains most essential is in the initial framing of the research question - specifically, the insight to view SFT through the lens of RL and identify the problematic implicit reward structure. This creative connection between different learning paradigms represents the kind of cross-domain insight that remains challenging for current AI systems. Once this insight was established, the proposed solution (DFT) follows quite naturally and could likely be discovered through systematic exploration by an AI system. The paper's experimental design, implementation, and analysis of results could largely be automated with existing technologies. Given the rapid advances in AI for scientific discovery, particularly in mathematics and computer science, it's reasonable to expect that similar research contributions could be substantially automated within the next 2-3 years, though the most creative insights may still benefit from human intuition.\"\n }\n},",
"executive_summary": "This paper introduces Dynamic Fine-Tuning (DFT), a simple yet effective improvement to Supervised Fine-Tuning (SFT) for large language models that significantly enhances generalization capabilities. The authors provide a mathematical analysis revealing that standard SFT implicitly encodes a problematic reward structure inversely proportional to the model's confidence, leading to unstable optimization and poor generalization. Their solution—multiplying the SFT loss by the token probability—requires just one line of code change yet substantially outperforms standard SFT across multiple mathematical reasoning benchmarks and model architectures. The approach bridges supervised and reinforcement learning paradigms, offering the generalization benefits of RL without its complexity. This work represents a notable advance in fine-tuning methodology with immediate practical applications, combining theoretical insight with empirical validation. The research is highly automatable in most aspects, though the key theoretical insight connecting SFT and RL required human creativity that remains challenging for current AI systems.",
"limitations_uncertainties": [
  "The evaluation is limited to mathematical reasoning tasks and hasn't been validated on other domains like code generation or general question answering",
  "Experiments are limited to models up to 7B parameters, leaving questions about scalability to larger models (13B+)",
  "The approach hasn't been tested on multimodal tasks to confirm its generality across different modalities",
  "Limited analysis of potential negative cases where DFT might underperform compared to standard SFT",
  "The research focuses on a specific modification to the training objective without exploring potential interactions with other training hyperparameters",
  "The theoretical analysis assumes certain properties of the token distributions that may not hold universally across all domains",
  "Limited discussion of computational efficiency implications for very large models",
  "The assessment of existing AI coverage may underestimate the creative insights needed to formulate the theoretical connection between SFT and RL"
],
"metadata": {
  "assessed_at": "2025-08-08",
  "model": "claude-4-sonnet",
  "version": "1.0",
  "paper_path": "https://huggingface.co/papers/2508.05629"
},
"recommendations": {
  "for_researchers": [
    "Extend DFT evaluation to non-mathematical domains such as code generation, common sense reasoning, and general question-answering tasks",
    "Test the approach with larger models (13B+ parameters) to verify scalability",
    "Explore the application of DFT to multimodal tasks to confirm cross-modality effectiveness",
    "Conduct ablation studies on the interaction between DFT and other training hyperparameters like learning rate schedules",
    "Investigate potential hybrid approaches combining DFT with selective aspects of RL methods",
    "Analyze the token distribution patterns across different domains to better understand when and why DFT provides advantages"
  ],
  "for_institutions": [
    "Invest in research that bridges theoretical understanding between different learning paradigms, as such connections can yield simple yet powerful improvements",
    "Support comparative studies of fine-tuning approaches that consider both performance and computational efficiency",
    "Prioritize funding for research that improves the efficiency of existing methods rather than focusing exclusively on novel architectures",
    "Develop standardized benchmarks for evaluating generalization capabilities across diverse tasks beyond established domains",
    "Encourage interdisciplinary collaboration between ML researchers with expertise in supervised learning and reinforcement learning"
  ],
  "for_ai_development": [
    "Implement DFT as a standard option in fine-tuning frameworks and libraries for large language models",
    "Develop automated systems that can explore mathematical connections between different learning objectives",
    "Create tools that visualize and analyze token probability distributions during training to better understand model learning dynamics",
    "Focus on improving mathematical reasoning capabilities in foundation models to enable more sophisticated theoretical analysis",
    "Invest in systems that can automatically identify potential efficiency improvements in existing training methodologies",
    "Develop automated experimental pipelines that can systematically evaluate novel training approaches across diverse tasks and model architectures"
  ]
},
"scorecard": {
  "task_formalization": 3,
  "data_resource_availability": 3,
  "input_output_complexity": 2,
  "real_world_interaction": 4,
  "existing_ai_coverage": 3,
  "human_originality": 3,
  "safety_ethics": 3,
  "technical_maturity_needed": 3,
  "three_year_feasibility_pct": 90,
  "overall_automatability": 3
}
}
```
papers_cache.db → workdir/paper_agent/papers_cache.db
RENAMED
```diff
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:7c1fc0b499832f97bf8288fee40f5dcf5207b0fea8b5ae970958bcc7b2e109bf
+size 3219456
```