simon-clmtd committed on
Commit d4056bf · verified · 1 Parent(s): c7139b2

Update README.md

Files changed (1)
  1. README.md +6 -19
README.md CHANGED
@@ -7,28 +7,15 @@ sdk: static
  pinned: false
  ---

- ## 📰 Impresso – Media Monitoring of the Past
-
  **Interdisciplinary ML‑powered platform for exploring historical media.**

- - **📚 Corpus**: Aggregates an unprecedented multilingual archive of newspapers and radio across time and borders [oai_citation:0‡impresso-project.ch](https://impresso-project.ch/?utm_source=chatgpt.com).
- - **🎯 Vision**: Enables a revolutionary semantic-enriched workflow for representation, exploration, and historical research across modalities like print and audio [oai_citation:1‡impresso-project.ch](https://impresso-project.ch/?utm_source=chatgpt.com).
- - **👥 Two consecutive phases**:
- - **Phase I (2017–2020)**: Funded by SNSF Sinergia (CR-SII5_173719); built scalable processing architecture + web app with semantic search for Swiss and Luxembourgish newspapers [oai_citation:2‡impresso-project.ch](https://impresso-project.ch/project/?utm_source=chatgpt.com).
- - **PIs**: Frédéric Kaplan & Maud Ehrmann (EPFL); Martin Volk & Simon Clematide (UZH); Andreas Fickers & Marten Düring (Uni Luxembourg).
- - **Citation**: _“Impresso. Media Monitoring of the Past.” Supported by the Swiss NSNF under grant CR‑SII5_173719, 2019._
- - **Phase II (2023–2027)**: Funded by SNSF (213585) & Luxembourg FNR (17498891); extends corpus to radio, adds cross-national and multilingual layers, develops Datalab + novel interfaces, explores ‘influences’ in media [oai_citation:3‡impresso-project.ch](https://impresso-project.ch/project/?utm_source=chatgpt.com).
- - **PIs**: Maud Ehrmann (EPFL), Simon Clematide (UZH), Raphaëlle Ruppen Coutaz (UNIL), Marten Düring (Uni Luxembourg).
- - **Citation**: _“Impresso … II. Beyond Borders: Connecting Historical Newspapers and Radio” funded by SNSF 213585 & FNR 17498891._
+ - **📚 Corpus**: Aggregates an unprecedented multilingual archive of newspapers and radio across time and borders.
+ - **🎯 Vision**: Enables a revolutionary semantic-enriched workflow for representation, exploration, and historical research across modalities like print and audio.
  - **💡 Outputs**:
- - Powerful Web App & Datalab platforms for exploratory analysis [oai_citation:4‡impresso-project.ch](https://impresso-project.ch/?utm_source=chatgpt.com).
- - NLP resources: OCR quality assessment, NER, entity linking, topic models [oai_citation:5‡huggingface.co](https://huggingface.co/impresso-project?utm_source=chatgpt.com).
- - Historical insights under the theme of media influences”.
- - **🏛️ Partners**:
- - Institutions: EPFL DHLAB; UZH Computational Linguistics; UNIL History Dept; C²DH Luxembourg.
- - Archives/broadcasters: Swiss and Luxembourg national libraries, ONB, SBB, BL, BnF, BBC, INA, NISV, NZZ, Le Temps, etc. [oai_citation:6‡impresso-project.ch](https://impresso-project.ch/?utm_source=chatgpt.com).
- - **🔗 GitHub**: [impresso-project](https://github.com/impresso) – codebases for Web App, pipelines, Datalab notebooks [oai_citation:7‡github.com](https://github.com/impresso?utm_source=chatgpt.com).
- - **🧑‍🤝‍🧑 Hugging Face org** hosts multilingual NER, NEL, OCR‑quality models, and Spaces for entity linking/light [oai_citation:8‡huggingface.co](https://huggingface.co/impresso-project?utm_source=chatgpt.com).
+ - Web App & Datalab platforms for exploratory analysis
+ - NLP resources: Language identification, OCR quality assessment, NER, Named Entity Linking, topic models
+ - Historical insights under the theme of media influences.
+ - **🧑‍🤝‍🧑 Hugging Face org** hosts multilingual NER, NEL, OCR‑quality assessment models, and Spaces for named entity processing

  ---