Update README.md
README.md
CHANGED
@@ -7,4 +7,29 @@ sdk: static
 pinned: false
 ---
 
-
+## 📰 Impresso – Media Monitoring of the Past
+
+**Interdisciplinary ML‑powered platform for exploring historical media.**
+
+- **📚 Corpus**: Aggregates an unprecedented multilingual archive of newspapers and radio across time and borders ([impresso-project.ch](https://impresso-project.ch/?utm_source=chatgpt.com)).
+- **🎯 Vision**: Enables a semantically enriched workflow for representation, exploration, and historical research across modalities such as print and audio ([impresso-project.ch](https://impresso-project.ch/?utm_source=chatgpt.com)).
+- **👥 Two consecutive phases**:
+  - **Phase I (2017–2020)**: Funded by SNSF Sinergia (CR-SII5_173719); built a scalable processing architecture and a web app with semantic search for Swiss and Luxembourgish newspapers ([impresso-project.ch](https://impresso-project.ch/project/?utm_source=chatgpt.com)).
+    - **PIs**: Frédéric Kaplan & Maud Ehrmann (EPFL); Martin Volk & Simon Clematide (UZH); Andreas Fickers & Marten Düring (Uni Luxembourg).
+    - **Citation**: _“Impresso. Media Monitoring of the Past.” Supported by the Swiss National Science Foundation under grant CR-SII5_173719, 2019._
+  - **Phase II (2023–2027)**: Funded by SNSF (213585) & Luxembourg FNR (17498891); extends the corpus to radio, adds cross-national and multilingual layers, develops the Datalab and novel interfaces, and explores ‘influences’ in media ([impresso-project.ch](https://impresso-project.ch/project/?utm_source=chatgpt.com)).
+    - **PIs**: Maud Ehrmann (EPFL), Simon Clematide (UZH), Raphaëlle Ruppen Coutaz (UNIL), Marten Düring (Uni Luxembourg).
+    - **Citation**: _“Impresso … II. Beyond Borders: Connecting Historical Newspapers and Radio” funded by SNSF 213585 & FNR 17498891._
+- **💡 Outputs**:
+  - Powerful Web App & Datalab platforms for exploratory analysis ([impresso-project.ch](https://impresso-project.ch/?utm_source=chatgpt.com)).
+  - NLP resources: OCR quality assessment, NER, entity linking, topic models ([huggingface.co](https://huggingface.co/impresso-project?utm_source=chatgpt.com)).
+  - Historical insights under the theme of media “influences”.
+- **🏛️ Partners**:
+  - Institutions: EPFL DHLAB; UZH Computational Linguistics; UNIL History Dept; C²DH Luxembourg.
+  - Archives/broadcasters: Swiss and Luxembourg national libraries, ONB, SBB, BL, BnF, BBC, INA, NISV, NZZ, Le Temps, etc. ([impresso-project.ch](https://impresso-project.ch/?utm_source=chatgpt.com)).
+- **🔗 GitHub**: [impresso-project](https://github.com/impresso) – codebases for Web App, pipelines, Datalab notebooks ([github.com](https://github.com/impresso?utm_source=chatgpt.com)).
+- **🧑‍🤝‍🧑 Hugging Face org** hosts multilingual NER, NEL, OCR‑quality models, and Spaces for entity linking/light ([huggingface.co](https://huggingface.co/impresso-project?utm_source=chatgpt.com)).
+
+---
+
+> _Impresso is an open research ecosystem at the intersection of NLP, design, and computational history, enriching historical media for new forms of exploratory inquiry across Europe._