Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -7,28 +7,15 @@ sdk: static
|
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
-
## 📰 Impresso – Media Monitoring of the Past
|
11 |
-
|
12 |
**Interdisciplinary ML‑powered platform for exploring historical media.**
|
13 |
|
14 |
-
- **📚 Corpus**: Aggregates an unprecedented multilingual archive of newspapers and radio across time and borders
|
15 |
-
- **🎯 Vision**: Enables a revolutionary semantic-enriched workflow for representation, exploration, and historical research across modalities like print and audio
|
16 |
-
- **👥 Two consecutive phases**:
|
17 |
-
- **Phase I (2017–2020)**: Funded by SNSF Sinergia (CR-SII5_173719); built scalable processing architecture + web app with semantic search for Swiss and Luxembourgish newspapers [oai_citation:2‡impresso-project.ch](https://impresso-project.ch/project/?utm_source=chatgpt.com).
|
18 |
-
- **PIs**: Frédéric Kaplan & Maud Ehrmann (EPFL); Martin Volk & Simon Clematide (UZH); Andreas Fickers & Marten Düring (Uni Luxembourg).
|
19 |
-
- **Citation**: _“Impresso. Media Monitoring of the Past.” Supported by the Swiss NSNF under grant CR‑SII5_173719, 2019._
|
20 |
-
- **Phase II (2023–2027)**: Funded by SNSF (213585) & Luxembourg FNR (17498891); extends corpus to radio, adds cross-national and multilingual layers, develops Datalab + novel interfaces, explores ‘influences’ in media [oai_citation:3‡impresso-project.ch](https://impresso-project.ch/project/?utm_source=chatgpt.com).
|
21 |
-
- **PIs**: Maud Ehrmann (EPFL), Simon Clematide (UZH), Raphaëlle Ruppen Coutaz (UNIL), Marten Düring (Uni Luxembourg).
|
22 |
-
- **Citation**: _“Impresso … II. Beyond Borders: Connecting Historical Newspapers and Radio” funded by SNSF 213585 & FNR 17498891._
|
23 |
- **💡 Outputs**:
|
24 |
-
-
|
25 |
-
- NLP resources: OCR quality assessment, NER,
|
26 |
-
- Historical insights under the theme of media
|
27 |
-
-
|
28 |
-
- Institutions: EPFL DHLAB; UZH Computational Linguistics; UNIL History Dept; C²DH Luxembourg.
|
29 |
-
- Archives/broadcasters: Swiss and Luxembourg national libraries, ONB, SBB, BL, BnF, BBC, INA, NISV, NZZ, Le Temps, etc. [oai_citation:6‡impresso-project.ch](https://impresso-project.ch/?utm_source=chatgpt.com).
|
30 |
-
- **🔗 GitHub**: [impresso-project](https://github.com/impresso) – codebases for Web App, pipelines, Datalab notebooks [oai_citation:7‡github.com](https://github.com/impresso?utm_source=chatgpt.com).
|
31 |
-
- **🧑🤝🧑 Hugging Face org** hosts multilingual NER, NEL, OCR‑quality models, and Spaces for entity linking/light [oai_citation:8‡huggingface.co](https://huggingface.co/impresso-project?utm_source=chatgpt.com).
|
32 |
|
33 |
---
|
34 |
|
|
|
7 |
pinned: false
|
8 |
---
|
9 |
|
|
|
|
|
10 |
**Interdisciplinary ML‑powered platform for exploring historical media.**
|
11 |
|
12 |
+
- **📚 Corpus**: Aggregates an unprecedented multilingual archive of newspapers and radio across time and borders .
|
13 |
+
- **🎯 Vision**: Enables a revolutionary semantic-enriched workflow for representation, exploration, and historical research across modalities like print and audio.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
14 |
- **💡 Outputs**:
|
15 |
+
- Web App & Datalab platforms for exploratory analysis
|
16 |
+
- NLP resources: Language identificatino, OCR quality assessment, NER, Named Entity Linking, topic models
|
17 |
+
- Historical insights under the theme of media influences.
|
18 |
+
- **🧑🤝🧑 Hugging Face org** hosts multilingual NER, NEL, OCR‑quality assessment models, and Spaces for named entity processing
|
|
|
|
|
|
|
|
|
19 |
|
20 |
---
|
21 |
|