NB: this post was last updated 16 June 2024. In general, I add new sites but don't remove old sites that are no longer live. This post is now supplemented with another on National approaches to crowdsourcing / citizen science. I've also shared a 2015 list of 'participatory digital heritage sites' that includes many crowdsourcing sites. Contact me via my main website contact page to suggest a site.
It's all too easy to forget that there are international crowdsourcing projects in languages other than English so I thought I'd collect some projects related to cultural heritage, history and science here (following my definition of crowdsourcing in cultural heritage as 'asking the public to help with tasks that contribute to a shared, significant goal or research interest related to cultural heritage collections or knowledge'). This list is drawn from my PhD research, but this is a fast-moving field and I was focusing on early modern England, so inevitably this list will be missing loads of examples. Please suggest links to help people discover new projects! Also, I'm often taking my best guess at the correct translation for terms, so please correct me if I've misunderstood.
If you're interested in crowdsourcing in cultural heritage, my edited volume has chapters with lessons learnt from a range of projects.
- AfroCrowd is 'an outreach initiative and Wikimedia usergroup which seeks to increase awareness of the Wikimedia and free knowledge, culture, and software movements among potential editors of African descent' with links to Haitian, Igbo, Twi, Yoruba, Garifuna, French, Spanish Wikipedia and more
- Moravian Lives offers text transcription in English, German and Swedish. Thanks @KatherineFaull for sharing!
- DigiTalkoot has been and gone (launched February 2011, closed November 2012) but was a great example of tasks that helped correct scanned text for the Historical Newspaper Library of The National Library of Finland.
- The National Library of France was involved in a pilot project called 'Correct' to correct errors in scanned documents. Further information: Josse, Isabelle. La bnF engagée dans un projet de R&D pour la conception de la plateforme Correct (Correction et enrichissement collaboratifs de textes). Bulletin des bibliothèques de France. [en ligne], n° 5, 2013, http://bbf.enssib.fr/consulter/bbf-2013-05-0037-008. ISSN 1292-8399
- The French version of WikiSource has lots of books to be transcribed.
- There's a German-language transcription project for the Digitale Edition Nachlasses Franz Brümmer and related Refine!Editor – it looks like it was designed for student participation and that interested people can register to transcribe via the contact page. Via Simone Waidmann, “Erschließung Historischer Bestände Mittels Crowdsourcing: Eine Analyse Ausgewählter Aktueller Projekte,” Perspektive Bibliothek 3, no. 1 (2014): 33–58, http://journals.ub.uni-heidelberg.de/index.php/bibliothek/article/view/14020.
- ARTigo is a German project, with English, French and German-language interfaces. Tag images of artworks through six different games! They also have an active German-language blog.
- Red een Portret ('Save a portrait') from the Amsterdam City Archives – help identify photographs or donate money to support the project
- Ajapaik is an Estonian project asking for help identifying historical images.
- Transcriptorium has several non-English datasets you can review to help train their handwriting recognition software
- Ancient Lives is the site for you if you want to learn the ancient Greek alphabet while transcribing papyri.
- Arthur Schnitzler digital 'is using the Transcribo software to produce a digital transcription and annotation of both typescript and manuscript material'.
- The Bracero History Archive is collecting oral histories in both English and Spanish.
- Cymru1900Wales and Cynefin are both working on Welsh maps and have Welsh and English-language interfaces
- Danish Demographic Database includes transcriptions from volunteers.
- Europeana 1914-1918 and Europeana 1989 are collecting records in many European languages. Wir Waren So Frei is also collecting records about the fall of the Berlin Wall.
- You can index records in many languages on Ancestry's World Archives Project.
- You can also help Improve Google Translate (not really a heritage project but it helps other projects). Similarly, you can help translate the crowdsourcing platform Pybossa into Italian or learn a language while translating text with Duolingo.
- You can 'use the site's comment features to share any supplements (such as citations to published works, transcription of notes not yet addressed, authorial attribution for a particular text, etc.) or remarks on the significance of the manuscript codices and contents' to help Islamic Manuscripts at Michigan.
- Itinera Nova has volunteer transcribers
- You can help correct and annotate records from 'more than 100 European archives' in the Monasterium.Net collaborative archive.
- Help transcribe Dutch natural history collections with Naturalis.
- Transcribe Swedish census records from 1760 with Stockholms Stad.
- Help index Dutch records with Vele Handen.
- The Norwegian site The Digital Inn is for 'sources/documents digitised by institutions, associations or persons outside the organisation of the National Archives of Norway' – a fantastic way of collecting the work that community historians are doing
- The Danish Politiets registerblade – help transcribe records from the city register.
- The Croatian Museum of Broken Relationships
- Dry stone walls crowdsourced
- The British Library's LibCrowds Convert-a-Card card catalogue transcription project has Pinyin and Indonesian cards for transcription
- The National Library of Israel has a crowdsourcing project in Hebrew (via this Pybossa post)
- Sefaria, 'a living library of Jewish texts', 'building a free living library of Jewish texts and their interconnections, in Hebrew and in translation'
- Footprints, Jewish books through time and place
- La Grande Collecte is collecting French records about the First World War
- KB Kranten – Editor, help correct the OCR of digitised newspapers. A collaboration between the Dutch national library and the Meertens Institute
- Edvard Munchs tekster
- Demogen, from the State Archives of Belgium
- The Estonian Digitalgud – digital 'working bees' to collect information about historical images
- Index records about Estonian soldiers in the two World Wars via Eestlased Esimeses maailmasõjas
- An L-Crowd Project: TranscribeJP@Japanese Association for Digital Humanities and Microtasks
- Estoria de Espanna and Estoria de Espanna Project blog, 'aiming to transcribe these 13th-century manuscripts, tagging them (especially for person names and toponyms) so as to reconstruct afterwards biographies and itineraries'.
- Les herbonautes, a French herbarium transcription project led by the Paris Natural History Museum
- Loki is a Finnish project on maritime, coastal history
- Swedish Species Information Centre 'Species Observations' (hat tip Sanja Halling)
- sandbyborg.se, http://www.platsr.se, http://www.crowdculture.se (hat tip Max Valentin)
- Donald Sturgeon @donaldsturgeon said: '@chinesetextproj has an active Wiki section in which Chinese texts are transcribed/OCR post-corrected & annotated: http://ctext.org/wiki.pl?if=en'. Find out more about transcribing, proof-reading, translations, discussion and other forms of contribution on their 'Ways to Help' page.
- Danish Family Search projects include indexing church, school and census records, recording street names and categorising professions.
- Danish National Archives crowdsourcing https://cs.sa.dk/?locale=en and overview page (suggested by Alex Mendes)
- Crowd-correction platform Kokos was 'built to improve the OCR quality of the digitized yearbooks of the Swiss Alpine Club (SAC) from the 19th century', working with French and German
- J. Hocker @julianhocker said, 'take a look at interlinking.bbf.dipf.de, it is a project about a encyclopedia for children that was printed in the 19th century'
- @BenWBrum pointed me to a Chinese character transcription project on the Smithsonian's platform, then @TranscribeSI pointed out some additional Chinese and Japanese-language projects
- VinKo ('Varieties in Contact') is an online questionnaire developed at the Universities of Trento and Verona to gather information about the minority languages and dialects spoken in the area between Innsbruck and Verona
- @BenWBrum's FromThePage platform has French- and Spanish-language pages from the Louisiana Historical Center at the New Orleans Jazz Museum for transcription
- @Lisa_Chupin shared Noms de Vendée, aiming to deepen engagement as well as enrich and correct archival records.
- Judaica DH at Penn @judaicadh shared, 'Scribes of the Cairo Geniza classifies/transcribes Hebrew & Arabic fragments' https://www.scribesofthecairogeniza.org/
- http://openbolshoi.ru/ (Russian)
- Sweden's Digitala forskarsalen ('digital research hall') includes indexing and transcription projects
- The Dutch hetvolk.org set of tools / projects (thanks Enno Meijers)
- 'Maak de Surinaamse slavenregisters openbaar' ('Make the Surinamese slave registers public'), a crowdfunding/crowdsourced transcription project c. 2017 (original instructions page) using hetvolk
- China – the Shengxuanhuai Manuscript Transcription Initiative, aka the Transcribe Sheng project
- The French RECITAL project (REgistres de la Comédie-ITALienne) invites you to 'contribute freely to a participatory transcription experiment on the registers of the Comédie-Italienne in Paris in the 18th century'. 'These unique documents prompt a revision of the state of knowledge on the economics of theatre and the whole cultural history of the 18th century. Your help is invaluable to us' (my translation from the French) https://recital.univ-nantes.fr/
- Kino in der DDR at the University of Erfurt collects information, experience and documents on the cinema history of East Germany. Interview with the project leaders (in German).
- Also possibly other academic German citizen humanities projects
- Nikola Dyordyevich shared the Serbian 'Улице Панчева' / 'Streets of Panchevo' project with old maps, images, etc. Serbian site: https://улицепанчева.срб. English site: https://ulicepanceva.in.rs/en/
- “All Tolstoy in one click” was a Russian language crowdsourcing project that asked volunteers to correct OCR layouts and transcription. Technical details; main site https://readingtolstoy.ru.
- The Czech/German (Bavarian) PhotoStruk, crowdsourcing information related to archival photographs of now-destroyed sites on the Czech-Bavarian border. More in: ‘Geoinformatics and Crowdsourcing in Cultural Heritage: A Tool for Managing Historical Archives’, Agris On-Line Papers in Economics and Informatics, https://doi.org/10.7160/aol.2018.100207.
English-language projects tend to be easier to find, but for completeness:
- UK – irecord.org.uk/ (thanks Rita Singer @_bydbach_)
- USA – archives.gov/citizen-archivist and weather.gov/cle/CWOP (thanks @BuffaloResearch), crowd.loc.gov, transcription.si.edu/
- 'Your project goes here' – what have I missed?