documentcloud

1 Project

DockIns: Machine Learning on Deadline for Journalists

★ Featured
As journalists dealing with data and document sets, we find that the most interesting information is usually hidden in large, unstructured, and incomplete sets of documents. This is especially true of information in public contracts: what the government is buying, how much money is being spent, and who the suppliers are. To answer these questions, four media organizations (La Nacion, CLIP, Ojo Público, and MuckRock) joined forces under the JournalismAI Collab and experimented with different machine learning tools and techniques to build a platform that helps investigative reporters process unstructured documents and extract useful insights.

47 Articles

Apply for funding to help analyze, publish and preserve the world's most important documents

Access to reliable information powers civic health and strong democracy, whether in shaping governmental response to global events or helping communities invest in a better tomorrow by understanding the impacts of budget and policy choices.

We want to fund your projects to analyze, publish and preserve the documents needed for an informed world.

Release Notes: Keep an eye on your favorite agencies with DocumentCloud’s new automated scraping and alerts

Introducing DocumentCloud’s latest Add-On: A simplified scraper that, with just a few clicks, monitors the website of your choosing, automatically downloads and indexes any newly uploaded documents and then alerts you if there is something of interest. Read on for more on how it works plus other recent platform updates.

FFDW and MuckRock collaborate to bring DocumentCloud to the decentralized web

Filecoin Foundation for the Decentralized Web (FFDW) has announced a major award to MuckRock, furthering our vision of being the public’s engine for understanding our world. FFDW and MuckRock are teaming up to integrate decentralized storage technology for DocumentCloud, our open source platform that currently hosts over 8 million verified documents. This collaboration will also expand our leading accountability and transparency tools to support a much wider range of use cases.

DocumentCloud Add-Ons: Automate data extraction, alerts, ingestion, and much more with our simple, open source plugin system

Today, we’re launching Add-Ons, which makes it easier to launch, maintain, and share new capabilities right within DocumentCloud, ranging from exporting notes to applying machine learning techniques.

Release Notes: Better looking article images, more shareable request templates, and DocumentCloud improvements

The tech team has been busy working on improvements to both our FOIA filing platform and DocumentCloud, our document analysis and hosting tool. For MuckRock, we gave our article images a nice graphical bump, doubling the resolution, and fixed a bug that prevented sharing cloned request links with logged-out users. With DocumentCloud, the incorrectly labeled “rotate” button now does what it says on the tin. Plus: More localized versions coming soon!