documentcloud

1 Project

DockIns: Machine Learning on Deadline for Journalists

As journalists who work with data and document sets, we find that the most interesting information is usually hidden in large, unstructured, and incomplete sets of documents. This is especially true of public contracts: what the government is buying, how much money is being spent, and who the suppliers are. To answer these questions, four media organizations (La Nacion, CLIP, Ojo Público, and MuckRock) joined forces under the JournalismAI Collab and experimented with different machine learning tools and techniques to build a platform that helps investigative reporters process unstructured documents and draw useful insights from them.

62 Articles

A red klaxon on an industrial cement background.

Klaxon Cloud: Free, simple alerts when a webpage updates

Want to know when your favorite government agency posts new information? Wondering if a corporate press release might see some post-publication revisions? We’ve brought the power of The Marshall Project’s Klaxon site monitoring tool into DocumentCloud, and it’s now easier than ever to track changes and get alerts from websites you care about.

A screenshot of Document Rotator, a new Add-On that allows you to detect the orientation of pages in a document and auto-rotate the pages

Release Notes: New DocumentCloud tools, better bulk processing and more

In the last two weeks, MuckRock’s tech team has been hard at work enhancing the DocumentCloud platform. Notable updates include an improved method for changing the access level of documents in DocumentCloud, a range of new and upgraded Add-Ons and revamped functional tests for DocumentCloud’s frontend.

Release Notes: MuckRock’s new donation page, easier pinning of Add-Ons and stability improvements

Over the past few weeks, we’ve rolled out a new donation page, just in time for NewsMatch. We’ve also worked to revamp the DocumentCloud homepage with more information, along with adding the ability to pin your favorite DocumentCloud features more easily.

Our search for the best OCR tool in 2023, and what we found

A side-by-side comparison of five OCR tools using multiple kinds of documents from DocumentCloud.

Hands are reviewing colorful files inside a filing cabinet.

Here’s why MuckRock and POGO had to archive FOIAonline

Last month, the Environmental Protection Agency (EPA) dismantled a vital tool for transparency when it decommissioned FOIAonline.gov, an online resource that allowed the public to make and track Freedom of Information Act (FOIA) requests to over 20 federal agencies, and to view responsive documents. The EPA, which oversaw FOIAonline on behalf of participating agencies, claims to have fulfilled over 1.5 million requests and attracted 34,000 active registered users over the decade-plus the portal was operating. POGO and MuckRock have partnered to host a publicly available archive of nearly 34,000 documents captured before FOIAonline was shuttered.
