9283 Tags

release notes

1 Project

8 articles

As journalists dealing with data and document sets, we find that the most interesting information is usually hidden in large, unstructured, and incomplete sets of documents. Especially information in public contracts: what the government is buying, how much money is being spent, and who are the suppliers. To answer these questions, four media organizations — La Nacion, CLIP, Ojo Público, and MuckRock — joined forces under the JournalismAI Collab and experimented with different machine learning tools and techniques in order to build a platform that helps investigative reporters understand and process unstructured documents to get useful insights.

Learn more

130 Articles

View all...

An upside down stock photo of documents in Russian and manilla envelopes.

Release Notes: Making it easier to sort, filter and reprocess document OCR

by Sanjin, Open Source Fellow

April 23, 2024

Since our last release notes, we released a new Add-On OCR Tagger that allows you to tag your document(s) based on the OCR engine used and we added better logging for when scheduled Add-Ons like Klaxon or Scraper get disabled. This helps more easily diagnose and correct outages that impact Add-Ons.

Screenshot of DocumentCloud showing available documents sorted by key value pairs

Release Notes: Improved sorting, new revision control documentation and more

by Sanjin, Open Source Fellow

April 02, 2024

In recent weeks, we’ve rolled out a few updates on DocumentCloud. Users can now sort documents on DocumentCloud by their key/value pairs. We’ve documented API access for document revision control. Finally, a fellow DocumentCloud user contributed a write-up on how to run your own version of Klaxon.

A clean looking map of the continental United States.

Release Notes: How to make self-hosted maps that work everywhere and cost next to nothing

by Chris Amico

February 13, 2024

Can we build a better way to build — and maintain — maps? DocumentCloud developer Chris Amico shares his recent work exploring new advances and opportunities when it comes to self-hosting maps.

A screenshot of a recent map showing radioactive fallout zones, and their relative impact, in the United States.

Release Notes: Better site monitoring, revision control enhancements and more

by Sanjin, Open Source Fellow

January 30, 2024

MuckRock’s tech team has been hard at work enhancing our transparency platforms, all of which are open source. Read on for what’s changed.

A screenshot of DocumentCloud's new revision control tool, that lets you download older versions of a document.

Release Notes: Document revision control, improved free transcription tools, and other improvements

by Sanjin, Open Source Fellow

January 16, 2024

DocumentCloud premium users can now utilize revision control to store document changes and easily retrieve previous versions. The platform’s sidebar has undergone a redesign, while additional improvements include dropdown menu support in Add-Ons and the ability to select which Whisper model you would like to use with the Transcribe Audio Add-On.