
knight foundation

1 Project


DockIns: Machine Learning on Deadline for Journalists

As journalists dealing with data and document sets, we find that the most interesting information is usually hidden in large, unstructured, and incomplete sets of documents. This is especially true of public contracts: what the government is buying, how much money is being spent, and who the suppliers are. To answer these questions, four media organizations (La Nacion, CLIP, Ojo Público, and MuckRock) joined forces under the JournalismAI Collab and experimented with different machine learning tools and techniques to build a platform that helps investigative reporters process unstructured documents and extract useful insights.


10 Articles


How to run Sidekick

Have you ever had a pile of documents and wanted to quickly start focusing on a particular portion of the material? Would you like help working only with the contracts, or perhaps the police reports that detail a certain type of encounter, or being able to quickly separate the letters of support from the letters of opposition sent to a politician on a key issue?


Testing two Named Entity Recognition models on Spanish documents


As journalists dealing with data and document sets, we find that the most interesting information is usually hidden in large, unstructured, and incomplete sets of documents. This is especially true of public contracts: what the government is buying, how much money is being spent, and who the suppliers are. To answer these questions, four media organizations joined forces under the JournalismAI Collab and experimented with different machine learning tools and techniques to build a platform that helps investigative reporters process unstructured documents and extract useful insights. That platform became “DockIns”.


Categorize DocumentCloud collections in real-time with SideKick


Ever get a pile of documents and want to start quickly homing in on a certain segment of material? Wish you had a little help pulling out just contracts, or maybe police reports that detail a certain type of encounter? With MuckRock’s DocumentCloud platform, that’s a challenge we know all too well, and we have a new solution to help.


Heading to #ONA19? Come celebrate DocumentCloud's 10th birthday!


The MuckRock crew is heading to New Orleans for the Online News Association’s conference, and we’d love to see you! Come join us on Friday, September 13 for two events: DocumentCloud’s tenth birthday party at 4:30 pm, followed by a Hacks/Hackers cocktail hour at 6 pm.


Release Notes: A brand new home for crowdsourced projects


We’ve launched a new homepage for MuckRock Assignments, our crowdsourcing tool that lets you contribute your time, expertise, and curiosity to a journalistic investigation.
