Bringing order to the document jungle - Civic Innovation Platform

The idea from the association Open Knowledge Foundation Deutschland e.V. and the company binary butterfly GmbH is intended to benefit administrations throughout Germany as they digitalise their filing systems. Using text recognition and natural language processing, the team will introduce a structured process for classifying documents simply, which should speed up workflows and take the strain off administrative staff.

Why are you a strong team?

Our areas of expertise complement each other: artificial intelligence, technology and politics, and the handling of official documents. Our collaboration runs smoothly and is a lot of fun!

Explain your idea in three sentences.

Authorities throughout Germany are digitising their filing systems – but millions of paper files have yet to be incorporated. We are developing a structured approach for this process in order to ensure easy readability, classification and organisation of the documents.

What makes your idea special?

We make it possible to process and classify very large German-language documents. The project is provided in a simple form that includes the infrastructure for document processing. We make documents searchable while ensuring that text extraction works well enough for classification. This makes the project the first of its kind to apply existing technology in this field with a special focus on the German language.
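To illustrate what "text extraction that works well enough for classification" could mean in practice, here is a minimal Python sketch. It is not the project's implementation: simple keyword matching stands in for natural language processing, and the category names, keywords, the `extraction_quality` heuristic and the `min_quality` threshold are all assumptions made up for this example.

```python
import re
from collections import Counter

# Hypothetical categories and German keywords, chosen only for illustration.
CATEGORY_KEYWORDS = {
    "building_permit": {"bauantrag", "baugenehmigung", "grundstück"},
    "invoice": {"rechnung", "betrag", "zahlung"},
}


def tokenize(text):
    """Lowercase the text and return its words, keeping German umlauts and ß."""
    return re.findall(r"[a-zäöüß]+", text.lower())


def extraction_quality(text):
    """Return the share of non-whitespace characters that are letters.

    A crude, assumed proxy for how well text recognition worked:
    badly recognised scans tend to contain many stray symbols.
    """
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    return sum(c.isalpha() for c in chars) / len(chars)


def classify(text, min_quality=0.7):
    """Assign a category, or flag the document when the text is too noisy."""
    if extraction_quality(text) < min_quality:
        return "needs_review"
    counts = Counter(tokenize(text))
    scores = {
        category: sum(counts[word] for word in keywords)
        for category, keywords in CATEGORY_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "unclassified"


print(classify("Hiermit stelle ich einen Bauantrag für das Grundstück."))
print(classify("#### 1|| ~~ %%"))
```

The first call matches two building-permit keywords, while the second consists only of symbols and is routed to manual review instead of being classified. A real system would likely replace both the quality heuristic and the keyword lookup with trained models.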

What are the next steps?

We have developed the concept and are testing our assumptions. We will now start looking for test users from different areas of society to develop the idea further together with them.