Saturday 11:45–12:30 in Small Room

Making contract documents fully searchable at KPN

Gianluigi Bardelloni

Audience level:
Intermediate

Description

A way to help KPN account managers find selected business contracts quickly and effectively, by enabling text search on every single page.

Abstract

Signed contracts for the KPN Business Market are spread across several content management systems (CMSs) and are generally not enriched with relevant metadata. To help our account managers find selected contracts quickly and effectively, we have implemented a solution in which documents from the most important CMSs were extracted, converted to text, enriched with metadata and ingested into Elasticsearch. Account managers can now find relevant documents via Kibana in a fraction of the time. In this talk we'll describe the challenges we faced and how we (almost) solved them using several Python packages (grab, pandas, fuzzywuzzy, etc.) and ML algorithms (LSH, CNN deep learning).
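
As a rough illustration of the enrich-and-index step described above, the sketch below turns the pages of one already-converted contract into per-page documents and serializes them in the newline-delimited format accepted by the Elasticsearch bulk API. It is not the pipeline presented in the talk. The index name `contracts`, the field names, the customer list and the helper functions are all hypothetical, and the standard-library `difflib` is used here as a stand-in for fuzzywuzzy so the example has no dependencies.

```python
import json
from difflib import SequenceMatcher

# Hypothetical customer register; the names are made up for illustration.
KNOWN_CUSTOMERS = ["Acme Logistics B.V.", "Globex Nederland", "Initech Europe"]


def match_customer(raw_name, customers=KNOWN_CUSTOMERS, threshold=0.8):
    """Return the known customer most similar to raw_name, or None.

    difflib stands in for fuzzywuzzy here (an assumption of this sketch).
    """
    best_name, best_score = None, 0.0
    for name in customers:
        score = SequenceMatcher(None, raw_name.lower(), name.lower()).ratio()
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else None


def page_documents(doc_id, pages, source_cms, raw_customer):
    """Yield one searchable document per page, enriched with metadata."""
    customer = match_customer(raw_customer)
    for number, text in enumerate(pages, start=1):
        yield {
            "doc_id": doc_id,
            "page": number,
            "text": text,
            "source_cms": source_cms,  # which CMS the contract came from
            "customer": customer,      # None when no confident match exists
        }


def to_bulk_ndjson(docs, index="contracts"):
    """Serialize documents as action/source line pairs for the bulk API."""
    lines = []
    for doc in docs:
        action = {"index": {"_index": index,
                            "_id": f"{doc['doc_id']}-p{doc['page']}"}}
        lines.append(json.dumps(action))
        lines.append(json.dumps(doc))
    # The bulk API requires the payload to end with a newline.
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    pages = ["Master service agreement ...", "Signatures ..."]
    docs = page_documents("c-001", pages, "cms_a", "ACME Logistics BV")
    print(to_bulk_ndjson(docs), end="")
```

Indexing one document per page, rather than one per contract, is what lets a search hit point to the exact page; the resulting payload could then be sent to the cluster's `_bulk` endpoint with any HTTP or Elasticsearch client.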
