ML-Powered Invoice Management for a Medical Research Organization

Streamlined Invoice Management and Improved Duplicate Invoice Detection


The client struggled with identifying duplicate invoices due to variations in details like descriptions, amounts, and dates. Their existing solution produced false positives by detecting similarities across different suppliers’ invoices, leading to unnecessary reviews. They needed a more accurate solution to streamline the process and improve duplicate invoice detection.


IT Convergence used four machine learning models to process invoices from September 2018 and found that they outperformed the client’s current system, demonstrating higher accuracy and fewer false positives. This improvement meant that end users will need to review fewer invoices. ITC’s machine learning model also detected more complex scenarios, including instances where invoices had practically identical values in their features, which the current solution missed. We presented the results via Tableau, showcasing the invoices our model identified that were not in the client’s solution, along with new scenarios that hadn’t been considered before.


  • Processed data from September 2018 using four machine learning models, and each model outperformed the client’s existing solution
  • ITC solution demonstrated higher accuracy and fewer false positives, reducing the number of invoices for end-user review
  • Detected more complex scenarios, including instances the client’s solution overlooked
  • Identified invoices not in the client’s solution and presented the results in Tableau
Company Overview

The client is a prominent American non-profit medical research organization dedicated to advancing biomedical research and education. They are one of the largest private philanthropies in the world, supporting the work of scientists and researchers across various institutions, fostering groundbreaking discoveries in the life sciences.



Applications & Technologies