Gareth Clews

Senior Data Scientist

Gareth is a mathematician by trade and a programmer by nature. His interests lie mostly in algebra and calculus and their applications to machine learning and statistics. He enjoys problem solving and has become quite tool-agnostic in his approach. Gareth’s current agenda is to spread the love of Haskell, tick more things off his programming bucket list, and try to be a bit more Bayes.

Posts by Gareth Clews

Optimus – A natural language processing pipeline for turning free-text lists into hierarchical datasets

Many datasets contain variables that consist of short free-text descriptions of items or products. This technical report describes how we developed and implemented a natural language processing (NLP) pipeline that turns such descriptions into a well-structured, hierarchical dataset.
