PDFMiner

Sep
01
Data Science with Judgement Data – My PDPC Decisions Journey

Data Science with Judgement Data – My PDPC Decisions Journey

An interesting experiment to apply what I learnt in Data Science to the area of law.
8 min read
Jun
24
Mining PDFs to obtain better text from Decisions

Mining PDFs to obtain better text from Decisions

After several attempts at wrangling with PDFs, I managed to extract more text information from complicated documents using PDFMiner.
9 min read
Dec
12
Get rid of the muff: pre-processing PDPC Decisions

Get rid of the muff: pre-processing PDPC Decisions

This post is part of a series on my Data Science journey with PDPC Decisions. Check it out for more
5 min read