Text Mining with Python

online (as of 26 April 2021)
KU Leuven

This course is a hands-on course covering the use of text mining tools for the purpose of data analysis. It covers basic text handling, natural language engineering and statistical modelling on top of textual data.

The following items are covered :

  •    Encodings, cleaning of text data, regular expressions
  •     Language identification
  •     String distances
  •     Graphical displays of text data
  •     Natural language processing: parts-of-speech tagging, tokenization, lemmatisation, keyword extraction, named-entity-recognition
  •     Sentiment analysis
  •     Statistical topic detection modelling using Gensim
  •     Automatic classification using predictive modelling based on text data
  •     Word embeddings, document similarities & Text alignment

Target audience

The course is for Python users in industry/academics who are interested in practical natural language processing and statistical learning on text data. People with a data science background with less knowledge of Python and which are interested in machine learning & text mining in general will find this course also very useful.


  • Date:
    • web lectures available after 26 April 2021
    • Online Q&A on 6 May 2021
  • Language: English


  • Price: min. €100 (check the website for discount rates)
  • Target audience:
    PhD-students, non-profit/social sector, private sector
  • Prerequisites
    • experience with Python of several weeks
    • limited practical knowledge of regression modelling
    • knowledge of statistical modelling

Ready to get started?

All practical information can be found on the website of KU Leuven.