Workshop: Analysing 200 Years of Political Debate
Merkel might not be familiar with 17th century British Parliamentary rules, but you will be after this workshop. You'll learn to analyse 200 years of British political debates with web scraping, data science and natural language processing.
Merkel might not be familiar with 17th century British Parliamentary rules, but you will be after this workshop. Dr Maryam Ahmed (BBC News) will share the unique challenges of analysing the Hansard Archive, an online record of every Parliamentary speech from 1803 to the present day.
You'll learn how to ethically scrape Hansard with the headless browser Selenium, and transform messy HTML into structured data with Pandas and BeautifulSoup. Maryam will explain how to find themes in political speeches with NLTK and Scikit-Learn methods including TF-IDF and Latent Dirichlet allocation. Spoiler: her talk will contain at least one clip of John Bercow shouting 'order'.