Introduction to Data Science in Python – London

Date/Time
Date(s) - 21/05/2020
9:45 am - 5:00 pm

*Please note this session will now take place online*

This is a training and capacity building event organised by the Consumer Data Research Centre (CDRC) in conjunction with the National Centre for Research Methods (NCRM), both ESRC-funded research projects.

This online course will introduce you to the nascent field of Data Science using Python, an industry-standard programming language. We will cover the key steps involved in solving practical problems with data, from manipulation and processing to visualisation and modelling.

These topics will be explored from a “hands-on” perspective using a modern Python stack (e.g. pandas, seaborn, scikit-learn) and examples based on real-world spatial data. We will start with an overview of the main ways to access and manipulate data in Python. We will then move on to visualisation, learning to create figures that help you better understand your data. Next comes unsupervised learning, with a K-Means example, followed by supervised learning, covering linear regression and random forests; these will allow us to illustrate the challenge of overfitting and thus motivate cross-validation. We will finish the course with some time for questions and to work on your own data.
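For a flavour of why overfitting motivates cross-validation, here is a minimal sketch. It is not course material: the data are synthetic, the helper names (`fit_line`, `fit_nearest`, `cross_val_mse`) are hypothetical, and it uses only the Python standard library rather than scikit-learn so that the fold-splitting logic stays visible. A 1-nearest-neighbour model stands in for a flexible learner that memorises its training data; it is an assumption made for illustration, not the random forest covered in the course.

```python
import random
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares with one feature; returns a predict function."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return lambda x: intercept + slope * x


def fit_nearest(xs, ys):
    """1-nearest-neighbour: predicts the y of the closest training x."""
    pairs = list(zip(xs, ys))
    return lambda x: min(pairs, key=lambda p: abs(p[0] - x))[1]


def mse(predict, xs, ys):
    """Mean squared error of a predict function on the given points."""
    return mean((predict(x) - y) ** 2 for x, y in zip(xs, ys))


def cross_val_mse(fit, xs, ys, k=5):
    """Average held-out MSE over k interleaved folds."""
    scores = []
    for fold in range(k):
        test_idx = set(range(fold, len(xs), k))
        train_x = [x for i, x in enumerate(xs) if i not in test_idx]
        train_y = [y for i, y in enumerate(ys) if i not in test_idx]
        test_x = [x for i, x in enumerate(xs) if i in test_idx]
        test_y = [y for i, y in enumerate(ys) if i in test_idx]
        model = fit(train_x, train_y)  # fit on the training folds only
        scores.append(mse(model, test_x, test_y))
    return mean(scores)


# Synthetic data (an assumption for illustration): a noisy straight line.
random.seed(0)
xs = [random.uniform(0, 10) for _ in range(200)]
ys = [2 * x + random.gauss(0, 2) for x in xs]

lin_train = mse(fit_line(xs, ys), xs, ys)
lin_cv = cross_val_mse(fit_line, xs, ys)
nn_train = mse(fit_nearest(xs, ys), xs, ys)
nn_cv = cross_val_mse(fit_nearest, xs, ys)

print(f"linear:    train MSE {lin_train:.2f}, cross-validated MSE {lin_cv:.2f}")
print(f"1-nearest: train MSE {nn_train:.2f}, cross-validated MSE {nn_cv:.2f}")
```

The memorising model looks perfect when scored on the data it was fitted to, which is exactly what makes training error misleading. Scoring each model only on points it has not seen gives a fairer comparison.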

This session will be held on Zoom and run over two mornings: Thursday 21st May and Friday 22nd May, from 9:30am to 12:30pm on each day.

All participants will be emailed further details and guidance about this online session.