Springe zum Hauptinhalt
Professur Numerische Mathematik
Professur Numerische Mathematik
Professur Numerische Mathematik 

Introduction to Data Science (4V, 2Ü) Prof. Ernst, WS 2018/19

Content

Tentative list of course topics:
  • Introduction: What is Data Science
  • Learning Theory
  • Regression
  • Neural Networks
  • Classification
  • Clustering and Tree-Based Methods
  • Support Vectors
  • Unsupervised Learning

Notices

Listing of this course in the electronic Vorlesungsverzeichnis (course directory):

Keine Lehrveranstaltung gefunden.

Lecture

Literature

  • James, Witten, Hastie & Tibshirani. An Introduction to Statistical Learning – with Applications in R. Springer 2013. Available online at this page.
  • Here's a continually updated annotated reading list for the course (16.01.2019).

Slides

Exercises

Installation of Programming Environment under Linux (64 bit)

If you want to do the homework on your personal computers, you may clone the programming environment used in the labs. Get miniconda from this web page and follow the steps in the installation dialogue. Next, download the specification file spec-file.txt used in the labs and create a conda environment (under Linux):

conda create --name DS2018 --file spec-file.txt

Installation of Programming Environment under Windows and MacOS

Download miniconda for your distribution by following this link and follow the installation instructions. Next, download the yml-file containing the packages used in the labs and create a conda environment in a miniconda/Anaconda shell:

conda env create -f DS2018.yml

If your plots are not displayed in the browser, this might be due to a missing package. After sourcing of the correct environment, the following might help in some cases python -m ipykernel install --user Please refer to Conda (Installation under Windows, Linux and MacOS) and Conda (Managing environments) for further information.

Material

In order to start the jupyter notebooks you have to open a terminal and source our conda environment DS2018 via

source /LOCAL/Software/DataScience2018/setup_env

Next, change the directory to your exercise folder and download the jupyter notebook (right click and "Save link as") into this folder. Finally, start the notebook via the command (make sure you see the (DS2018) in front of your username):

jupyter notebook Problem_Sheet_XX.ipynb