Data Science with Python

(in German: Data Science with Python )

Module-ID: FIN-INF-120513

Link:	LSF
Responsibility:	Dr. Christian Beyer
Lecturer:	Dr. Christian Beyer
Classes:	Data Science with Python
Applicability in curriculum:	- M.Sc. INF: Informatik - M.Sc. INF: Schlüssel- und Methodenkompetenzen - M.Sc. INGINF: Informatik - M.Sc. INGINF: Schlüssel- und Methodenkompetenzen - M.Sc. WIF: Informatik - M.Sc. WIF: Schlüssel- und Methodenkompetenzen - M.Sc. DKE: Applied Data Science - M.Sc. VC: Computer Science - M.Sc. VC: Schlüssel- und Methodenkompetenzen - M.Sc. DE: Methoden der Informatik - M.Sc. DE: Interdisziplinäres Team-Projekt

Abbreviation

DSWP

Credit Points

Semester

Winter

Term

Duration

1 Semester

Language

english

Level

Master

Intended learning outcomes:
The course is about learning from data to perform predictions and obtain useful insights. In the seminar, we will use the programming language Python.
Necessary skills to manage and analyze data will be taught and practiced on real-world applications. Programming knowledge from other courses is helpful but not mandatory. However, students are expected to have a profound knowledge of fundamental data-analysis techniques, such as classification, regression and clustering. After successful completion of this course, the student will be able to perform the following tasks in Python:

Import and preprocess raw data (files, databases, web APIs)
Transform data for modelling
Perform exploratory data analysis with summary statistics and visualization
Understand, build and evaluate predictive classification and regression models, including tree-based models, ensembles and boosted models
Communicate and disseminate results and findings through reproducible documents, presentations, websites and interactive web applications

Content:
Part Fundamentals & Visualization: Basics, scripts, workflows, vectors & functions in Python Explorative data visualization Data transformation Part Data Management & Exploratory Data Analysis: Data cleaning & scraping Generating hypotheses and an intuition about the data with exploratory data analysis Data import Data management Relational data Strings, categorical data, dates & time Iteration: imperative & functional programming Part Modeling: Linear regression Classification Evaluation Model selection & regularization (LASSO, Ridge) Feature selection & model interpretation Decision trees Ensembles: random forests Boosting: gradient boosted trees Unsupervised learning, e.g. k-means, hierarchical clustering, self-organizing maps, principal component analysis Topic modeling with simple graphical models Statistical testing Part Communication: Communication and dissemination of results through visualization and interpretable summaries with documents, notebooks, presentations & websites

Workload:
Attendance time = 28 h: - 2 SWS weekly seminar; Independent work outside the actual Seminar time = 152 h: - 76 h preparation and follow-up of the seminar topics - 76 h solving the tasks, incl. work in the laboratory 180h = 28h attendance time + 152h independent work

Pre-examination requirements:	Type of examination:	Teaching method / lecture hours per week (SWS):
	Project with presentation and project report	Seminar (2 SWS)

Prerequisites according to examination regulations:	Recommended prerequisites:
keine	Area 1: Data Mining, Machine Learning, Artificial Intelligence Area 2: Databases Area 3: Programming Languages and Software Engineering Area 4: Stochastics, Applied Statistics

Media:	Literature:
	Will provided during the seminar

Comments:

Über den Studiengang

Orientierung am Regelstudienplan

Hinweise zu Modulen im Bereich: Grundlagen Ingenieurwesen

Hinweise zu Modulen für den Bereich: Human Factors

Hinweise zu Modulen für den Bereich: Fachliche Spezialisierung

Hinweise zu Modulen für den Bereich: Methoden des Digital Engineering

Weitere Hinweise

Nutzung von generativer KI für Abschlussarbeiten und Prüfungs(vor)leistungen

Fachschaftsrat der Fakultät für Informatik (Studierendenvertretung)

Academic Club

Computer Graphics - an Introduction

Introduction to Computer Science for Engineers

Introduction to Simulation

Machine Learning

Musik Information Retrieval

Robust Geometric Computing

Eudaimonic Interaction Design

Human-Centred Approaches and Technologies

Human-Centred Natural Language Processing

Digitalhandwerk

Eudaimonic Interaction Design

Human-Centred Approaches and Technologies

Human-Centred Natural Language Processing

Recent Topics in Business Informatics

Scientific Writing

Wissenschaftliches Individualprojekt

Clean Code Development

Data Management for Engineering Applications

Estimation for Autonomous Mobile Robots

Seminar on Advanced Topics of Predictive Maintenance'

Sensor Networks Seminar

Software-Development for Industrial Robotics

Wissenschaftliches Individualprojekt

Advanced Interactive Information Organization

Advanced Topics in Deep Learning (Master)

Advanced Topics in Networking

Clean Code Development

Data Science with Python

Estimation for Autonomous Mobile Robots

In-Memory und Cloud-Technologien

Information Retrieval

Learning Generative Models

MLOps for Small Language Model Applications

Praktikum IT Sicherheit

Recent Topics in Business Informatics

Scrum-in-Practice

Selected Chapters of IT Security: Orchestration of Mechanisms and Tools

Seminar on Advanced Topics of Predictive Maintenance'

Sensor Networks Seminar

Software-Development for Industrial Robotics

Software-Produktlinien

Three-Dimensional & Advanced Interaction

Transaction Processing

VLBA – Cloud DevOps Technologies

Visual Analytics in Health Care

Visualization

Wissenschaftliches Individualprojekt

AMS Lab Projects

Data Science with Python

Interdisziplinäres Teamprojekt

Interdisciplinary Team Project

Scientific Project on Databases for Multi-Dimensional Data, Genomics, and modern Hardware

Wissenschaftliches Teamprojekt

Scientific Team Project

Advanced Topics in Networking

Applied Discrete Modelling

Clean Code Development

Data Mining II - Advanced Topics in Data Mining

Introduction to Data Warehousing

Distributed Data Management

Estimation for Autonomous Mobile Robots

Recent Topics in Business Informatics

Scientific Project on Databases for Multi-Dimensional Data, Genomics, and modern Hardware

Scientific Writing

Scrum-in-Practice

Selected Chapters of IT Security: Orchestration of Mechanisms and Tools

Sensor Networks Seminar

Software-Development for Industrial Robotics

Swarm Intelligence

Transaction Processing