PKU

MATH4995: Capstone Project for Data Science
Fall 2020


Course Information

Synopsis

This course is about projects with real world data for students in data science.

Prerequisite: (statistical) machine learning.

Instructors:

Yuan Yao

Time and Place:

TuTh 4:30-5:50pm, Zoom online, HKUST

Reference (参考教材)

An Introduction to Statistical Learning, with applications in R (ISLR). By James, Witten, Hastie, and Tibshirani

ISLR-python, By Jordi Warmenhoven.

ISLR-Python: Labs and Applied, by Matt Caudill.

Manning: Deep Learning with Python, by Francois Chollet [GitHub source in Python 3.6 and Keras 2.0.8]

MIT: Deep Learning, by Ian Goodfellow, Yoshua Bengio, and Aaron Courville

Kaggle Contest: Predict Survival on the Titanic .

Kaggle Contest: Home Credit Default Risk Prediction .

Kaggle Contest: Nexperia Image Classification (Second Stage, on-going) .

Kaggle Contest: Nexperia Image Classification (First Stage, finished) .

Tutorials: preparation for beginners

Python-Numpy Tutorials by Justin Johnson

scikit-learn Tutorials: An Introduction of Machine Learning in Python

Jupyter Notebook Tutorials

PyTorch Tutorials

Deep Learning: Do-it-yourself with PyTorch, A course at ENS

Tensorflow Tutorials

MXNet Tutorials

Theano Tutorials

statlearning-notebooks, by Sujit Pal, Python implementations of the R labs for the StatLearning: Statistical Learning online course from Stanford taught by Profs Trevor Hastie and Rob Tibshirani.

Homework and Projects:

TBA (To Be Announced)

Teaching Assistant:


Email: Mr. Weizhi ZHU wzhuai (add "AT connect DOT ust DOT hk" afterwards)

Schedule

Date Topic Instructor Scriber
08/09/2020, Tue Lecture 01: History and Overview of Artificial Intelligence. [ slides ] Y.Y.
10/09/2020, Thu Lecture 02: Supervised Learning: Linear Regression with Python [ slides ] Y.Y.
15/09/2020, Tue Lecture 03: Linear Classification with Python [ slides ] Y.Y.
17/09/2020, Thu Lecture 04: Project 1 [ project1.pdf ] Y.Y.
22/09/2020, Tue Lecture 05: Model Assessment and Selection: Subset, Forward, and Backward Selection [ YY's slides ]
Y.Y.
24/09/2020, Thu Lecture 06: Model Selection: Ridge, Lasso, and Principal Component Regression [ YY's slides ]
Y.Y.
29/09/2020, Tue Lecture 07: Decision Trees [ YY's slides ]
    [ Venue ]
  • Rm 6602, Lift 31-32
Y.Y.
06/10/2020, Tue Lecture 08: Bagging, Random Forests and Boosting [ YY's slides ]
    [ Venue ]
  • Rm 6602, Lift 31-32
Y.Y.
08/10/2020, Thu Lecture 09: Support Vector Machines [ YY's slides ]
Y.Y.
13/10/2020, Tue Today's class is cancelled due to Typoon level 8. [ Univeristy Notice ]
Y.Y.
15/10/2020, Thu Lecture 10: Seminar
    [ Venue ]
  • Zoom
    [ Speakers ]
  • Liao Yi-han. Home Credit Default Risk. [ slides (pptx) ]
  • HARJONO, Natasha Valerie; TSUI, Ying Tsz; WAHYU, Zoya Estella. Titanic: Machine Learning from Disaster. [ slides (pdf) ]
Y.Y.
20/10/2020, Tue Lecture 11: Seminar
    [ Venue ]
  • Zoom
    [ Speakers ]
  • Ching Kwok Yan.
  • LIN Chi Wing.
Y.Y.
22/10/2020, Thu Lecture 12: Seminar.
    [ Title ]
  • Applications of Quantitative Methods in Financial Market, Junxin Zhou (Jason), AQUMON, [ slides ]
  • A brief intro to applications of Machine Learning in Finance, Kaijun Hou (Jeff), AQUMON, [ slides ]
    [ Abstract ]
  • Jason will introduce some quantitative models in asset management applications based on his rich working experience.
  • Jeff will introduce the business of AQUMON and cover some machine learning applications in risk management in other financial institutions.
    [ Bio ]
  • Junxin Zhou (Jason): Senior Quantitative Researcher with passion and experience in quantitative research and investment strategy development. Working in the space of systematic alpha and asset allocation with proven track record, and keen on applying quantitative methods, data science techniques, and economic fundamentals to investment decision making.
    MSc degree in Financial Mathematics from HKUST and MSc degree in Information and Computational Science from Sun Yat-sen University. Chartered Financial Analyst (CFA), Financial Risk Manager (FRM) and SFC representative license holder of type 1, 4 & 9.
    Leading a quantitative research team with 5-6 quant analysts to develop multi-asset strategies and managing portfolios with total AUM of over 90 million Chinese Yuan.

  • Kaijun Hou (Jeff): Graduated from HKUST in 2020 majored in Computer Science, Risk Management and Business Intelligence. He is the software developer at Magnum Research Limited. He is responsible for developing financial data center and electronic trading system in the company.
Y.Y.
27/10/2020, Tue Lecture 13: Seminar
    [ Speakers ]
  • Ngai Nok Yiu, Cheung Hang Yee. Titanic Survival Problem. [ slides (pptx) ]
  • WONG, Wing Kin.
Y.Y.
29/10/2020, Thu Lecture 14: Seminar and Final Project [ pdf ]
    [ Speakers ]
  • LEE, Cheuk Yin.
Y.Y.
03/11/2020, Tue Lecture 15: An Introduction to Convolutional Neural Networks [ slides ]
Y.Y.
05/11/2020, Thu Lecture 16: Examples of Convolutional Neural Networks.
Y.Y.
10/11/2020, Tue Lecture 17: Topics in CNN: Visualization, Transfer Learning [ YY's slides ]
Y.Y.
12/11/2020, Thu Lecture 18: Topics in CNN: Visualization, Transfer Learning, Neural Style, and Adversarial Examples [ YY's slides ]
Y.Y.
17/11/2020, Tue Lecture 19: An Introduction to Recurrent Neural Networks (RNN) and Long Short Term Memory (LSTM) [ YY's slides ]
Y.Y.
19/11/2020, Thu Lecture 20: Attention, Transformer and BERT [ YY's slides ]
A.W.
Y.Y.
24/11/2020, Tue Lecture 21: Seminar
    [ Speakers ]
  • Lee Cheuk Yin. Discovery after reducing number of features on Titanic Problem.
  • TSUI, Ying Tsz; HARJONO, Natasha Valerie. Titanic [ slides (pptx)]
Y.Y.
26/11/2020, Thu Lecture 22: Seminar
    [ Speakers ]
  • Ngai Nok Yiu and Cheung Hang Yee. Home Credit Default Risk.
  • Liao Yi-Han. Semi-conductor Image Classification 2 mini.
Y.Y.
01/12/2020, Tue Lecture 23: Seminar
    [ Speakers ]
  • Ching Kwok Yan. Semi-conductor Image Classification.
  • Lin Chi Wing. Semiconductor image classification via Histogram-based SVM.
Y.Y.
03/12/2020, Thu Lecture 24: Seminar
    [ Speakers ]
  • WAHYU, Zoya Estella
  • Wong, Wing Kin
Y.Y.

by YAO, Yuan.