Learn

+DS provides training modules and learning experiences grounded in generalizable data science content, while partnering with individual units or groups to develop additional specialized content.

Online Learning

Coursera Online Modules

The online +DS modules introduce the basics of data science, across multiple important application domains. These online modules are used as prerequisites for the in-person learning experiences listed below. Together, the online content supports the in-person "flipped" learning experiences. 

Module 1: Simple Introduction to Machine Learning

The focus of this module is to introduce the concepts of machine learning with as little mathematics as possible. We will introduce basic concepts in machine learning, including logistic regression, a simple but widely employed machine learning (ML) method. Also covered is multilayered perceptron (MLP), a fundamental neural network. The concept of deep learning is discussed, and also related to simpler models.

Module 2: Basics of Model Learning

In this module we will be discussing the mathematical basis of learning deep networks. We’ll first work through how we define the issue of learning deep networks as a minimization problem of a mathematical function. After defining our mathematical goal, we will introduce validation methods to estimate real-world performance of the learned deep networks. We will then discuss how gradient descent, a classical technique in optimization, can be used to achieve this mathematical goal. Finally, we will discuss both why and how stochastic gradient descent is used in practice to learn deep networks.

Module 3: Image Analysis with Convolutional Neural Networks (CNNs)

This week will cover model training, as well as transfer learning and fine-tuning. In addition to learning the fundamentals of a CNN and how it is applied, careful discussion is provided on the intuition of the CNN, with the goal of providing a conceptual understanding.

Module 4: Recurrent Neural Networks for Natural Language Processing

This week will cover the application of neural networks to natural language processing (NLP), from simple neural models to the more complex. The fundamental concept of word embeddings is discussed, as well as how such methods are employed within model learning and usage for several NLP applications. A wide range of neural NLP models are also discussed, including recurrent neural networks, and specifically long short-term memory (LSTM) models.

Module 5: The Transformer Network for Natural Language Processing

This week we'll cover an Introduction to the Transformer Network, a deep machine learning model designed to be more flexible and robust than Recurrent Neural Network (RNN). We'll start by reviewing several machine learning building blocks of a Transformer Network: the Inner products of word vectors, attention mechanisms, and sequence-to-sequence encoders and decoders. Then, we'll put all of these components together to explore the complete Transformer Network.

Module 6: Introduction to Reinforcement Learning

This week will cover Reinforcement Learning, a fundamental concept in machine learning that is concerned with taking suitable actions to maximize rewards in a particular situation. After learning the initial steps of Reinforcement Learning, we'll move to Q Learning, as well as Deep Q Learning. We'll discuss the difference between the concepts of Exploration and Exploitation and why they are important.

Please note that Coursera for Duke is accessible to only Duke students, faculty, and staff. If you are not a member of the Duke community, you can access the public version of this Coursera course: https://www.coursera.org/duke.

Recorded Content from Machine Learning Summer School

In addition to the aforementioned Coursera content, +DS offers recordings from one of the Duke’s Machine Learning Summer School (MLSS), held in June of 2018. If you are a Duke student, staff or faculty member, you can review these classroom recordings on Panopto, with accompanying slides and links to github code demos.

In-Person and Virtual Learning Experiences

In addition to the online content, +DS offers in-person and virtual opportunities to dive deeper into the information introduced in the online modules. These learning experiences will be developed to target diverse units at Duke: from those that desire a broad understanding of what is possible with data science, and those who wish to use data-science tools (software) without a need for deep understanding of underlying methodology, to those who desire a rigorous technical proficiency of the details and methodology of data science.

See past learning experiences

Upcoming Learning Experiences
Wednesday, September 23 - 4:30pm to 6:00pm
Location: Virtual, Classroom
Instructor: Rachel Draelos

The goal of computer vision is for computers to be able to understand visual content (e.g. images, videos, 3D, stereo), usually for the purpose of making predictions (classification, detection, captioning, generation, etc.).

Monday, September 28 - 2:00pm to 3:00pm
Location: Virtual, Classroom
Instructor: Matthew Hirschey

Data visualization is part art and part science. A data visualization has to accurately convey the data, but also should be aesthetically pleasing. Great visual presentations of data will enhance the message and lead to deeper understanding of the underlying data.

Tuesday, September 29 - 12:00pm to 1:00pm
Location: Virtual, Classroom
Instructor: Jedrek Wosik

COVID-19 has led to the rapid adoption of telehealth strategies in order to maintain continuity of care. As compared to in-person visits, important changes in patient characteristics were seen in telephone and video visits as well as clinician ordering patterns.

Tuesday, October 6 - 4:30pm to 6:00pm
Location: Virtual, Classroom
Instructor: Rachel Draelos

Medical image analysis with machine learning holds immense promise for accelerating the radiology workflow and benefiting patient care. Computed tomography (CT) is a medical imaging technique that produces a high-resolution volumetric image of the internal organs.

Wednesday, October 7 - 4:30pm to 6:00pm
Location: Virtual, Classroom
Instructor: Lawrence Carin

Natural language processing (NLP) is a field focused on developing automated methods for analyzing text, and also for computer-driven text generation (synthesis, for example in translation and text summarization).

Wednesday, October 14 - 4:30pm to 6:00pm
Location: Virtual, Classroom
Instructor: Roarke Horstmeyer

Deep learning algorithms offer a powerful means to automatically analyze the content of biomedical images. However, many biological samples of interest are difficult to resolve with a standard optical microscope.

Wednesday, November 4 - 4:30pm to 6:00pm
Location: Virtual, Classroom
Instructor: Jon Reifschneider

Once considered to be niche technologies limited to the domain of academic research and the few large global tech companies, today machine learning and AI are finding innovative application in almost every industry by companies of every size.

Recorded Content from Lunch and Learns

+DS offers recordings from a series of lunch and learn sessions held in fall 2019 about how artificial intelligence (AI) is transforming healthcare. Both clinical experts and technical experts led each session, with content split roughly 50/50 between the clinical setup and technical approach, and a focus to share both the medical context and the applications of data science for health.

Digital Pathology: Identifying Thyroid Malignancy with Deep Learning

  • Danielle Range, MD, Assistant Professor of Pathology
  • Yoni Cohen, MD, Medical Instructor, Department of Head and Neck Surgery & Communication Sciences
  • Ricardo Henao, PhD, Assistant Professor of Biostatistics and Bioinformatics; Principal Data Scientist, Duke Forge
  • View the video of this session (requires Duke login): https://duke.mediasite.com/Mediasite/Play/a97f106d5dc846468e65fe274730f4bf1d

A Window to the Brain: Analysis of Retinal Images with Deep Neural Networks

  • Sharon Fekrat, MD, Professor of Ophthalmology; Associate Professor, Department of Surgery
  • Felipe Medeiros, MD, PhD, Joseph A.C. Wadsworth Professor of Ophthalmology
  • Dilraj Singh Grewal, MBBS, Associate Professor of Ophthalmology
  • Lawrence Carin, PhD, James L. Meriam Professor of Electrical and Computer Engineering; Vice President for Research, Duke University
  • View the video of this session (requires Duke login): https://duke.mediasite.com/Mediasite/Play/044abb60a25d4eea9705aa51343af36b1d

Early Autism Screening with Machine Learning

  • Geraldine Dawson, PhD, William Cleland Professor of Psychiatry & Behavioral Sciences; Director, Duke Center for Autism and Brain Development; Director, Duke Institute for Brain Sciences
  • Guillermo Sapiro, PhD, James B. Duke Professor of Electrical and Computer Engineering; Professor of Mathematics
  • View the video of this session (requires Duke login): https://duke.mediasite.com/Mediasite/Play/588abd8d0e8148aba7b8ea27dd82e9721d

Recommending MyChart Responses with Natural Language Processing