Course list

In this course, you will explore the foundational vocabulary of natural language processing (NLP) and start writing code right away by finding patterns in strings using both simple functions and regular expressions. This will prepare you for an important component of NLP work: preprocessing text to reduce the size of the vocabulary being analyzed. The fewer distinct words that need to be analyzed, the more computationally efficient your work will be. You will then tag sentences so that you can relate keywords to one another. You will also gain extensive hands-on experience writing Python, first by practicing on individual sentences and then by working up to a larger body of text. Overall, your understanding of and skill in NLP with Python will support you as you continue through your career and meet your goals in this area and beyond.
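As a rough illustration of the preprocessing idea described above, the sketch below finds words with a regular expression and then shrinks the vocabulary by lowercasing and dropping common words. The sentence, the stopword list, and the function names are assumptions made up for this example, not course material.

```python
import re

# Hypothetical example sentence and stopword list, chosen only for illustration.
SENTENCE = "The cats chased the mice, and the Mice escaped the CATS."
STOPWORDS = {"the", "and", "a", "an", "of"}


def tokenize(text):
    """Find runs of ASCII letters with a regular expression."""
    return re.findall(r"[A-Za-z]+", text)


def normalize(tokens):
    """Lowercase each token and drop stopwords to shrink the vocabulary."""
    return [t.lower() for t in tokens if t.lower() not in STOPWORDS]


raw_tokens = tokenize(SENTENCE)
clean_tokens = normalize(raw_tokens)

# The vocabulary is the set of distinct tokens.
raw_vocab = set(raw_tokens)
clean_vocab = set(clean_tokens)

print(len(raw_vocab), "distinct tokens before preprocessing")
print(len(clean_vocab), "distinct tokens after preprocessing")
```

Here "The" and "the", or "cats" and "CATS", count as separate vocabulary entries until they are lowercased, which is why the vocabulary gets smaller after normalization.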
  • Dec 31, 2025
  • Mar 11, 2026
  • May 20, 2026
  • Jul 29, 2026
  • Oct 7, 2026
  • Dec 16, 2026

If you want to compare two large bodies of text, you can compare the text itself: Turn each text into tokens and then measure the overlap in tokens. Sometimes, however, you don't just want to know whether two texts are different (a binary comparison); you want to know how different they are, which is a fuzzy comparison. In this course, you will transform text into numeric vectors, which allows you to perform arithmetic operations on textual information to calculate similarity. This is a classical natural language processing (NLP) technique, and it begins by creating different kinds of vectors. You will create both sparse and dense vectors, and you will compare vectors of different sizes to see how information is captured. Finally, you will measure similarity among document vectors, which is the real power of turning text into vectors. The ability to determine how similar two or more documents are is a common use of NLP, and you will practice this technique through hands-on exercises and projects.
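The two kinds of comparison mentioned above can be sketched in a few lines of plain Python. This is a minimal sketch, assuming whitespace tokenization, Jaccard overlap for the token comparison, and cosine similarity over word-count vectors for the vector comparison; the two example documents and all function names are made up for illustration.

```python
import math
from collections import Counter


def tokens(text):
    """Split on whitespace after lowercasing (a deliberately simple tokenizer)."""
    return text.lower().split()


def jaccard(a, b):
    """Token overlap: shared distinct tokens divided by all distinct tokens."""
    set_a, set_b = set(tokens(a)), set(tokens(b))
    return len(set_a & set_b) / len(set_a | set_b)


def count_vector(text, vocab):
    """Represent a text as word counts, one position per vocabulary word."""
    counts = Counter(tokens(text))
    return [counts[word] for word in vocab]


def cosine(u, v):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


# Hypothetical documents, chosen only for illustration.
doc_a = "the cat sat on the mat"
doc_b = "the cat lay on the rug"

vocab = sorted(set(tokens(doc_a)) | set(tokens(doc_b)))
vec_a = count_vector(doc_a, vocab)
vec_b = count_vector(doc_b, vocab)

overlap = jaccard(doc_a, doc_b)
similarity = cosine(vec_a, vec_b)
print(f"token overlap: {overlap:.2f}, cosine similarity: {similarity:.2f}")
```

Both measures give a graded score rather than a yes-or-no answer, but the vector form also accounts for how often each word occurs, not just whether it appears.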

You are required to have completed the following course or have equivalent experience before taking this course:

  • Natural Language Processing Fundamentals

  • Nov 12, 2025
  • Jan 21, 2026
  • Apr 1, 2026
  • Jun 10, 2026
  • Aug 19, 2026
  • Oct 28, 2026

In this course, you will start to use machine learning methods to further your exploration of document-term matrices (DTMs). You will use a DTM to create train and test sets with the scikit-learn package in Python, an important first step in categorizing documents. You will also examine different models, determining how to select the most appropriate model for your particular natural language processing task. Finally, after you have chosen a model, trained it, and tested it, you will work with several evaluation metrics to measure how well your model performed. The technical skills and evaluation processes you study in the course will provide valuable experience for the workplace and beyond.
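To give a sense of what the evaluation step involves, the sketch below computes four common classification metrics by hand in plain Python. The label lists are invented for illustration, and the choice of accuracy, precision, recall, and F1 is an assumption, since the description does not name specific metrics. In practice, scikit-learn's `sklearn.metrics` module provides functions such as `accuracy_score`, `precision_score`, `recall_score`, and `f1_score` for the same purpose.

```python
# Hypothetical true labels and model predictions for ten test documents
# (1 = document belongs to the category, 0 = it does not).
y_true = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 1, 0, 0, 0, 0]

# Count each cell of the confusion matrix.
pairs = list(zip(y_true, y_pred))
tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

accuracy = (tp + tn) / len(pairs)   # share of all predictions that are correct
precision = tp / (tp + fp)          # share of predicted positives that are correct
recall = tp / (tp + fn)             # share of actual positives that were found
f1 = 2 * precision * recall / (precision + recall)  # harmonic mean of the two

print(f"accuracy={accuracy:.2f} precision={precision:.2f} "
      f"recall={recall:.2f} f1={f1:.2f}")
```

Looking at more than one metric matters because a model can score well on accuracy while still missing many of the documents it is supposed to find.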

You are required to have completed the following courses or have equivalent experience before taking this course:

  • Natural Language Processing Fundamentals
  • Transforming Text Into Numeric Vectors

  • Dec 3, 2025
  • Feb 11, 2026
  • Apr 22, 2026
  • Jul 1, 2026
  • Sep 9, 2026
  • Nov 18, 2026

Can a computer tell the difference between an article on “jaguar” the animal and “Jaguar” the car? It can if we teach it how. In this course, you will extract key phrases or words from a document, which is a key step in the process of text summarization. Part of what makes natural language processing (NLP) so powerful is that it processes text at scale, when a human would simply take too long to perform the same task given the sheer number of text documents to be read and processed. A classic use of NLP, then, is to summarize long documents, whether they are articles or books, in order to create a more easily readable abstract, or summary.

Extracting keywords or key phrases is a first step in this direction, which is where you will start in this course. Once you have trained a computer to identify the most important words in a document, you then have to train it to identify the most important sentences. This is the second step in extracting information from a document to help create an abstract, and you will perform this step on larger text documents as well. Finally, you will calculate and interpret similarity metrics to compute the degree of similarity among documents that are possibly related to one another. The techniques you use throughout this course will prove useful in specific situations at work and beyond as you support your team or achieve your personal goals.

You are required to have completed the following courses or have equivalent experience before taking this course:

  • Natural Language Processing Fundamentals
  • Transforming Text Into Numeric Vectors
  • Classifying Documents With Supervised Machine Learning

  • Dec 24, 2025
  • Mar 4, 2026
  • May 13, 2026
  • Jul 22, 2026
  • Sep 30, 2026
  • Dec 9, 2026

In this course, you will focus on measuring distance — the dissimilarity of various documents. The goal is to discover how alike or unlike various groups of text documents are to one another. At scale, this is a problem you might encounter if you need to group thousands of products together purely by using their product description or if you would like to recommend a movie to someone based on whether they liked a different movie. You will work with several different data sets and use both hierarchical and k-means clustering to create clusters, and you will practice with several distance measures to analyze document similarity. Finally, you will create visualizations that help to convey similarity in powerful ways so stakeholders can easily understand the key takeaways of any clustering or distance measure that you create.

You are required to have completed the following courses or have equivalent experience before taking this course:

  • Natural Language Processing Fundamentals
  • Transforming Text Into Numeric Vectors
  • Classifying Documents With Supervised Machine Learning
  • Topic Modeling With Unsupervised Machine Learning

  • Nov 5, 2025
  • Jan 14, 2026
  • Mar 25, 2026
  • Jun 3, 2026
  • Aug 12, 2026
  • Oct 21, 2026
  • Dec 30, 2026

We have all been misunderstood when sending a text message or email, as tone often does not translate well in written communication. Similarly, computers can have a hard time discerning the meaning of words if they are being used sarcastically, such as when we say “Great weather” when it's raining. If you are automatically processing reviews of your product, a negative review will have many of the same keywords as a positive one, so you will need to be able to train a model to distinguish between a good review and a bad review. This is where semantic and sentiment analysis come in.

In this course, you will examine many kinds of semantic relationships that words can have (such as hypernyms, hyponyms, or meronyms), which go a long way toward extracting the meaning of documents at scale. You will also implement named entity recognition to identify proper nouns within a document and use several techniques to determine the sentiment of text: Is the tone positive or negative? These invaluable skills can easily turn the tide in a difficult project for your team at work or on the path toward achieving your personal goals.

You are required to have completed the following courses or have equivalent experience before taking this course:

  • Natural Language Processing Fundamentals
  • Transforming Text Into Numeric Vectors
  • Classifying Documents With Supervised Machine Learning
  • Topic Modeling With Unsupervised Machine Learning
  • Clustering Documents With Unsupervised Machine Learning

  • Nov 26, 2025
  • Feb 4, 2026
  • Apr 15, 2026
  • Jun 24, 2026
  • Sep 2, 2026
  • Nov 11, 2026

Completing a program from eCornell really has allowed me to think outside the box at work. It gave me the confidence I needed to take a seat at that table and say I am ready.
‐ Kasey M.