In this course, you will start to use machine learning methods to further your exploration of document term matrices (DTM). You will use a DTM to create train and test sets with the scikit-learn package in Python — an important first step in categorizing different documents. You will also examine different models, determining how to select the most appropriate model for your particular natural language processing task. Finally, after you have chosen a model, trained it, and tested it, you will work with several evaluation metrics to measure how well your model performed. The technical skills and evaluation processes you study in the course will provide valuable experience for the workplace and beyond.

You are required to have completed the following courses or have equivalent experience before taking this course:

  • Natural Language Processing Fundamentals
  • Transforming Text Into Numeric Vectors
 

How It Works

Course Length
3 weeks

Effort
6 to 8 hours of study per week

Format
100% online, instructor-led
  • Engineers
  • Software developers
  • Computer scientists new to NLP
  • Data scientists
  • Analysts
  • Researchers
  • Linguists
Get It Done 100% Online
Our programs are expressly designed to fit the lives of busy professionals like you.

Learn From cornell's Top Minds
Courses are personally developed by faculty experts to help you gain today's most in-demand skills.

Power Your career
Cornell's internationally recognized standard of excellence can set you apart.

Request Information Now by completing the form below.

Act today—courses are filling fast.