Big Data Text Analytics with Vectorization

Invited Talk | Day 2 | 14:20:00 | 45 Minute Duration | GG-E/F
  • David J Corliss
    Fiat Chrysler Automobiles Business Analytics Lead Data Scientist

A practical, step-by-step description of text vectorization using the Python packages word2vec and doc2vec. The steps covered are processing the data, creating the document vectors, and extracting features. The process is demonstrated on the publicly available vehicle concerns database of the National Highway Traffic Safety Administration (NHTSA).