One Step Further in Visualizing Consumer Review Data
An ensemble between part-of-speech tagging and related term search algorithms.
Data science, writing, and reading notes
Hi there. I am Xiao (Marshal) Ma, a UC Berkeley alum and data scientist. This site collects my writing, programming projects, and reading notes.
The name ALaughingHorse comes from my Chinese name, Ma Xiao, which can be read as horse laugh or horse roar. It is a strange little translation, but it fits my personality.
Latest writing
An ensemble between part-of-speech tagging and related term search algorithms.
Reduce the dimension of training data by over 99.7% with less than a 3% drop in performance.
Getting interpretable keywords with related terms to understand consumer reviews.
Projects
A Python package that provides a convenient pipeline for text data preprocessing.
A Python package that helps you understand the context of unigram results from bag-of-words analysis.