Preparing Text Data in Python
Text data is any data that contains words
- Text data can be gathered using an API, or by any other method that stores or collects text.
- Text data can be stored as a corpus, a .csv file, a .txt file, or in other formats.
- Text data can be converted to a dataframe format, for example by using CountVectorizer in Python.
- Text data can also be converted to transaction data, where each document becomes a "basket" of the words it contains.
- How you format your text data depends on model/method/visualization goals.
Preparing Record Data in Python
Record data are formatted as rows & columns
- Record data can be gathered using an API, downloaded, created through an experiment or observational study, taken from a database, etc.
- Record data can be stored as .csv, as .xls(x), as .txt, etc.
- Record data can be read into a dataframe using pandas in Python.
- How you format your data depends on your model/method or visualization goals.
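A minimal sketch of reading record data into a pandas dataframe and doing one simple preparation step follows. The column names and values are invented for illustration, and an in-memory string stands in for a .csv file on disk so the example runs on its own. Filling missing values with the median is only one possible choice.

```python
import io
import pandas as pd

# Hypothetical records; in practice you would pass a file path to pd.read_csv.
csv_text = """name,age,city
Ana,34,Denver
Ben,,Boulder
Cy,29,Denver
"""

# Read rows and columns into a dataframe.
df = pd.read_csv(io.StringIO(csv_text))

# Inspect the data: count missing values in each column.
missing_per_column = df.isna().sum()

# One possible cleaning step: fill missing ages with the column median.
df["age"] = df["age"].fillna(df["age"].median())

print(missing_per_column)
print(df)
```

Excel files can be read in a similar way with `pd.read_excel`, which needs an extra engine package such as openpyxl for .xlsx files.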