Learn practical skills, build real-world projects, and advance your career

Let's import the required libraries. We will be extensively dealing with 'nltk'.

import os
import sys
import pandas as pd
import numpy as np
import nltk
from nltk.corpus import stopwords
from nltk.stem import PorterStemmer
from nltk.tokenize import word_tokenize

Import dataset. Data is stored in 'DATA.csv' file

df = pd.read_csv('DATA.csv', names= ['v1', 'v2'])
df.head()