Learn practical skills, build real-world projects, and advance your career

COVID-19 FILE FILTERING

The metadata file contain large data up to 44K we have to filter down the essntial data that relates to outbreak of corona virus.
The process will be doing is to search the abstract and the title with the keywords related to Corona Virus

# importing libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from numpy import array

# Reading the metadata file 
unFilter_data= pd.read_csv('metadata.csv')
unFilter_data.head(2)
# printing how many rows are there in csv file
print(f'Count of rows in file   :{len(unFilter_data)}')

srData= unFilter_data[unFilter_data['authors'].str.contains('Overall, James C.',na=False)]

srData.head()
Count of rows in file :44220