Learn practical skills, build real-world projects, and advance your career
Created 4 years ago
import yake
import pke
from nltk.corpus import stopwords
References
-
Blog Post that introduced KPE https://nishkalavallabhi.github.io/KPE1/
-
https://github.com/kpnDataScienceLab/keyword-extraction - Survey and Evaluation of different keyword extraction algorithms by a data science team
-
[Papers] EmbedRank https://arxiv.org/pdf/1801.04470.pdf
Next Steps
Two approaches discussed in Text Analytics with Python and Code
- Collocations
- Weighted tag-based phrase extraction (extract noun phrases chunks using shallow parsing followed by computing tf-idf weights for each chunk and return top weighted phrases)
The following also provides a Context aware Text Analysis
https://learning.oreilly.com/library/view/Applied+Text+Analysis+with+Python/97814919630
#!python -m nltk.downloader stopwords
#!python -m nltk.downloader universal_target
#!python -m spacy download en # download the english model
text_content = """
Sources tell us that Google is acquiring Kaggle, a platform that hosts data science and machine learning
competitions. Details about the transaction remain somewhat vague , but given that Google is hosting
its Cloud Next conference in San Francisco this week, the official announcement could come as early
as tomorrow. Reached by phone, Kaggle co-founder CEO Anthony Goldbloom declined to deny that the
acquisition is happening. Google itself declined 'to comment on rumors'.
"""