Course playlist: https://www.youtube.com/playlist?list=PLw3N0OFSAYSEC_XokEcX8uzJmEZSoNGuS
We'll look at tagging our tokens with useful information including part-of-speech tags and named entity tags. We'll also explore different types of sentence parsing to help extract the meaning of a sentence. In the demo, we'll explore how to get these things done with spaCy and how to use the library's "matchers" and other features to build simple rules-based tools.
Colab notebook: https://colab.research.google.com/github/futuremojo/nlp-demystified/blob/main/notebooks/nlpdemystified_preprocessing.ipynb#scrollTo=o9HLYYUt1kOP
Timestamps:
00:00:00 Advanced Preprocessing
00:00:18 Part-of-Speech (PoS) Tagging
00:01:06 Uses of PoS tags
00:02:24 Named Entity Recognition (NER)
00:03:17 Uses of NER tags
00:04:08 The challenges of NER
00:04:57 PoS- and NER-tagging as sequence labelling tasks
00:07:30 Constituency parsing
00:10:24 Dependency parsing
00:12:22 Uses of parsing
00:13:46 Which parsing approach to use
00:14:14 DEMO: advanced preprocessing with spaCy
00:21:11 Preprocessing recap
This video is part of Natural Language Processing Demystified --a free, accessible course on NLP.
Visit https://www.nlpdemystified.org/ to learn more.