Text Data Analytics with NLTK and PYTHON

Page 1

Text Data Analytics with NLTK and PYTHON


CHAPTER – 4 THE BASICS OF SEARCH ENGINE FRIENDLY DESIGN & DEVELOPMENT


About NLTK: – The Natural Language Toolkit, or more commonly NLTK, is a suite of libraries and programs for symbolic and statistical natural language processing (NLP) for English written in the Python programming language. Course Description: – This course introduces Natural Language Processing (NLP) with the use of Natural Language Tool Kit (NLTK) and Python. Through practical approach, you will get handson experience with Natural language concepts and computational linguistics concepts.

Copyright @ 2019 Learntek. All Rights Reserved.

3


Course Outcome: – On completion of this course, the students will be able to 1.Understand the basic concepts of Natural Language Processing (NLP). 2.Understand how to use the Natural Language Tool Kit. 3.Work with text data using the Natural Language Tool Kit. 4.Load and manipulate your text data. 5.Perform syntax and semantics in natural language processing. 6.Ability to design and analyse various NLP algorithms. 7.Apply various concepts of NLP in other application areas.

Copyright @ 2019 Learntek. All Rights Reserved.

4


Module 1 Introduction to Natural Language Processing About Natural Language Toolkit Getting Started with NLTK NLTK installation Loading Book

Module 2 Searching Text Counting Vocabulary Texts as Lists of Word List, Indexing list, variables, strings Copyright @ 2019 Learntek. All Rights Reserved.

5


Module 3 Frequency Distributions Fine-Grained Selection of Words Collocations Counting Other Things

Module 4 Making Decisions and Taking Control Conditionals, Operating on Every Element, Nested Code Blocks, Looping with Conditions Copyright @ 2019 Learntek. All Rights Reserved.

6


Module 5 Accessing Text Corpora and Lexical Resources Gutenberg Corpus Gutenberg Corpus Web and Chat Text Corpus Brown Corpus Reuters Corpus Inaugural Address Corpus Loading your own Corpus

Copyright @ 2019 Learntek. All Rights Reserved.

7


Module 6 Lexical Resources Wordlist Corpora Stopword Name Corpus Comparative Wordlists WordNet

Copyright @ 2019 Learntek. All Rights Reserved.

8


Module 7 Processing Raw Text Accessing Text from the Web Tokenization Dealing with HTML Processing RSS Feeds Reading Local Files

Copyright @ 2019 Learntek. All Rights Reserved.

9


Module 8 Finding Word Stems Stemmers Lemmatization Segmentation: Sentence Segmentation and Word Segmentation Writing Results to a File Text Wrapping

Copyright @ 2019 Learntek. All Rights Reserved.

10


Module 9 Learning to Classify Text Supervised Classification Case Study 1: Gender Identification Case Study 2: Document Classification Part-of-Speech Tagging

Copyright @ 2019 Learntek. All Rights Reserved.

11


For more Training Information , Contact Us Email : info@learntek.org USA : +1734 418 2465 INDIA : +40 4018 1306 +7799713624 Copyright @ 2019 Learntek. All Rights Reserved.

12


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.