An Introduction to Text Mining
Share

An Introduction to Text Mining
Research Design, Data Collection, and Analysis



October 2017 | 336 pages | SAGE Publications, Inc

Students in social science courses communicate, socialize, shop, learn, and work online. When they are asked to collect data for course projects they are often drawn to social media platforms and other online sources of textual data. There are many software packages and programming languages available to help students collect data online, and there are many texts designed to help with different forms of online research, from surveys to ethnographic interviews. But there is no textbook available that teaches students how to construct a viable research project based on online sources of textual data such as newspaper archives, site user comment archives, digitized historical documents, or social media user comment archives. Gabe Ignatow and Rada F. Mihalcea's new text An Introduction to Text Mining will be a starting point for undergraduates and first-year graduate students interested in collecting and analyzing textual data from online sources, and will cover the most critical issues that students must take into consideration at all stages of their research projects, including: ethical and philosophical issues; issues related to research design; web scraping and crawling; strategic data selection; data sampling; use of specific text analysis methods; and report writing.

 
Part I: Foundations
 
Chapter 1 Text Mining and Text Analysis
Learning Objectives  
Introduction  
Six Approaches to Text Analysis  
Challenges and Limitations of Using Online Data  
Conclusions  
Key Terms  
Highlights  
Student Study Site  
Review Questions  
Discussion Questions  
Developing a Research Proposal  
Further Reading  
 
Chapter 2 Acquiring Data
Learning Objectives  
Introduction  
Online Data Sources  
Advantages and Limitations of Online Digital Resources for Social Science Research  
Examples of Social Science Research Using Digital Data  
Conclusions  
Key Terms  
Highlights  
Discussion Questions  
 
Chapter 3 Research Ethics
Learning Objectives  
Introduction  
Respect for Persons, Beneficence, and Justice  
Ethical Guidelines  
Institutional Review Boards  
Privacy  
Informed Consent  
Manipulation  
Publishing Ethics  
Conclusions  
Key Terms  
Highlights  
Student Study Site  
Review Questions  
Discussion Questions  
Web Resources  
Developing Your Research Proposal  
Further Reading  
 
Chapter 4 The Philosophy and Logic of Text Mining
Learning Objectives  
Introduction  
Ontological and Epistemological Positions  
Metatheory  
Making Inferences  
Conclusions  
Key Terms  
Highlights  
Student Study Site  
Discussion Questions  
Internet Resources  
Developing Your Research Proposal  
Further Reading  
 
Part II: Research Design and Basic Tools
 
Chapter 5 Designing Your Research Project
Learning Objectives  
Introduction  
Critical Decisions  
Idiographic and Nomothetic Research  
Levels of Analysis  
Qualitative, Quantitative, and Mixed-Methods Research  
Choosing Data  
Formatting Your Data  
Conclusions  
Key Terms  
Highlights  
Student Study Site  
Review Questions  
Discussion Questions  
Developing Your Research Proposal  
Further Reading  
 
Chapter 6 Web Scraping and Crawling
Learning Objectives  
Introduction  
Web Statistics  
Web Crawling  
Web Scraping  
Software for Web Crawling and Scraping  
 
Part III: Text Mining Fundamentals
 
Chapter 7 Lexical Resources
Learning Objectives  
Introduction  
WordNet  
WordNet Affect  
Roget’s Thesaurus  
Linguistic Inquiry and Word Count  
General Inquirer  
Wikipedia  
Conclusions  
Key Terms  
Highlights  
Discussion Topics  
 
Chapter 8 Basic Text Processing
Learning Objectives  
Introduction  
Basic Text Processing  
Language Models and Text Statistics  
More Advanced Text Processing  
Conclusions  
Key Terms  
Highlights  
Discussion Topics  
 
Chapter 9 Supervised Learning
Learning Objectives  
Introduction  
Feature Representation and Weighting  
Supervised Learning Algorithms  
Evaluation of Supervised Learning  
Conclusions  
Key Terms  
Highlights  
Discussion Topics  
 
Part IV: Text Analysis Methods from the Humanities and Social Sciences
 
Chapter 10 Analyzing Narratives
Learning Objectives  
Introduction  
Approaches to Narrative Analysis  
Planning a Narrative Analysis Research Project  
Qualitative Narrative Analysis  
Mixed Methods and Quantitative Narrative Analysis Studies  
Conclusions  
Key Terms  
Highlights  
Review Questions  
Developing a Research Proposal  
Further Reading  
 
Chapter 11 Analyzing Themes
Learning Objectives  
Introduction  
How to Analyze Themes  
Examples of Thematic Analysis  
Conclusions  
Key Terms  
Highlights  
Review Questions  
Developing a Research Proposal  
Further Reading  
 
Chapter 12 Analyzing Metaphors
Learning Objectives  
Introduction  
Cognitive Metaphor Theory  
Approaches to Metaphor Analysis  
Qualitative, Quantitative, and Mixed Methods  
Conclusions  
Key Terms  
Highlights  
Review Questions  
Developing a Research Proposal  
Further Reading  
 
Part V: Text Mining Methods from Computer Science
 
Chapter 13 Text Classification
Learning Objectives  
Introduction  
What Is Text Classification?  
Applications of Text Classification  
Approaches to Text Classification  
Conclusions  
Key Terms  
Highlights  
Discussion Topics  
 
Chapter 14 Opinion Mining
Learning Objectives  
Introduction  
What Is Opinion Mining?  
Resources for Opinion Mining  
Approaches to Opinion Mining  
Conclusions  
Key Terms  
Highlights  
Discussion Topics  
 
Chapter 15 Information Extraction
Learning Objectives  
Introduction  
Entity Extraction  
Relation Extraction  
Web Information Extraction  
Template Filling  
Conclusions  
Key Terms  
Highlights  
Discussion Topics  
 
Chapter 16 Analyzing Topics
Learning Objectives  
Introduction  
What Are Topic Models?  
How to Use Topic Models  
Examples of Topic Modeling  
Conclusions  
Key Terms  
Highlights  
Review Questions  
Developing a Research Proposal  
Internet Resources  
Further Reading  
 
Part VI: Writing and Reporting Your Research
 
Chapter 17 Writing and Reporting Your Research
Learning Objectives  
Introduction: Academic Writing  
Evidence and Theory  
The Structure of Social Science Research Papers  
Conclusions  
Key Terms  
Highlights  
Student Study Site  
Web Resources  
Undergraduate Research Journals  
Further Reading  
 
Glossary
 
References
 
Appendix A Data Sources for Text Mining
 
Appendix B Text Preparation and Cleaning Software
 
Appendix C: General Text Analysis Software
 
Appendix D: Qualitative Data Analysis Software
 
Appendix E: Opinion Mining Software
 
Appendix F: Concordance and Keyword Frequency Software
 
Appendix G: Visualization Software
 
Appendix H: List of Websites
 
Appendix I: Statistical Tools

Preview this book

For instructors

Purchasing options

Please select a format:

ISBN: 9781506337005