Text and Data Mining

Posted:

Users must use Python to analyze JSTOR text data. See below for JSTOR's introductory and intermediate guides to Python.
Register for a JSTOR account

If you are off-campus, connect via VPN so that Constellate recognizes that you are associated with… Read more.

Posted:

Getting started

Please visit our guide on logging into the Gale Digital Scholar Lab
For general information and frequently asked questions (FAQs), please refer to our overview of the platform

Learning resources
Workshops
Text Analysis Tasting Menu… Read more.

Posted:

Table of Contents

Introduction
Document Clustering
Named Entity Recognition
Ngrams
Parts of Speech Tagger
Sentiment Analysis
Topic Modeling

Introduction
(back to table of contents)

Now that we have our text collection and our cleaning… Read more.

Posted:

Presentation Description:
June 28, 2021, 3:00pm, presented by Nick FieldConstellate is a new platform for text analysis. Users can create datasets of thousands of items from JSTOR and other collections, and then filter, clean, analyze, and visualize… Read more.

Posted:

Workshop Description:
Are you interested in analyzing large bodies of texts, or teaching text analysis in your class? Not sure what tools are available for you to use? Join us to learn about some of the major tools for text analysis at the… Read more.

Posted:

Workshop Description:
Are you interested in analyzing large bodies of texts, or teaching text analysis in your class? Join us to learn about Constellate, a new tool for text analysis. Constellate provides tools for building a large dataset, using… Read more.

Posted:

Getting Started

How to Access Constellate
For general information, please refer to our overview of the platform.
Sign up for classes from Constellate

Learning Resources
Workshops
Text Analysis Tasting Menu: A Sampling of Available Tools

This… Read more.

Posted:

Table of Contents

Build a collection
Upload your own texts

Create a text document
Bulk upload, method 1: text files
Bulk upload, method 2: spreadsheet (CSV)



Build a Collection
(back to table of contents)
The DSL has access to … Read more.

Posted:

Presentation Description:
April 1, 2021, 2:00pm, presented by Leslie BarnesAPIs (Application Programing Interfaces) are a major way to access data from both free and licensed sources. Come to this session to learn: what is an API? Which APIs do you… Read more.

Posted:

Table of Contents
Our tutorial for the Digital Scholar Lab (DSL) includes the introductory page you are reading, plus six major sections. Click on any of the links below to jump to the relevant tutorial. We suggest following them in order:

Access… Read more.