Skip to Main Content
It looks like you're using Internet Explorer 11 or older. This website works best with modern browsers such as the latest versions of Chrome, Firefox, Safari, and Edge. If you continue with this browser, you may see unexpected results.
An easy to use web text analysis tool. Voyant is free and allows users to upload or paste text. The program will automatically determine word frequencies and colocates and display them graphically.
JSTOR Data for Research
Data for Research (dfr.jstor.org) is a free, self-service tool that allows computer scientists, digital humanists, and other researchers to select and interact with content on JSTOR.
Topic Modeling Tool
This is a simple free tool that allows users to topic model texts using MALLET, but with an easy-to-use graphical user interface.
Natural Language Tool (Advanced)
NLTK is a platform for building Python programs to work with human language data.
Stanford Named Entity Recognizer (Advanced)
This is a simple cut and paste interface with Stanford's entity extraction program (which is available for download). This interface allows users to provide small bits of text which it then automatically tags for Persons, Places, and Organizations.
MALLET (MAchine Learning for LanguagE Toolkit) is a collection of tools that facilitate document classification, sequence tagging, and topic modeling.
R is an open source programming language and software environment for statistical computing and graphics, but used by humanities scholars for large-scale text analysis, including data extraction, stylistic analysis, authorship attribution, genre detection, gender detection, unsupervised clustering, supervised classification, topic modeling, and sentiment analysis.
A free, open-source template for MS Excel 2007 and 2010 that makes it easy to explore network graphs. With NodeXL, you can enter a network edge list in a worksheet and click a button to see your graph, all in the environment of the Excel window.
Gephi is an open source, interactive visualization and exploration platform for all kinds of networks and complex systems, dynamic and hierarchical graphs.
AutoMap enables the extraction of information from texts using network text analysis methods. AutoMap supports the extraction of several types of data from unstructured documents.
ORA is a dynamic meta-network assessment and analysis tool developed by CASOS at Carnegie Mellon. It contains hundreds of social network, dynamic network metrics, trail metrics, procedures for grouping nodes, identifying local patterns, comparing and contrasting networks, groups, and individuals from a dynamic meta-network perspective.
Content Management and Web Publishing
WordPress is a free and open source popular blogging tool and a content management system.
Omeka is a free, flexible, and open source web-publishing platform for the display of library, museum, archives, and scholarly collections and exhibitions.
Scalar is a free, open source authoring and publishing platform that’s designed to make it easy for authors to write long-form, born-digital scholarship online. Scalar enables users to assemble media from multiple sources and juxtapose them with their own writing in a variety of ways, with minimal technical expertise required.
Drupal is an open source content management system for supporting resources like blogs and web sites.
ImageJ is a public domain, Java-based image processing program that can display, edit, analyze, process, save, and read many image formats including TIFF, PNG, GIF, JPEG, BMP, DICOM, FITS, as well as raw formats. It also can measure distances and angles, create density histograms and line profile plots among many other features.
ImagePlot is a free software tool that visualizes collections of images and video of any size. It is implemented as a macro which works with the open source image processing program ImageJ.
Audacity is a free open source digital audio editor and recording computer software application. In addition to recording audio from multiple sources, Audacity can be used for post-processing of all types of audio, including podcasts by adding effects such as normalization, trimming, and fading in and out. Audacity also can do audio spectrum analysis using the Fourier transform algorithm.
Wordle is a tool for generating word clouds from text that you provide. The clouds give greater prominence to words that appear more frequently in the source text.
A collection of data visualization tools. You can upload your own data and create web-based visualizations that are made available to the public for comments and discussions. You need to create an account to upload data.
Tableau Public is a free data storytelling application. Create and share interactive charts and graphs, maps, live dashboards and fun applications in minutes, then publish anywhere on the web.
Viewshare is a free platform for generating and customizing views (interactive maps, timelines, facets, tag clouds) that allow users to experience your digital collections.
ArcGIS Explorer Desktop is a free GIS viewer that gives you an easy way to explore, visualize, and share GIS information. ArcGIS Explorer, you can access ready-to-use ArcGIS Online basemaps and layers, fuse local data with map services to create custom maps, add photos, reports, videos, and other information to your maps, as well as perform spatial analysis (e.g., visibility, modeling, proximity search).
TimelineJS is an open-source tool that enables users to build visually rich, interactive timelines. It can pull in media from a variety of sources and has built-in support for Twitter, Flickr, Google Maps, YouTube, Vimeo, Vine, Dailymotion, Wikipedia, SoundCloud and more. Harvests data from Google Spreadsheets.
D3.js Data-Driven Documents (Advanced)