Datasets

Please note that, in the interest of space and clarity, the subcategories below do not list every available dataset. The Speech Resources Consortium page, for example, provides dozens of corpora, and the Japan Data Catalog for the Humanities and Social Sciences had nearly 8,000 open-access datasets as of June 2022. Please refer to each provider's own page for up-to-date information on available datasets.

Repositories and Portals

Text Data

OCR Training

Maps/GIS

Stopwords

Image & Video Data

IIIF

datasets.txt · Last modified: 2022/07/16 02:07 by prcurtis