datasets
Differences
This shows you the differences between two versions of the page.
datasets [2022/06/07 16:14] – prcurtis | datasets [2022/07/16 02:07] (current) – [Text Data] prcurtis | ||
---|---|---|---|
Line 1: | Line 1: | ||
======Datasets====== | ======Datasets====== | ||
- | Please note that, in the interest of space and clarity, not every available dataset is listed in the subcategories below. The Speech Resources Consortium page, for example, provides dozens of corpora, as does the Japan Data Catalog for the Humanities and Social Sciences. Please refer to their pages for more up-to-date information on available datasets. | + | Please note that, in the interest of space and clarity, not every available dataset is listed in the subcategories below. The Speech Resources Consortium page, for example, provides dozens of corpora, as does the Japan Data Catalog for the Humanities and Social Sciences |
=====Repositories and Portals===== | =====Repositories and Portals===== | ||
Line 17: | Line 17: | ||
* [[http:// | * [[http:// | ||
* [[http:// | * [[http:// | ||
- | * [[https:// | + | * [[https:// |
- | * [[https:// | + | * [[https:// |
- | * [[https:// | + | * [[https:// |
- | * [[https:// | + | * [[https:// |
- | * [[https:// | + | * [[https:// |
- | * [[https:// | + | * [[https:// |
* [[https:// | * [[https:// | ||
* [[https:// | * [[https:// | ||
* [[https:// | * [[https:// | ||
- | * [[https:// | + | * [[https:// |
- | * [[https:// | + | * [[https:// |
- | * [[https:// | + | * [[https:// |
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
=====OCR Training===== | =====OCR Training===== | ||
Line 39: | Line 56: | ||
* [[http:// | * [[http:// | ||
+ | * [[https:// | ||
* [[https:// | * [[https:// | ||
* [[https:// | * [[https:// | ||
Line 49: | Line 67: | ||
* [[http:// | * [[http:// | ||
+ | =====Stopwords===== | ||
+ | **[[https:// | ||
+ | [[https:// | ||
- | =====Image Data===== | + | =====Image |
* [[http:// | * [[http:// | ||
* [[http:// | * [[http:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
=====IIIF===== | =====IIIF===== | ||
* [[http:// | * [[http:// | ||
datasets.1654618468.txt.gz · Last modified: 2022/06/07 16:14 by prcurtis