Differences

This shows you the differences between two versions of the page.

tools [2020/02/26 23:21] – [OCR] prcurtis
tools [2022/12/03 17:02] (current) – [OCR & Kuzushiji Reading] yhashimoto
Line 1: Line 1:
-====== Tools ======
+====== Digital Tools ======
  
 Below are sources that were specifically designed to work with Japanese-language materials and/or Asian languages more broadly. For a larger list of general digital humanities tools, see the final section at the bottom of this page.
  
 ===== Japan-specific =====
 +
 +[[https://digitalorientalist.com/2019/04/16/creating-a-launcher-extension-for-google-chrome/|Google Extension of quicklinks to some commonly used DH Tools in Japan Studies]]
 +[[https://tsukuba.academia.edu/JamesMorris|James Harry Morris]], //[[https://digitalorientalist.com|The Digital Orientalist]]//
 +
  
 ==== Concordance ====
Line 14: Line 18:
  
   * [[https://osdn.net/projects/ipadic/|IPAdic]]
 +  * [[http://nlp.ist.i.kyoto-u.ac.jp/index.php?JUMAN|JumanDic]]
 +  * [[https://ja.osdn.net/projects/naist-jdic/|NAIST-jdic]]
 +  * [[https://github.com/neologd/mecab-ipadic-neologd|Neologd]]
   * [[http://chamame.ninjal.ac.jp/chamame_unidic_download.html|UniDic]]
  
Line 19: Line 26:
  
   * [[https://www.actor-atlas.info/ja:japan|Actor Atlas]]
 +  * [[http://european-edo-network.org/projects/shintotosaijiki/|The Shin Tôto Saijiki (A DemiScript Augmented Picture Map)]]
 +  * [[https://github.com/code4history/Maplat/wiki|Maplat]]
 +  * [[https://www.nihu.jp/ja/database/source_map|歴史地名データ]]
  
-==== OCR ====
+==== OCR & Kuzushiji Reading ====
  
   * [[https://github.com/tesseract-ocr|Tesseract]]
Line 26: Line 36:
   * [[https://www.abbyy.com/en-us/finereader/|ABBYY FineReader]]
   * [[https://mp.ex.nii.ac.jp/kuronet/|KuroNet]]
-==== Temporal Conversion/Analysis ====
 +  * [[http://codh.rois.ac.jp/miwo/|miwo]]
 +  * [[http://www.let.osaka-u.ac.jp/~okajima/PDF/5tai/|五體字類]]
 +  * [[http://www.ai-kuzushiji.net/|AI 手書きくずし字検索]]
 +  * [[http://komonjo.rokumeibunko.com/binran/bushu01.html|部首のくずし字]]
 +  * [[http://codh.rois.ac.jp/char-shape/search/|くずし字データベース検索]]
 +  * [[https://mojizo.nabunken.go.jp/|Mojizo 木簡・くずし字解読システム]]
 +  * [[https://mojiportal.nabunken.go.jp/en/|Multi-database Search System for Historical Chinese Characters]]
 +  * [[http://www.book-seishindo.jp/kana/index.html|変体仮名を調べる]]
 +  * [[https://kula.honkoku.org/|くずし字学習アプリKuLA]]
 +==== Temporal and Unit Conversion/Analysis ====
  
   * [[http://www.hutime.org/|HuTime]]
   * [[https://maechan.net/kanreki/|Kanreki]]
   * [[http://www.yukikurete.de/nengo_calc.htm|NengoCalc]]
 +  * [[https://www.vcalc.com/wiki/jmorris/Japanese+%28Shakkanh%C5%8D%29+Unit+Conversion+Calculator|vCalc]]
  
 ==== Text Analysis ====
  
-  * [[http://ctext.org/plugins/texttools/|ctext]] (works with pre-tokenized Japanese)
+  * [[http://ctext.org/plugins/texttools/|CTP Text Tools]] (works with pre-tokenized Japanese)
   * [[http://khcoder.net/|KH Coder]]
-  * [[http://voyant-tools.org/?lang=ja|Voyant]]
+  * [[http://voyant-tools.org|Voyant]]
   * [[https://khcoder.net/en/|KH Coder]]
  
Line 45: Line 65:
   * [[http://chasen-legacy.osdn.jp/|ChaSen]]
   * [[http://comainu.org/|Comainu]]
 +  * [[https://megagonlabs.github.io/ginza/|GiNZA]]
 +  * [[https://mocobeta.github.io/janome/|Janome]]
 +  * [[https://github.com/ikawaha/kagome|Kagome]]
 +  * [[https://github.com/wanasit/kotori|Kotori (in Kotlin)]]
   * [[http://www.phontron.com/kytea/|KyTea]]
   * [[https://www.atilika.org/|kuromoji]]
   * [[https://play.google.com/store/apps/details?id=org.mightyfrog.android.japanesetextanalyzer|Japanese Text Analyzer]]
   * [[http://nlp.ist.i.kyoto-u.ac.jp/index.php?cmd=read&page=JUMAN&alias[]=%E6%97%A5%E6%9C%AC%E8%AA%9E%E5%BD%A2%E6%85%8B%E7%B4%A0%E8%A7%A3%E6%9E%90%E3%82%B7%E3%82%B9%E3%83%86%E3%83%A0JUMAN|JUMAN]]
 +  * [[https://github.com/ku-nlp/jumanpp|Juman++]]
   * [[https://github.com/taishi-i/nagisa|Nagisa]]
   * [[https://taku910.github.io/mecab/|MeCab]]
 +  * [[https://github.com/neologd/mecab-ipadic-neologd|mecab-ipadic-NEologd]] (neologism dictionary for MeCab)
 +  * [[https://github.com/ikegami-yukino/neologdn|neologdn]] (normalizer for mecab-neologd)
 +  * [[https://github.com/WorksApplications/Sudachi|Sudachi]]
 +  * [[https://github.com/WorksApplications/SudachiPy|SudachiPy]] (for Python)
   * [[http://chasen.org/~taku/software/TinySegmenter/|TinySegmenter]] (JavaScript)
   * [[https://github.com/rakuten-nlp/rakutenma|Rakuten]] (JavaScript)
tools.1582759306.txt.gz · Last modified: 2020/02/26 23:21 by prcurtis