regex

Differences

This shows you the differences between two versions of the page.

regex [2022/06/21 15:08] prcurtis
regex [2022/06/21 18:06] (current) prcurtis
Line 1: Line 1:
 ===== Regular Expressions (Regex) for Japanese ===== ===== Regular Expressions (Regex) for Japanese =====
  
-//Expressions below provided by Hoyt Long. Download as text file by clicking the link in the tab.//
+The regular expressions provided below are separated by source and included as code blocks for easy copy-pasting. They can also be downloaded as text files by clicking the link in the tab of each code block.
+
+//Expressions below provided by Hoyt Long (University of Chicago).//
 <file text jpn_reg.txt> <file text jpn_reg.txt>
 HTML TAGS: <[^<]+?> HTML TAGS: <[^<]+?>
Line 20: Line 22:
 </file> </file>
  
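As a quick illustration of the HTML TAGS expression listed in jpn_reg.txt, the following is a minimal Python sketch, not part of the original page. The helper name `strip_html_tags` and the sample sentence are assumptions made for the example; only the pattern itself comes from the list above.

```python
import re

# Pattern copied from the "HTML TAGS" entry. The lazy quantifier +? makes
# each match stop at the first '>' that follows the opening '<'.
HTML_TAG = re.compile(r"<[^<]+?>")


def strip_html_tags(text: str) -> str:
    """Remove anything that looks like an HTML tag (hypothetical helper)."""
    return HTML_TAG.sub("", text)


if __name__ == "__main__":
    sample = "<p>吾輩は<b>猫</b>である。</p>"  # assumed sample input
    print(strip_html_tags(sample))
```

A lone `<` that is never closed by `>` is left untouched, so ordinary text such as `a < b` survives. The pattern is a text-cleaning shortcut and may misfire on scripts, comments, or attribute values that contain `>`.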
-//Expressions below collected from the defunct [[https://web.archive.org/web/20120422073323/http://crunchytoast.com/2009/12/12/japanese-regex-alzheimers-and-why-cant-i-remember/|Crunchytoast page]].//
+//Additional formulas from [[https://regex101.com/r/xhHFs2/1|Regular Expressions 101]].//
+
+//Expressions below collected from the defunct [[https://web.archive.org/web/20120422073323/http://crunchytoast.com/2009/12/12/japanese-regex-alzheimers-and-why-cant-i-remember/|Crunchytoast page]] and Terrance Snyder's [[https://gist.github.com/terrancesnyder/1345094|GitHub repository]].//
 <file text jpn_reg_crunchytoast.txt> <file text jpn_reg_crunchytoast.txt>
 Regex for matching ALL Japanese common & uncommon Kanji (4e00 – 9fcf) Regex for matching ALL Japanese common & uncommon Kanji (4e00 – 9fcf)
regex.1655824124.txt.gz · Last modified: 2022/06/21 15:08 by prcurtis