{"id":661,"date":"2019-12-18T16:12:36","date_gmt":"2019-12-18T07:12:36","guid":{"rendered":"http:\/\/www.cl.cs.okayama-u.ac.jp\/?p=661"},"modified":"2019-12-27T13:49:22","modified_gmt":"2019-12-27T04:49:22","slug":"kyotocorpus4-0%e3%81%a8ntc1-5%e3%82%92%e5%8f%96%e3%82%8a%e5%87%ba%e3%81%99%e9%9a%9b-2019%e5%b9%b412%e6%9c%88","status":"publish","type":"post","link":"https:\/\/www.cl.cs.okayama-u.ac.jp\/?p=661","title":{"rendered":"Errors when restoring KyotoCorpus4.0 and NTC1.5 (2019\/12\/18)"},"content":{"rendered":"<p>I had some trouble extracting the Kyoto University Corpus 4.0 and the NAIST Text Corpus 1.5, so I am writing down what I ran into.<br \/>\nThe NAIST Text Corpus cannot be built until the Kyoto University Corpus has been restored, so it is important to extract the Kyoto University Corpus accurately.<\/p>\n<p><a href=\"http:\/\/nlp.ist.i.kyoto-u.ac.jp\/index.php?%E4%BA%AC%E9%83%BD%E5%A4%A7%E5%AD%A6%E3%83%86%E3%82%AD%E3%82%B9%E3%83%88%E3%82%B3%E3%83%BC%E3%83%91%E3%82%B9\">Kyoto University Corpus 4.0<\/a><br \/>\nis a corpus of annotations on the 1995 edition of the Mainichi Shimbun data. Restoring it requires the 1995 Mainichi Shimbun data and the programs downloaded from the link above.<\/p>\n<p>Running the restoration on Ubuntu 18.04 LTS failed with the following error, and the corpus could not be extracted.<br \/>\n<code><br \/>\neuc-jp \"\\xE3\" does not map to Unicode at .\/src\/dupli.pl line 
16, <STDIN> line 584.<br \/>\n<\/code><br \/>\nChanging the locale on Ubuntu looked like a lot of work, so I ran the following steps on CentOS 6 and CentOS 7 machines instead.<\/p>\n<ol>\n<li> Copy the data to the CentOS machine\n<li> Change the encoding declarations in src\/format.pl and src\/dupli.pl that are involved in the error above<br \/>\n<code><br \/>\nformat.pl <\/p>\n<p>use encoding 'euc-jp';<br \/>\n#use open IO => ':encoding(euc-jp)';<br \/>\n#binmode(STDERR, ':encoding(euc-jp)');<br \/>\n#binmode STDOUT, ':encoding(euc-jp)';<br \/>\nuse encoding 'euc-jp', STDOUT => 'euc-jp';<br \/>\n<\/code><br \/>\n<code><br \/>\ndupli.pl (add this line)<br \/>\nuse encoding 'euc-jp', STDOUT => 'euc-jp';<br \/>\n<\/code><br \/>\nThen run the bundled auto_conv.<\/p>\n<li> In 950106.KNP the tags are shifted by one full-width space, so fix the surrounding lines by hand<br \/>\nThe first article of 950106 was misaligned.<br \/>\n<code><br \/>\ndat\/rel\/950106.KNP  (dat\/syn\/950106.KNP is misaligned in the same way)<br \/>\n# S-ID:950106001-001 \u90e8\u5206\u524a\u9664:0:\u3000 \u90e8\u5206\u524a\u9664:12:\u8535\u76f8 KNP:2002\/12\/11 MOD:2004\/12\/29<br \/>\n* 0 1D<br \/>\n+ 0 2D<br \/>\n\u3000\u3055\u304d\u304c \u3055\u304d\u304c\u3051 * \u540d\u8a5e \u7d44\u7e54\u540d * *<br \/>\n\u3051 \u306e * \u52a9\u8a5e \u63a5\u7d9a\u52a9\u8a5e * *<br \/>\n* 1 3P<br \/>\n+ 1 2D<br \/>\n\u306e\u6b66 \u305f\u3051\u3080\u3089 * \u540d\u8a5e \u4eba\u540d * *<br \/>\n\u6751\u6b63 \u307e\u3055\u3088\u3057 * \u540d\u8a5e \u4eba\u540d * *<br \/>\n<\/code><br \/>\nThe original sentence (dat\/num\/950106.org) is:<br \/>\n<code><br \/>\n# S-ID:950106001-001<br 
\/>\n\u3000\u3055\u304d\u304c\u3051\u306e\u6b66\u6751\u6b63\u7fa9\u4ee3\u8868\uff08\u8535\u76f8\uff09\u3068\u793e\u4f1a\u515a\u306e\u4e94\u5cf6\u6b63\u898f\u526f\u66f8\u8a18\u9577\u304c\uff0e\uff0e\uff0e\uff0e\uff0e<br \/>\n<\/code><br \/>\nThe sentence begins with a full-width space, and removing that space appears to fail.<br \/>\nI happened to have data from another environment where the restoration had worked, so I copied only 950106.KNP from there.<br \/>\n<code><br \/>\nFix dat\/rel\/950106.KNP and dat\/syn\/950106.KNP<br \/>\n<\/code>\n<\/ol>\n<ul>\n<li>Additional information (2019\/12\/27)<\/li>\n<p>Another place where the tags are misaligned (full-width spaces seem to cause a variety of problems):<br \/>\nS-ID:950104062-001<br \/>\nThis one also has to be fixed by hand.<br \/>\n<code><br \/>\n# S-ID:950104062-001 \u90e8\u5206\u524a\u9664:0:\u3000\u25c7 KNP:2002\/08\/22 MOD:2005\/03\/01<br \/>\n* 0 2D<br \/>\n+ 0 3D<br \/>\n\u3000 \u3064\u307e * \u540d\u8a5e \u666e\u901a\u540d\u8a5e * *<br \/>\n\u25c7 \u304c * \u52a9\u8a5e \u683c\u52a9\u8a5e * *<br \/>\n* 1 2D<br \/>\n+ 1 2D <rel type=\"\u30ac\" target=\"\u30bf\u30a4\u30e0\" sid=\"950104062-001\" tag=\"2\"\/><br \/>\n\u59bb \u300c * \u7279\u6b8a \u62ec\u5f27\u59cb * *<br \/>\n\u304c\u300c \u30d5\u30eb \u30d5\u30eb\u3060 \u5f62\u5bb9\u8a5e * \u30ca\u5f62\u5bb9\u8a5e \u8a9e\u5e79<br \/>\n+ 2 3D<br \/>\n\u30d5\u30eb\u30bf \u30bf\u30a4\u30e0 * \u540d\u8a5e \u666e\u901a\u540d\u8a5e * *<br \/>\n\u30a4 \u300d * \u7279\u6b8a \u62ec\u5f27\u7d42 * *<br 
\/>\n<\/code>\n<\/ul>\n<p><strong>Next, NTC1.5.<\/strong><br \/>\nFirst, after running the restoration, <strong>confirm that 2927 files<\/strong> were created under dat\/ntc\/knp or ipa. If not, some files were not generated somewhere, and the numbers in your experiments will not match.<br \/>\nI built NTC1.5 from the KyotoCorpus fixed as described above, but unfortunately <strong>the opposite phenomenon occurs at the same place<\/strong>, so it has to be fixed by hand.<\/p>\n<p>NTC1.5 creates data in Kyoto University Corpus format in NTC_1.5\/dat\/ntc\/knp.<br \/>\nThis was generated without problems.<br \/>\nOn the NTC_1.5\/dat\/ntc\/ipa\/ side, however, the 950106 article this time contains a tag for the full-width space, so the alignment is off.<br \/>\n<code><br \/>\nipa\/950106-0000-950106001.ntc<\/p>\n<p># S-ID:950106001-001 \u90e8\u5206\u524a\u9664:0:\u3000 \u90e8\u5206\u524a\u9664:12:\u8535\u76f8 KNP:2002\/12\/11 MOD:2004\/12\/29<br \/>\n* 0 1D 1\/0<br \/>\n\u3055      \u3000      \u3000      \u8a18\u53f7-\u7a7a\u767d       _       _       O       _<br \/>\n\u304d\u304c\u3051\u306e        \u30b5\u30ad\u30ac\u30b1        \u3055\u304d\u304c\u3051        
\u540d\u8a5e-\u56fa\u6709\u540d\u8a5e-\u7d44\u7e54      _       _<br \/>\n       B-ORGANIZATION   _<br \/>\n* 1 3P 3\/0<br \/>\n\u6b66      \u30ce      \u306e      \u52a9\u8a5e-\u9023\u4f53\u5316     _       _       O       _<br \/>\n\u6751\u6b63    \u30bf\u30b1\u30e0\u30e9        \u6b66\u6751    \u540d\u8a5e-\u56fa\u6709\u540d\u8a5e-\u4eba\u540d-\u59d3   _       _       B-PERSON<br \/>\n        _<br \/>\n\u7fa9\u4ee3    \u30de\u30b5\u30e8\u30b7        \u6b63\u7fa9    \u540d\u8a5e-\u56fa\u6709\u540d\u8a5e-\u4eba\u540d-\u540d   _       _       I-PERSON<br \/>\n        _<br \/>\n\u8868\u3068    \u30c0\u30a4\u30d2\u30e7\u30a6      \u4ee3\u8868    \u540d\u8a5e-\u30b5\u5909\u63a5\u7d9a   _       _       O       _<br \/>\n<\/code><br \/>\nIt looks like this can only be fixed by hand. Since only this one sentence is affected, it may be better to process the data under NTC_1.5\/dat\/ntc\/knp\/ rather than the data under NTC_1.5\/dat\/ntc\/ipa\/.<\/p>\n<p>Whether anything else is misaligned is hard to notice unless an error shows up while processing the data. In short, I learned that restoring an annotated corpus in a modern UTF-8 environment is difficult.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I had some trouble extracting the Kyoto University Corpus 4.0 and the NAIST Text Corpus 1.5, 
so I am writing down what I ran into. The NAIST Text Corpus cannot be built until the Kyoto University Corpus has been restored, so the Kyoto University &hellip; <a href=\"https:\/\/www.cl.cs.okayama-u.ac.jp\/?p=661\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-661","post","type-post","status-publish","format-standard","hentry","category-1"],"_links":{"self":[{"href":"https:\/\/www.cl.cs.okayama-u.ac.jp\/index.php?rest_route=\/wp\/v2\/posts\/661","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.cl.cs.okayama-u.ac.jp\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.cl.cs.okayama-u.ac.jp\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.cl.cs.okayama-u.ac.jp\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cl.cs.okayama-u.ac.jp\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=661"}],"version-history":[{"count":19,"href":"https:\/\/www.cl.cs.okayama-u.ac.jp\/index.php?rest_route=\/wp\/v2\/posts\/661\/revisions"}],"predecessor-version":[{"id":681,"href":"https:\/\/www.cl.cs.okayama-u.ac.jp\/index.php?rest_route=\/wp\/v2\/posts\/661\/revisions\/681"}],"wp:attachment":[{"href":"https:\/\/www.cl.cs.okayama-u.ac.jp\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=661"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.cl.cs.okayama-u.ac.jp\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=661"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.cl.cs.ok
ayama-u.ac.jp\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=661"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}