用户工具

站点工具


zh:notes:techemergence

差别

这里会显示出您选择的修订版和当前版本之间的差别。

到此差别页面的链接

两侧同时换到之前的修订记录 前一修订版
后一修订版
前一修订版
zh:notes:techemergence [2020/10/17 15:52]
pzczxs [Detect and Tokenize Sentences, and Recognize Entities]
zh:notes:techemergence [2020/10/17 16:19] (当前版本)
pzczxs [Rum the DIM Model]
行 54: 行 54:
  
 ===== Detect and Tokenize Sentences, and Recognize Entities ===== ===== Detect and Tokenize Sentences, and Recognize Entities =====
-To run <color red>​Converter2Genia.java</​color>​ in the package <color red>​cn.edu.bjut.genia</​color>​ of the project <color red>​EmergingTopicsConverter</​color>​. Thus, the articles will be saved in the directories <color red>​data/​genia/​DIM</​color>​ and <color red>​data/​genia/​CIM</​color>​. Each article is named by its resulting id. +To run <color red>​Converter2Genia.java</​color>​ in the package <color red>​cn.edu.bjut.genia</​color>​ of the project <color red>​EmergingTopicsConverter</​color>​. Thus, the articles will be saved in the directories <color red>​data/​contest-Genia/​DIM</​color>​ and <color red>​data/​contest-Genia/​CIM</​color>​. Each article is named by its resulting id. 
  
 <code bash> <code bash>
-> ./​run_geniass.sh geniass data/genia/DIM & +> ./​run_geniass.sh geniass data/contest-Genia/DIM & 
-> ./​run_geniatagger.sh geniatagger data/genia/DIM & +> ./​run_geniatagger.sh geniatagger data/contest-Genia/DIM & 
-> ./​run_geniass.sh geniass data/genia/CIM & +> ./​run_geniass.sh geniass data/contest-Genia/CIM & 
-> ./​run_geniatagger.sh geniatagger data/genia/CIM &+> ./​run_geniatagger.sh geniatagger data/contest-Genia/CIM &
 </​code>​ </​code>​
  
-For each document, two files will be generated with the extension name <color red>​.txt.ss</​color>​ and <color red>​.txt.ss.tag</​color>​. To save all <color red>​.txt.ss</​color>​ and <color red>​.txt.ss.tag</​color>​ files in the directories <color red>​data/​genia/​DIM</​color>​ and <color red>​data/​genia/​CIM</​color>​.+For each document, two files will be generated with the extension name <color red>​.txt.ss</​color>​ and <color red>​.txt.ss.tag</​color>​. To save all <color red>​.txt.ss</​color>​ and <color red>​.txt.ss.tag</​color>​ files in the directories <color red>​data/​contest-Genia/​DIM</​color>​ and <color red>​data/​contest-Genia/​CIM</​color>​.
  
 +===== Rum the CIM Model =====
 +To run <color red>​Converter2DIM.java</​color>​ in the package <color red>​cn.edu.bjut.genia</​color>​ of the project <color red>​EmergingTopicsConverter</​color>​. Several files will be generated for the DIM model in the directory <color red>​data/​contest-DIM/​emergence</​color>​.
 +
 +===== Rum the DIM Model =====
zh/notes/techemergence.1602921124.txt.gz · 最后更改: 2020/10/17 15:52 由 pzczxs