zh:notes:atot_credit

Differences

This shows the differences between the selected revision and the current version of the page.

zh:notes:atot_credit [2020/09/10 08:12]
pzczxs [Update Sequence No. and Corresponding Author]
zh:notes:atot_credit [2022/06/30 11:28] (current)
pzczxs Discussion status changed
Line 1: Line 1:
-====== AToT Model armed with Authorship Credit ======
+====== AT Model armed with Authorship Credit ======
 ===== Requirements =====
   * [[http://scikit.ml/index.html|Scikit-Multilearn]]
Line 22: Line 22:
 Once the correction is done, run <color red>TargetArticleDoiUpdater.java</color> to import the related information from the file <color red>data/target_article_dois.xlsx</color> into the MySQL database.
 
-There are still three duplications with id = "WOS:000246296800029" and "WOS:000247372300026", id = "WOS:000297670800005" and "WOS:000293697700003", and id = "WOS:000393719000030" and id = "WOS:000394061000172". To run the following sql, the duplications will be removed.
+There are still three pairs of duplicates: id = "WOS:000246296800029" and "WOS:000247372300026", id = "WOS:000297670800005" and "WOS:000293697700003", and id = "WOS:000393719000030" and "WOS:000394061000172". Run the following SQL statements to remove them.
 <code sql>
 DELETE FROM target_article_author WHERE target_article_id = "WOS:000247372300026";
Line 37: Line 37:
 </code>
  
-However, several target articles are attached wrong DOI names or have still no DOI names. To run the following sql, the resulting DOI names will be corrected or added.
+However, several target articles have wrong DOI names or still have none. Run the following SQL statements to correct or add these DOI names.
 <code sql>
 UPDATE target_article SET doi = "10.2307/24102078" WHERE id = "WOS:000252249700032";
Line 50: Line 50:
 UPDATE target_article SET doi = "10.1097/00006231-200306000-00013" WHERE id = "WOS:000183373800013";
 UPDATE target_article SET doi = "10.1097/00005176-200406001-00720" WHERE id = "WOS:000227354700101";
+UPDATE target_article SET doi = "10.14670/HH-26.471" WHERE id = "WOS:000287804300007";
 </code>
  
Line 64: Line 65:
 Once the correction is done, run <color red>TargetArticleSeqNoAndIsReprintUpdater.java</color> in the package <color red>cn.edu.bjut.ui</color>.
 
-For unknown reasons, three coauthors are missed from the publication with id = ""WOS:000365103600006"";
+For unknown reasons, three coauthors are missing from the publication with id = "WOS:000365103600006". Run the following SQL statements to add them.
+<code sql>
+INSERT author (id, full_name, last_name, first_name) VALUES (10846, "Linard, Alban", "Linard", "Alban");
+INSERT author (id, full_name, last_name, first_name) VALUES (10849, "Bóbeda, Edmundo López", "Bóbeda", "Edmundo López");
+INSERT author (id, full_name, last_name, first_name) VALUES (10851, "Marechal, Alexis", "Marechal", "Alexis");
+
+INSERT target_article_author (target_article_id, author_id, seq_no_original, seq_no, is_reprint_original, is_reprint) VALUES ("WOS:000365103600006", 10846, 4, 4, 0, 0);
+INSERT target_article_author (target_article_id, author_id, seq_no_original, seq_no, is_reprint_original, is_reprint) VALUES ("WOS:000365103600006", 10849, 5, 5, 0, 0);
+INSERT target_article_author (target_article_id, author_id, seq_no_original, seq_no, is_reprint_original, is_reprint) VALUES ("WOS:000365103600006", 10851, 6, 6, 0, 0);
+</code>
 ===== Fetch and Import Citing Articles =====
 Run <color red>DownloadByWosId.java</color> to generate the file <color red>citing_article.wos_id</color> in the directory <color red>data</color>. Using this file, fetch the full records and cited references in BibTeX format from Web of Science, and save them in the directory <color red>data/wos/citing</color>.
Line 160: Line 170:
 
 <code bash>
-> python split_data.py multi-label/syn_bio.corpus 0.45 multi-label/syn_bio.splits
+> python split_data.py data/multi-label/syn_bio.corpus 0.45 data/multi-label/syn_bio.splits
 </code>
  
 Run <color red>TrainTestSetSplitter.java</color> in the package <color red>cn.edu.bjut.multilabel</color>. This generates two files, <color red>syn_bio.train.docs</color> and <color red>syn_bio.test.docs</color>, in the directory <color red>data/multi-label</color>.
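As a rough illustration of the train/test partition described above, here is a minimal Python sketch. It is not the project's split_data.py or TrainTestSetSplitter.java, whose internals are not shown on this page: it uses a plain seeded random shuffle, and it assumes that the value 0.45 is the fraction of documents held out for testing. The function name split_indices and the seed parameter are hypothetical.

```python
import random


def split_indices(n_docs, test_fraction, seed=0):
    """Randomly partition document indices 0..n_docs-1 into (train, test).

    Assumption: test_fraction is the share of documents held out for testing.
    This is a plain random split, not a label-stratified one.
    """
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    indices = list(range(n_docs))
    # A seeded generator keeps the split reproducible across runs.
    random.Random(seed).shuffle(indices)
    n_test = round(n_docs * test_fraction)
    test = sorted(indices[:n_test])
    train = sorted(indices[n_test:])
    return train, test


# Hypothetical corpus of 20 documents with 45% held out.
train, test = split_indices(20, 0.45)
print(len(train), len(test))
```

A multi-label corpus may call for a stratified split that balances label frequencies across the two sets; the random split here only shows the bookkeeping.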
+===== Parameter Tuning =====
+Run <color red>ATArithmeticCreditTuningParam.java</color>, <color red>ATAxiomaticCreditTuningParam.java</color>, <color red>ATGeometricCreditTuningParam.java</color>, <color red>ATGoldenNumberCreditTuningParam.java</color>, <color red>ATHarmonicCreditTuningParam.java</color>, and <color red>ATNetworkCreditTuningParam.java</color> in the package <color red>cn.edu.bjut.ui</color>. To turn on the hyper-authorship strategy, set the second parameter to <color red>true</color> in these Java files; otherwise set it to false.
+
+<code matlab>
+> load train_perplexity;
+
+> figure
+> plotPerplexity(arithmetic_disabled, arithmetic_legends);
+> figure
+> plotPerplexity(arithmetic_enabled, arithmetic_legends);
+
+> figure
+> plotPerplexity(geometric_disabled, geometric_legends);
+> figure
+> plotPerplexity(geometric_enabled, geometric_legends);
+
+> figure
+> plotPerplexity(harmonic_disabled, harmonic_legends);
+> figure
+> plotPerplexity(harmonic_enabled, harmonic_legends);
+
+> figure
+> plotPerplexity(network_disabled, network_legends);
+> figure
+> plotPerplexity(network_enabled, network_legends);
+
+> figure
+> plotPerplexity(axiomatic_disabled, axiomatic_legends);
+> figure
+> plotPerplexity(axiomatic_enabled, axiomatic_legends);
+
+> figure
+> plotPerplexity(golden_number_disabled, golden_number_legends);
+> figure
+> plotPerplexity(golden_number_enabled, golden_number_legends);
+</code>
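The plots above compare training perplexity across settings. As a reminder of what that quantity measures, here is a minimal Python sketch under the usual definition: the exponential of the negative mean log-likelihood per token. How the Java tuning classes compute their values is not shown on this page, so the function name perplexity and its input format (a list of natural-log token likelihoods) are assumptions.

```python
import math


def perplexity(token_log_likelihoods):
    """Perplexity = exp(-(1/N) * sum of natural-log likelihoods over N tokens).

    Assumption: each entry is log p(token) under the fitted model.
    """
    if not token_log_likelihoods:
        raise ValueError("need at least one token")
    total = sum(token_log_likelihoods)
    return math.exp(-total / len(token_log_likelihoods))


# A model that assigns probability 0.25 to every token behaves like a
# uniform choice among four words, so its perplexity is about 4.
uniform = [math.log(0.25)] * 8
print(perplexity(uniform))
```

Lower perplexity means the model assigns higher probability to the observed tokens, which is why it serves as a tuning criterion.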
 +
 +
+===== Author Interest Discovery =====
+Run <color red>ATArithmeticCreditRunner.java</color>, <color red>ATAxiomaticCreditRunner.java</color>, <color red>ATGeometricCreditRunner.java</color>, <color red>ATGoldenNumberCreditRunner.java</color>, <color red>ATHarmonicCreditRunner.java</color>, and <color red>ATNetworkCreditRunner.java</color> in the package <color red>cn.edu.bjut.ui</color>.
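The runner names refer to different authorship credit schemes. For orientation, here is a Python sketch of three of them in their commonly cited byline-position forms, where the author in position i of N receives a weight of N+1-i (arithmetic), 2^(N-i) (geometric), or 1/i (harmonic), normalized so that the credits sum to 1. Whether the Java classes use exactly these forms is an assumption, and the network, axiomatic, and golden-number schemes are not covered. All function names are hypothetical.

```python
def _normalize(weights):
    """Scale positive weights so that they sum to 1."""
    total = sum(weights)
    return [w / total for w in weights]


def arithmetic_credit(n_authors):
    """Credit falls linearly with byline position: weight N + 1 - i."""
    return _normalize([n_authors + 1 - i for i in range(1, n_authors + 1)])


def geometric_credit(n_authors):
    """Credit halves with each later position: weight 2 ** (N - i)."""
    return _normalize([2 ** (n_authors - i) for i in range(1, n_authors + 1)])


def harmonic_credit(n_authors):
    """Credit is proportional to the reciprocal of the position: weight 1 / i."""
    return _normalize([1 / i for i in range(1, n_authors + 1)])


# Credit shares for a three-author paper under each scheme.
for scheme in (arithmetic_credit, geometric_credit, harmonic_credit):
    print(scheme.__name__, [round(c, 3) for c in scheme(3)])
```

In all three schemes the first author receives the largest share; they differ in how quickly the share decays for later positions.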
  
-~~DISCUSSION~~
+~~DISCUSSION:closed~~
zh/notes/atot_credit.1599696720.txt.gz · Last modified: 2020/09/10 08:12 by pzczxs