这里会显示出您选择的修订版和当前版本之间的差别。
两侧同时换到之前的修订记录 前一修订版 后一修订版 | 前一修订版 | ||
zh:notes:atot_credit [2020/10/07 13:54] pzczxs [Perplexity] |
zh:notes:atot_credit [2022/06/30 11:28] (当前版本) pzczxs 讨论状态变化了 |
||
---|---|---|---|
行 1: | 行 1: | ||
- | ====== AToT Model armed with Authorship Credit ====== | + | ====== AT Model armed with Authorship Credit ====== |
===== Requirements ===== | ===== Requirements ===== | ||
* [[http://scikit.ml/index.html|Scikit-Multilearn]] | * [[http://scikit.ml/index.html|Scikit-Multilearn]] | ||
行 22: | 行 22: | ||
Once correction is done, to run <color red>TargetArticleDoiUpdater.java</color> to import the related information in the file <color red>data/target_article_dois.xlsx</color> into MySQL database. | Once correction is done, to run <color red>TargetArticleDoiUpdater.java</color> to import the related information in the file <color red>data/target_article_dois.xlsx</color> into MySQL database. | ||
- | There are still three duplications with id = "WOS:000246296800029" and "WOS:000247372300026", id = "WOS:000297670800005" and "WOS:000293697700003", and id = "WOS:000393719000030" and id = "WOS:000394061000172". To run the following sql, the duplications will be removed. | + | There are still three duplications with id = "WOS:000246296800029" and "WOS:000247372300026", id = "WOS:000297670800005" and "WOS:000293697700003", and id = "WOS:000393719000030" and id = "WOS:000394061000172". To run the following SQL statements, the duplications will be removed. |
<code sql> | <code sql> | ||
DELETE FROM target_article_author WHERE target_article_id = "WOS:000247372300026"; | DELETE FROM target_article_author WHERE target_article_id = "WOS:000247372300026"; | ||
行 37: | 行 37: | ||
</code> | </code> | ||
- | However, several target articles are attached wrong DOI names or have still no DOI names. To run the following sql, the resulting DOI names will be corrected or added. | + | However, several target articles are attached wrong DOI names or have still no DOI names. To run the following SQL statements, the resulting DOI names will be corrected or added. |
<code sql> | <code sql> | ||
UPDATE target_article SET doi = "10.2307/24102078" WHERE id = "WOS:000252249700032"; | UPDATE target_article SET doi = "10.2307/24102078" WHERE id = "WOS:000252249700032"; | ||
行 50: | 行 50: | ||
UPDATE target_article SET doi = "10.1097/00006231-200306000-00013" WHERE id = "WOS:000183373800013"; | UPDATE target_article SET doi = "10.1097/00006231-200306000-00013" WHERE id = "WOS:000183373800013"; | ||
UPDATE target_article SET doi = "10.1097/00005176-200406001-00720" WHERE id = "WOS:000227354700101"; | UPDATE target_article SET doi = "10.1097/00005176-200406001-00720" WHERE id = "WOS:000227354700101"; | ||
+ | UPDATE target_article SET doi = "10.14670/HH-26.471" WHERE id = "WOS:000287804300007"; | ||
</code> | </code> | ||
行 174: | 行 175: | ||
To run <color red>TrainTestSetSplitter.java</color> in the package <color red>cn.edu.bjut.multilabel</color>. In this time, two files <color red>syn_bio.train.docs</color> and <color red>syn_bio.test.docs</color> in the directory <color red>data/multi-label</color> will be generated. | To run <color red>TrainTestSetSplitter.java</color> in the package <color red>cn.edu.bjut.multilabel</color>. In this time, two files <color red>syn_bio.train.docs</color> and <color red>syn_bio.test.docs</color> in the directory <color red>data/multi-label</color> will be generated. | ||
===== Parameter Tuning ===== | ===== Parameter Tuning ===== | ||
- | To run <color red>ATArithmeticCreditTuningParam.java</color>, <color red>ATAxiomaticCreditTuningParam.java</color>, <color red>ATGeometricCreditTuningParam.java</color>, <color red>ATGoldenNumberCreditTuningParam.java</color>, <color red>ATHarmonicCreditTuningParam.java</color>, and <color red>ATNetworkCreditTuningParam.java</color> in the package <color red>cn.edu.bjut.ui</color>. Note that if one wants to turn on the hyper-authorship strategy, the second parameter is set to <color red>true</color> in these java files, otherwise false. | + | To run <color red>ATArithmeticCreditTuningParam.java</color>, <color red>ATAxiomaticCreditTuningParam.java</color>, <color red>ATGeometricCreditTuningParam.java</color>, <color red>ATGoldenNumberCreditTuningParam.java</color>, <color red>ATHarmonicCreditTuningParam.java</color>, and <color red>ATNetworkCreditTuningParam.java</color> in the package <color red>cn.edu.bjut.ui</color>. Note that if one wants to turn on the hyper-authorship strategy, the second parameter is set to <color red>true</color> in these java files, otherwise false. |
+ | |||
+ | <code matlab> | ||
+ | > load train_perplexity; | ||
+ | |||
+ | > figure | ||
+ | > plotPerplexity(arithmetic_disabled, arithmetic_legends); | ||
+ | > figure | ||
+ | > plotPerplexity(arithmetic_enabled, arithmetic_legends); | ||
+ | |||
+ | > figure | ||
+ | > plotPerplexity(geometric_disabled, geometric_legends); | ||
+ | > figure | ||
+ | > plotPerplexity(geometric_enabled, geometric_legends); | ||
+ | |||
+ | > figure | ||
+ | > plotPerplexity(harmonic_disabled, harmonic_legends); | ||
+ | > figure | ||
+ | > plotPerplexity(harmonic_enabled, harmonic_legends); | ||
+ | |||
+ | > figure | ||
+ | > plotPerplexity(network_disabled, network_legends); | ||
+ | > figure | ||
+ | > plotPerplexity(network_enabled, network_legends); | ||
+ | |||
+ | > figure | ||
+ | > plotPerplexity(axiomatic_disabled, axiomatic_legends); | ||
+ | > figure | ||
+ | > plotPerplexity(axiomatic_enabled, axiomatic_legends); | ||
+ | |||
+ | > figure | ||
+ | > plotPerplexity(golden_number_disabled, golden_number_legends); | ||
+ | > figure | ||
+ | > plotPerplexity(golden_number_enabled, golden_number_legends); | ||
+ | </code> | ||
行 180: | 行 215: | ||
To run <color red>ATArithmeticCreditRunner.java</color>, <color red>ATAxiomaticCreditRunner.java</color>, <color red>ATGeometricCreditRunner.java</color>, <color red>ATGoldenNumberCreditRunner.java</color>, <color red>ATHarmonicCreditRunner.java</color>, and <color red>ATNetworkCreditRunner.java</color> in the package <color red>cn.edu.bjut.ui</color>. | To run <color red>ATArithmeticCreditRunner.java</color>, <color red>ATAxiomaticCreditRunner.java</color>, <color red>ATGeometricCreditRunner.java</color>, <color red>ATGoldenNumberCreditRunner.java</color>, <color red>ATHarmonicCreditRunner.java</color>, and <color red>ATNetworkCreditRunner.java</color> in the package <color red>cn.edu.bjut.ui</color>. | ||
- | ~~DISCUSSION~~ | + | ~~DISCUSSION:closed~~ |