相似文献/References:
[1]徐济惠. 基于Simhash算法的海量文档反作弊技术研究[J].计算机技术与发展,2014,24(09):103.
XU Ji-hui. Research on Huge Amounts of Documents Anti-spamming Technique Based on Simhash Algorithm[J].,2014,24(08):103.
[2]石雁,李朝锋. 结合统计和词间关系的文本关键词计算方法[J].计算机技术与发展,2015,25(12):22.
SHI Yan,LI Chao-feng. A Method of Text Keyword Calculation by Combining Statistics with Relationship Between Words[J].,2015,25(08):22.
[3]彭双和,图尔贡·麦提萨比尔,周巧凤. 基于Simhash的中文文本去重技术研究[J].计算机技术与发展,2017,27(11):137.
PENG Shuang-he,Tuergong MAITISABIER,ZHOU Qiao-feng. Research on Deduplication Technique of Chinese Text with Simhash[J].,2017,27(08):137.
[4]王诚,王宇成.基于Simhash 的大规模文档去重改进算法研究[J].计算机技术与发展,2019,29(02):115.[doi:10.3969/j.issn.1673-629X.2019.02.024]
WANG Cheng,WANG Yucheng.Research on Improved Large-scale Documents Deduplication Algorithm Based on Simhash[J].,2019,29(08):115.[doi:10.3969/j.issn.1673-629X.2019.02.024]