谷丙转氨酶偏高吃什么药| 慢性结肠炎是什么症状| 带环了月经推迟不来什么原因| 黑色的猫是什么品种| 腿酸是什么原因| 痢疾是什么原因引起的| 什么是碱性磷酸酶| 女人要的是什么| 胆结石什么原因引起的| 为什么会晕3d| white是什么意思颜色| 国保大队是干什么的| 糖尿病人晚餐吃什么最好| 喘不上气挂什么科| 三围是什么| 什么是干冰| 脚痒脱皮是什么原因| 家族是什么意思| 蟹柳是什么做的| 不是一路人是什么意思| 斑鸠喜欢吃什么食物| 梦到自己掉牙齿是什么预兆| 可定是什么药| 床咚是什么意思啊| mask是什么意思| 高压偏低是什么原因造成的| 肌张力高有什么表现| 什么是再生纤维面料| 癌胚抗原高是什么意思| 下肢动脉闭塞吃什么药| 血脉是什么意思| 外交部长是什么级别| 产褥期是什么意思| 安乃近片是什么药| 脖子上长癣是什么原因| 血液病有什么症状| 奶酪是什么做的| 维酶素片搭配什么药治萎缩性胃炎| 火把节是什么时候| 高油酸是什么意思| 酸菜鱼一般用什么鱼| 正佳广场有什么好玩的| afc是什么意思| 什么叫做焦虑症| 全身皮肤瘙痒是什么原因| 香云纱是什么面料| 调教什么意思| 什么睡姿对髋关节好| 梦见海龟是什么意思| 小孩子为什么会得抽动症| 什么叫磨玻璃结节| 破气是什么意思| 射手后面的星座是什么| 心脏造影是什么意思| 桑蚕丝被有什么好处| 吃什么利尿消肿| 碧玺是什么| 料酒和黄酒有什么区别| 修复胃粘膜吃什么药| 鹿鞭泡酒有什么功效| 若干是什么意思| 神经过敏是什么意思| 口腔溃疡喝什么饮料| 比例是什么| 不是经期有少量出血是什么原因| 河北古代叫什么| 什么食物含维生素b| 胎盘是什么| 突然想吐是什么原因| 南瓜是什么形状| 半梦半醒是什么意思| 什么好| 伊玛目是什么意思| 脚发麻什么原因| 什么叫个人修养| 芝士是什么做的| 三轮体空什么意思| 男人气血不足吃什么药| 高血压吃什么菜| 清奇是什么意思| 左眼跳什么预兆| 贫血缺什么| 种植牙是什么| 副乳有什么危害吗| 一九九八年属什么生肖| 爸爸的爸爸叫什么儿歌| 什么眉什么目| 王火火念什么| 单元剧是什么意思| 什么情况下需要做活检| 巨蟹座是什么象| 吃山药有什么好处| 伟哥叫什么| 小孩子走神是什么原因| 图谋不轨什么意思| 微腺瘤是什么| 一生一世是什么生肖| 为什么尿频繁怎么回事| 口苦口干吃什么药最好| 一键挪车什么意思| 身上长扁平疣是什么原因造成的| fashion是什么意思| 梦见猪是什么意思| 巴基斯坦是什么语言| chip什么意思| cdr是什么意思| 什么水适合婴儿冲奶粉| 百香果什么时候开花结果| 壑是什么字| 骑驴找马什么意思| 无证之罪什么意思| 什么魏什么赵| 神经鞘瘤挂什么科| 天朝是什么意思| 溏是什么意思| kfc是什么| 睡醒咳嗽是什么原因| 什么是有源音箱| 佛珠生菇讲述什么道理| 炎帝叫什么| 传度是什么意思| 什么叫白癜风| 精液什么颜色| 做梦梦到别人死了是什么征兆| 为什么隔夜茶不能喝| 吃坏东西拉肚子吃什么药| 得了子宫肌瘤注意什么| 脾胃虚弱吃什么中药| 回乳是什么意思| 圣诞节是什么时候| 红细胞偏高有什么危害| 蚂蝗长什么样| 五行缺土是什么意思| 喉咙痛是什么原因| 什么水果解酒| 耳朵痒痒是什么原因| fs是什么单位| 双规什么意思| 梦到小男孩是什么意思| 前列腺炎忌口什么食物| 含羞草长什么样| 沈阳有什么好玩的地方| 梦见苍蝇很多是什么意思| 九月一号是什么节日| 半月板是什么意思| ntr什么意思| cmb是什么意思| 蒲地蓝消炎片主治什么| 天相是什么意思| 吃什么水果补钙| 低血糖和贫血有什么区别| 后嗣是什么意思| 何弃疗是什么意思| 新生儿呛奶是什么原因引起的| jio是什么意思| 脉冲是什么| 箨是什么意思| 腹腔积水是什么原因造成的| 血热是什么原因引起的| 低血压吃什么好的最快女性| 逸五行属什么| 王朝马汉是什么意思| 炎症是什么引起的| poems是什么意思| 美国为什么不建高铁| 食管裂孔疝什么意思| 小孩干咳吃什么药| 累觉不爱是什么意思| 月经量少是什么原因啊| 黑色裤子配什么颜色t恤| 神经性皮炎用什么药| 单亲妈妈是什么意思| 候车是什么意思| 梦见自己找工作是什么意思| 聊是什么意思| 打嗝不停是什么病前兆| 膝盖酸痛什么原因| 组织液是什么| 1997年出生属什么| 阑尾炎手术后吃什么好| 梦见已故母亲预示什么| 白痰多是什么原因造成的| 梦见做饭是什么意思| o型血和ab型血生的孩子是什么血型| 爱出者爱返福往者福来什么意思| 六九年属什么| 睾丸扭转有什么症状| 肩膀疼吃什么药| 燕子吃什么| 转铁蛋白阳性什么意思| 扁食是什么| 女生私处长什么样| 什么是素质教育| charcoal是什么颜色| 梦见放生鱼是什么意思| 什么水果降血压| 心病有什么症状| 史迪仔是什么动物| 房速是什么意思| 为什么打雷闪电| 欲生欲死是什么意思| 疱疹是什么原因引起的| 胆囊是干什么用的| 2001年属蛇五行属什么| 对调什么意思| 啪啪啪什么感觉| 与会是什么意思| 1902年属什么生肖| 怀孕后吃避孕药有什么后果| 梦龙什么口味好吃| 避孕药什么牌子好| 肺阴虚吃什么食物最好| 什么草药能治痔疮| 尿潜血阴性什么意思| 叶子像什么| 一个既一个旦念什么| 电荷是什么| 中药不能和什么一起吃| 令羽读什么| 梦见水是什么预兆| 机遇什么意思| 为什么说婴儿摔床没事| 什么地坐着| 办残疾证需要什么条件| 秉承是什么意思| 三聚净戒是指什么戒| 七月初一是什么日子| 沐沐是什么意思| 什么痣不能点| 尾货是什么意思| 531是什么意思| 胃窦小弯是什么意思| 草莓的花是什么颜色| 胸闷是什么原因| 晚上吃什么能减肥| 检查前列腺需要做什么检查| 后天是什么日子| 天公作美是什么生肖| 什么意思啊| 天下乌鸦一般黑是什么意思| 干酪是什么| 薄荷不能和什么一起吃| 喉咙发炎不能吃什么食物| 12月16号是什么星座| 梨花是什么颜色的| 牛黄安宫丸什么季节吃| 什么时候期末考试| 42是什么意思| 尿发黄什么原因| 哪吒属什么生肖| 什么药不能喝酒| 身体怕冷什么原因| 什么门不能开| 蚕豆病是什么病有什么症状| 澈是什么意思| 政绩是什么意思| 芒果鱼是什么鱼| 乳腺增生什么意思| 办理结婚证需要什么材料| hp-是什么意思| 是什么原因导致肥胖| 金牛座有什么特点| value是什么意思| 穿刺活检是什么意思| 百度Jump to content

财政部关于印发《会计师事务所信息化促进工作方案》的通知

From Wikipedia, the free encyclopedia
(Redirected from CiteSeerX (identifier))
CiteSeerX
Type of site
Bibliographic database
Available inEspa?ol
OwnerPennsylvania State University College of Information Sciences and Technology
RevenueActive
URLciteseerx.ist.psu.edu Edit this at Wikidata
RegistrationOptional
Launched2008; 17 years ago (2008) / 1997; 28 years ago (1997)
Current statusActive
Content license
Creative Commons BY-NC-SA license[1]
百度 你能不能找个蒙古族青年呀?过去王昭君不就是做了蒙古族人的媳妇吗?周秉建清楚地记得,1972年回到北京探望伯父时,伯父认真地与她谈起了她的婚姻问题。

CiteSeerX (formerly called CiteSeer) is a public search engine and digital library for scientific and academic papers, primarily in the fields of computer and information science.

CiteSeer's goal is to improve the dissemination and access of academic and scientific literature. As a non-profit service that can be freely used by anyone, it has been considered part of the open access movement that is attempting to change academic and scientific publishing to allow greater access to scientific literature. CiteSeer freely provided Open Archives Initiative metadata of all indexed documents and links indexed documents when possible to other sources of metadata such as DBLP and the ACM Portal. To promote open data, CiteSeerX shares its data for non-commercial purposes under a Creative Commons license.[1]

CiteSeer is considered a predecessor of academic search tools such as Google Scholar and Microsoft Academic Search.[2] CiteSeer-like engines and archives usually only harvest documents from publicly available websites and do not crawl publisher websites. For this reason, authors whose documents are freely available are more likely to be represented in the index.

CiteSeer changed its name to ResearchIndex at one point and then changed it back.[3]

History

[edit]

CiteSeer and CiteSeer.IST

[edit]

CiteSeer was created by researchers Lee Giles, Kurt Bollacker and Steve Lawrence in 1997 while they were at the NEC Research Institute (now NEC Labs), Princeton, New Jersey, US. CiteSeer's goal was to actively crawl and harvest academic and scientific documents on the web and use autonomous citation indexing to permit querying by citation or by document, ranking them by citation impact. At one point, it was called ResearchIndex.

CiteSeer became public in 1998 and had many new features unavailable in academic search engines at that time. These included:

  • Autonomous Citation Indexing automatically created a citation index that can be used for literature search and evaluation.
  • Citation statistics and related documents were computed for all articles cited in the database, not just the indexed articles.
  • Reference linking, allowing browsing of the database using citation links.
  • Citation context showed the context of citations to a given paper, allowing a researcher to quickly and easily see what other researchers have to say about an article of interest.
  • Related documents were shown using citation and word based measures, and an active and continuously updated bibliography is shown for each document.

CiteSeer was granted a United States patent # 6289342, titled "Autonomous citation indexing and literature browsing using citation context", on September 11, 2001. The patent was filed on May 20, 1998, and has priority to January 5, 1998. A continuation patent (US Patent # 6738780) was filed on May 16, 2001, and granted on May 18, 2004.[citation needed]

After NEC, in 2004 it was hosted as CiteSeer.IST on the World Wide Web at the College of Information Sciences and Technology, The Pennsylvania State University, and had over 700,000 documents. For enhanced access, performance and research, similar versions of CiteSeer were supported at universities such as the Massachusetts Institute of Technology, University of Zürich and the National University of Singapore. However, these versions of CiteSeer proved difficult to maintain and are no longer available. Because CiteSeer only indexes freely available papers on the web and does not have access to publisher metadata, it returns fewer citation counts than sites, such as Google Scholar, that have publisher metadata.

CiteSeer had not been comprehensively updated since 2005 due to limitations in its architecture design. It had a representative sampling of research documents in computer and information science but was limited in coverage because it was limited to papers that are publicly available, usually at an author's homepage, or those submitted by an author. To overcome some of these limitations, a modular and open source architecture for CiteSeer was designed – CiteSeerX.

CiteSeerX

[edit]

CiteSeerX replaced CiteSeer and all queries to CiteSeer were redirected. CiteSeerX[4] is a public search engine and digital library and repository for scientific and academic papers, primarily with a focus on computer and information science.[4] However, recently CiteSeerX has been expanding into other scholarly domains such as economics, physics and others. Released in 2008, it was loosely based on the previous CiteSeer search engine and digital library and is built with a new open source infrastructure, SeerSuite, and new algorithms and their implementations. It was developed by researchers Isaac Councill and C. Lee Giles at the College of Information Sciences and Technology, Pennsylvania State University. It continues to support the goals outlined by CiteSeer to actively crawl and harvest academic and scientific documents on the public web and to use a citation inquiry by citations and ranking of documents by the impact of citations. Currently, Lee Giles, Prasenjit Mitra, Susan Gauch, Min-Yen Kan, Pradeep Teregowda, Juan Pablo Fernández Ramírez, Pucktada Treeratpituk, Jian Wu, Douglas Jordan, Steve Carman, Jack Carroll, Jim Jansen, and Shuyi Zheng are or have been actively involved in its development. Recently, a table search feature was introduced.[5] It has been funded by the National Science Foundation, NASA, and Microsoft Research.

CiteSeerX continues to be rated as one of the world's top repositories, and was rated number 1 in July 2010.[6] It currently has over 6 million documents with nearly 6 million unique authors and 120 million citations.[timeframe?]

CiteSeerX also shares its software, data, databases and metadata with other researchers, currently by Amazon S3 and by rsync.[7] Its new modular open source architecture and software (available previously on SourceForge but now on GitHub) is built on Apache Solr and other Apache and open source tools, which allows it to be a testbed for new algorithms in document harvesting, ranking, indexing, and information extraction.

CiteSeerX caches some PDF files that it has scanned. As such, each page includes a DMCA link which can be used to report copyright violations.[8]

Current features

[edit]

Automated information extraction

[edit]

CiteSeerX uses automated information extraction tools, usually built on machine learning methods such ParsCit, to extract scholarly document metadata such as title, authors, abstract, citations, etc. As such, there are sometime errors in authors and titles. Other academic search engines have similar errors.

Focused crawling

[edit]

CiteSeerX crawls publicly available scholarly documents primarily from author webpages and other open resources, and does not have access to publisher metadata. As such, citation counts in CiteSeerX are usually less than those in Google Scholar and Microsoft Academic Search who have access to publisher metadata.

Usage

[edit]

CiteSeerX has nearly one million users worldwide based on unique IP addresses and has millions of hits daily. Annual downloads of document PDFs were nearly 200 million for 2015.

Data

[edit]

CiteSeerX data is regularly shared under a Creative Commons BY-NC-SA license with researchers worldwide and has been and is used in many experiments and competitions.

Thanks to its OAI-PMH endpoint,[9] CiteSeerX is an open archive and its content is indexed like an institutional repository in academic search engines, for instance BASE and Unpaywall consumers.

Other SeerSuite-based search engines

[edit]

The CiteSeer model had been extended to cover academic documents in business with SmealSearch and in e-business with eBizSearch. However, these were not maintained by their sponsors. An older version of both of these could be once found at BizSeer.IST but is no longer in service.

Other Seer-like search and repository systems have been built for chemistry, ChemXSeer and for archaeology, ArchSeer. Another had been built for robots.txt file search, BotSeer. All of these are built on the open source tool SeerSuite, which uses the open source indexer Lucene.

See also

[edit]

References

[edit]
  1. ^ a b "CiteSeerX Data Policy". Archived from the original on 2025-08-14. Retrieved 2025-08-14.
  2. ^ Kodakateri Pudhiyaveetil, Ajith; Gauch, Susan; Luong, Hiep; Eno, Josh (2009). "Conceptual recommender system for CiteSeerX". Proceedings of the third ACM conference on Recommender systems. New York, New York, US: ACM Press. p. 241. doi:10.1145/1639714.1639758. ISBN 978-1-60558-435-5. S2CID 13900679.
  3. ^ Lawrence, Steve (2001). "ResearchIndex: Inside the world's largest free full-text index of scientific literature". Proceedings of the international conference on Knowledge capture - K-CAP 2001. p. 3. doi:10.1145/500737.500740. ISBN 1-58113-380-4. S2CID 19592721.
  4. ^ a b "About CiteSeerX". Archived from the original on 2025-08-14. Retrieved 2025-08-14.
  5. ^ "The CiteSeerX Team". Pennsylvania State University. Archived from the original on 2025-08-14. Retrieved 2025-08-14.
  6. ^ "Ranking Web of World Repositories: Top 800 Repositories". Cybermetrics Lab. July 2010. Archived from the original on 2025-08-14. Retrieved 2025-08-14.
  7. ^ "About CiteSeerX Data". Pennsylvania State University. Archived from the original on 2025-08-14. Retrieved 2025-08-14.
  8. ^ For example, "CiteSeerx – DMCA Notice". CiteSeerX 10.1.1.604.4916. Archived from the original on 2025-08-14. The document with the identifier "10.1.1.604.4916" has been removed due to a DMCA takedown notice. If you believe the removal has been in error, please contact us through the feedback page, along with the identifier mentioned in this page.
  9. ^ Hirst, Tony (2025-08-14). "Using OAI-PMH as a Single Record Level Query Interface to Citeseer". Archived from the original on 2025-08-14. Retrieved 2025-08-14.

Further reading

[edit]
[edit]
红细胞偏高是什么意思 黑茶色是什么颜色 十二指肠胃溃疡吃什么药 扁桃体肥大有什么影响 双鱼座是什么象星座
球拍状胎盘对胎儿有什么影响 载波是什么意思 为什么怀孕了就不来月经了 百香果吃了有什么好处 韭菜花炒什么好吃
胃炎不能吃什么 女人不排卵是什么原因造成的 3月13日是什么星座 人工荨麻疹是什么原因引起的 湖北九头鸟是什么意思
泡什么喝可以降血糖 肝硬化吃什么好 边界欠清是什么意思 嘴角周围长痘痘是什么原因 空调用什么插座
在什么的前面用英语怎么说hcv8jop2ns1r.cn 家庭教育是什么hcv9jop6ns1r.cn 智齿拔了有什么影响hcv9jop2ns4r.cn 什么水果清肝火hcv8jop6ns7r.cn 属蛇和什么属相相冲wuhaiwuya.com
小龙虾不能和什么一起吃hcv8jop7ns2r.cn 猫鼻支是什么症状hcv7jop6ns1r.cn 青字五行属什么hcv8jop7ns1r.cn 切除子宫对身体有什么伤害hcv8jop3ns9r.cn 多囊挂什么科aiwuzhiyu.com
胸部有硬块挂什么科fenrenren.com 甲状腺结节是什么原因引起的qingzhougame.com 小儿手足口病吃什么药hcv8jop1ns3r.cn 面部痉挛是什么原因引起的zsyouku.com 梦见金蛇有什么预兆hcv8jop5ns5r.cn
肚脐眼周围疼是什么原因hcv7jop7ns4r.cn 兰花叶子发黄是什么原因cl108k.com 照护保险是什么hcv9jop0ns2r.cn 刚怀孕需要注意什么bfb118.com 尿带血什么原因hcv8jop4ns0r.cn
百度