000048712 000__ 02725cam\a22003495i\4500
000048712 001__ 48712
000048712 003__ SzGeWIPO
000048712 005__ 20240708150415.0
000048712 006__ m e d
000048712 008__ 231117s2021 sz|||||| |||||000|0 eng|d
000048712 022__ $$a1932-6203
000048712 035__ $$a(OCoLC)1412052383
000048712 040__ $$aSzGeWIPO$$beng$$erda$$dCaBNVSL
000048712 041__ $$aeng
000048712 24500 $$aMeasuring novelty in science with word embedding.
000048712 264_1 $$aSan Francisco, California :$$bPublic Library of Science (PLoS),$$c2021.
000048712 336__ $$atext$$btxt$$2rdacontent
000048712 337__ $$aunmediated$$bn$$2rdamedia
000048712 338__ $$avolume$$bnc$$2rdacarrier
000048712 4901_ $$aPublic Library of Science (PLoS) ;$$vVolume 18, No. 6
000048712 520__ $$aNovelty is a core value in science, and a reliable measurement of novelty is crucial. This study proposes a new approach of measuring the novelty of scientific articles based on both citation data and text data. The proposed approach considers an article to be novel if it cites a combination of semantically distant references. To this end, we first assign a word embedding–a vector representation of each vocabulary–to each cited reference on the basis of text information included in the reference. With these vectors, a distance between every pair of references is computed. Finally, the novelty of a focal document is evaluated by summarizing the distances between all references. The approach draws on limited text information (the titles of references) and publicly shared library for word embeddings, which minimizes the requirement of data access and computational cost. We share the code, with which one can compute the novelty score of a document of interest only by having the focal document’s reference list. We validate the proposed measure through three exercises. First, we confirm that word embeddings can be used to quantify semantic distances between documents by comparing with an established bibliometric distance measure. Second, we confirm the criterion-related validity of the proposed novelty measure with self-reported novelty scores collected from a questionnaire survey. Finally, as novelty is known to be correlated with future citation impact, we confirm that the proposed measure can predict future citation.
000048712 525__ $$aPublished in : PLoS ONE, vol. 18, no. 6 (2021).
000048712 650_0 $$aResearch.
000048712 650_0 $$aResearch$$xData processing.
000048712 650_0 $$aNovelties in literature.
000048712 7001_ $$aShibayama, Sotaro,$$eauthor.
000048712 7001_ $$aYin, Deyun,$$eauthor.
000048712 7001_ $$aMatsumoto, Kuniko,$$eauthor.
000048712 830_0 $$aPublic Library of Science (PLoS) ;$$vVolume 18, No. 6.
000048712 85641 $$uhttps://journals.plos.org/plosone/article?id=10.1371/journal.pone.0254034$$yView this resource
000048712 904__ $$aJournal article
000048712 980__ $$aBIB