Artificial intelligence in scholarly communications: An elsevier case study

Created at 6pm, Jan 4

kFTgSHfQ

Artificial Intelligence

Contract ID

XQQXTG2z_YGxyqnXOVQiQI7Ugl00QyD0Ars13czkCvI

File Type

PDF

Entry Count

Embed. Model

jina_embeddings_v2_base_en

Index Type

hnsw

Articial Intelligence/Machine Learning Solution. 4. Case study 2: Topics of prominence The idea of modelling science using computers and citation linkages is not a new idea. Eugene Gareld and Henry Small took this approach back in the 1980s, and their pioneer work played a role in the development of what is now known as Topic Prominence. Back in 1985, it took eight weeks to analyze 2% of the citation base in order to come up with a list of hot research topics namely, those in the Top 1%. Now, we use the entire corpus of literature to create a comprehensive view of the roughly one hundred thousand global research topics currently extant. 325 326

id: 490cdd475da355326e67f104f4f6fff3 - page: 7

A. Gabriel / Articial intelligence in scholarly communications: An elsevier case study When we think about modeling research, there are a few key design principles to consider: full coverage, the right level of topic granularity, accuracy of topics that contain the right papers, and the stability of topics over time. Based on researcher needs and aimed at portfolio analysis, we chose to identify roughly one hundred thousand topics in science using direct citation on citation linkages (including those to cited non-indexed items) in the full Scopus database.

id: c6c957b6a36c39f04b11c5b393d12df2 - page: 8

As I mentioned previously, it is important to use the full corpus of literature when modeling. Elseviers Topics of Prominence (TOP) tool does this. I wont go into too much detail here, but the following gure should give you an idea of the level of calculation required for such an analysis calculation of over half a billion cited-citing pairs to yield around ninety-seven thousand specic research topics (topic clusters) across every eld of research, from basic science to highly applied elds related to manufacturing and commercial technologies. A. Gabriel / Articial intelligence in scholarly communications: An elsevier case study To determine the right level of granularity, prior research and expert interviews were used to look at

id: f4c3f97c5bc863d9c3d9759ab8005e76 - page: 8

Perhaps one of the most important elements in any model is how accurately that model captures scientic activity. Using direct citation analysis, we have a higher accuracy at 105 scale than most of the current classication schema have using around one hundred to three hundred categories. No pre-existing categories are assumed, unlike journal-based classication schema, and the clusters are calculated from the bottom up. These clusters are small enough so that we can see manufacturing-oriented research in specic geographies, and multiple topics around a single larger specialty or discipline. This also captures interdisciplinary research in a fundamentally dierent way. 327 328

id: 95033a13a302ac73001e2bf278918f35 - page: 9

How to Retrieve?

# Search

curl -X POST "https://search.dria.co/hnsw/search" \
-H "x-api-key: <YOUR_API_KEY>" \
-H "Content-Type: application/json" \
-d '{"rerank": true, "top_n": 10, "contract_id": "XQQXTG2z_YGxyqnXOVQiQI7Ugl00QyD0Ars13czkCvI", "query": "What is alexanDRIA library?"}'
        
# Query

curl -X POST "https://search.dria.co/hnsw/query" \
-H "x-api-key: <YOUR_API_KEY>" \
-H "Content-Type: application/json" \
-d '{"vector": [0.123, 0.5236], "top_n": 10, "contract_id": "XQQXTG2z_YGxyqnXOVQiQI7Ugl00QyD0Ars13czkCvI", "level": 2}'