# Query

curl -X POST "https://search.dria.co/hnsw/query" \
-H "x-api-key: <YOUR_API_KEY>" \
-H "Content-Type: application/json" \
-d '{"vector": [0.123, 0.5236], "top_n": 10, "contract_id": "PIXxyL9ZpQsLMBZ_QkNehWa-JXD6w7m2qGta0lh5Wek", "level": 2}'
        

Bash


# Query

response = requests.post(
    "https://search.dria.co/hnsw/query",
    headers={'x-api-key': '<YOUR_API_KEY>', 'Content-Type': 'application/json'},
    json={
    "vector": [
        0.123,
        0.5236
    ],
    "top_n": 10,
    "contract_id": "PIXxyL9ZpQsLMBZ_QkNehWa-JXD6w7m2qGta0lh5Wek",
    "level": 2
}
)
print(response.status_code)
print(response.json())
        

Python

The introduction of ChatGPT has led to a significant increase in the
utilization of Large Language Models (LLMs) for addressing downstream tasks.
There's an increasing focus on cost-efficient training and deployment within
this context. Low-cost training and deployment of LLMs represent the future
development trend. This paper reviews the evolution of large language model
training techniques and inference deployment technologies aligned with this
emerging trend. The discussion on training includes various aspects, including
data preprocessing, training architecture, pre-training tasks, parallel
training, and relevant content related to model fine-tuning. On the inference
side, the paper covers topics such as model compression, parallel computation,
memory scheduling, and structural optimization. It also explores LLMs'
utilization and provides insights into their future development.