微信扫码
添加专属顾问
我要投稿
.01
.02
.03
{
"id": 0,
"vector": [0.01, -0.03, 0.15, ..., -0.08],
"payload": {
"company": "Apple Inc.",
"ticker": "AAPL",
"price": 175.50,
"market_cap": "2.8T",
"industry": "Technology",
"pe_ratio": 28.5
}
}
.04
.05
.06
import pandas as pd
from sentence_transformers import SentenceTransformer
# 加载句子嵌入模型
encoder = SentenceTransformer('all-MiniLM-L6-v2')
# 读取JSON格式的数据
df = pd.read_json('../../stock_data.json')
df = pd.json_normalize(df['stocks'])
df = df[df['company'].notna()]
data = df.to_dict('records')
from qdrant_client import QdrantClient
# 创建内存中的向量数据库
qdrant = QdrantClient(":memory:")
# 创建集合以存储向量数据
qdrant.recreate_collection(
collection_name="top_stocks",
vectors_config=models.VectorParams(
size=encoder.get_sentence_embedding_dimension(),
distance=models.Distance.COSINE
)
)
# 向量化数据并上传至向量数据库
valid_data = [doc for doc in data if isinstance(doc.get("company", ""), str) and doc["company"].strip()]
qdrant.upsert(
collection_name="top_stocks",
points=[
models.PointStruct(
id=idx,
vector=encoder.encode(doc["company"]).tolist(),
payload=doc
) for idx, doc in enumerate(valid_data)
]
)
# 执行查询并获取相似结果
query_prompt = "市值较高的科技公司"
query_vector = encoder.encode(query_prompt).tolist()
search_results = qdrant.search(
collection_name="top_stocks",
query_vector=query_vector,
limit=3,
with_payload=True
)
for result in search_results:
print(f"公司:{result.payload['company']},行业:{result.payload['industry']},市值:{result.payload['market_cap']}")
from openai import OpenAI
client = OpenAI(base_url="http://127.0.0.1:8080/v1", api_key="your_api_key")
completion = client.chat.completions.create(
model="LLaMA_CPP",
messages=[
{"role": "system", "content": "你是股票领域的专家,帮助用户选择股票并回答他们的问题。"},
{"role": "user", "content": "NVIDIA的市值和市盈率是多少?"},
{"role": "assistant", "content": str(search_results)}
]
)
print(completion.choices[0].message["content"])
.07
53AI,企业落地大模型首选服务商
产品:场景落地咨询+大模型应用平台+行业解决方案
承诺:免费POC验证,效果达标后再合作。零风险落地应用大模型,已交付160+中大型企业
2026-06-23
RAG之后,知识库开始自己长大
2026-06-23
AI 知识库开始分叉:LLM Wiki 和 GBrain 真正的差别
2026-06-23
谷歌发布OKF(Open Knowledge Format)规范,它与Karpathy的LLM-wiki是什么关系?
2026-06-23
RAG 的尽头,是 SQL?
2026-06-22
传统RAG已经落伍了?清华大神开源的这个 rag-skill,让知识库检索直接升维
2026-06-22
从个人知识库到企业级 RAG:我们最终选了 WeKnora
2026-06-22
RAG 不是先向量检索再回答:Metadata Filter 才是企业知识库的第一道门
2026-06-21
使用 LangSmith 进行 RAG 评估:构建生产级 RAG 系统的 AI 开发者指南
2026-04-06
2026-04-27
2026-04-02
2026-03-31
2026-04-23
2026-04-20
2026-04-09
2026-04-12
2026-04-22
2026-04-10
2026-06-23
2026-06-23
2026-06-15
2026-06-10
2026-06-10
2026-05-20
2026-05-18
2026-05-11