Hello Alteryx Community,
I'm hoping someone can kindly help me with this scenario. I've built a vector search index using Google Cloud's Vertex AI, and it works well, but the costs are adding up, so I'd like to find a way to run it offline in Alteryx.
Here's what I've done so far:
- I created the vector embeddings and deployed the index in Vertex AI using Python code in a Jupyter notebook.
- The resulting vectorized data is in a JSON file (it's JSONL format with fields like 'id', 'title', 'embedding' (dense vector), and 'dimensions' (sparse indices)).
My question: Is there a way to replicate this vector search functionality offline in Alteryx? For example, using the Python Tool with libraries like Faiss for dense search and something for sparse/hybrid queries? Since I already have the JSON file with the vectors, I'd love to load it, create a local index, and run queries without relying on Google Cloud.
Any guidance, sample workflows, or code snippets would be greatly appreciated! I'm open to suggestions on handling the dense and sparse embeddings in Alteryx.
Thanks in advance for your help!