Google Gemini RAG Product Search for Square Online SMB Stores

AI-powered natural language product search that answers customer queries by indexing Square Online catalog items.

google-gemini rag square vector-search nextjs qdrant semantic-cache llm-router product-search

The problem

Small Square Online merchants lose sales because customers can't find products using the default keyword search.

Built from

Intro

This recipe builds an AI-powered natural-language product search for Square Online stores using a RAG (Retrieval-Augmented Generation) pipeline. You’ll index a Square Catalog into a Qdrant vector database, then serve search queries through Google Gemini — with semantic caching, cost-aware model routing, and multi-turn conversation support. By the end, you’ll have a set of Next.js API endpoints that let shoppers ask “red sneakers size 10” and get a human-like answer pointing them to the right products.

Prerequisites

Node.js >= 22 and pnpm 10 installed
Docker — to run Qdrant locally via docker run -p 6333:6333 qdrant/qdrant
Square account — create an app at developer.squareup.com, generate an OAuth token, and grant the ITEMS_READ scope
Google Cloud Platform project with the Vertex AI API enabled and a service account key with the aiplatform.user role
Langfuse account (optional) — for LLM observability and tracing
Basic familiarity with TypeScript, Next.js App Router, and vector databases

Step 1: Bootstrap the Next.js project and install dependencies

Start from an empty directory. Initialize a Next.js project with the App Router:

terminal

npx create-next-app@latest . --typescript --app --use-pnpm

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

190 kB·112 tests·96.0% coverage·vitest passing

SHA-2563067797b1085ff153b7389d8020149ccb08be34435f2ddc0c84e44db45751020

Book a conversation All solutions

Comments

Loading comments…

import { SquareClient, SquareError, type Square } from "square"; import type { Product } from "../types.js"; import type { Config } from "./config.js"; function isItemObject( obj: Square.CatalogObject, ): obj is Square.CatalogObject.Item { return obj.type === "ITEM"; } function isItemVariationObject( obj: Square.CatalogObject, ): obj is Square.CatalogObject.ItemVariation { return obj.type === "ITEM_VARIATION"; } export class SquareCatalogService { private client: SquareClient; constructor(client: SquareClient) { this.client = client; } async listProducts(): Promise<Product[]> { try { const pageableResponse = await this.client.catalog.list({ types: "ITEM", }); const products: Product[] = []; for await (const item of pageableResponse) { if (!isItemObject(item)) continue; const itemData = item.itemData; if (!itemData) continue; const variation = itemData.variations?.find(isItemVariationObject); const priceMoney = variation?.itemVariationData?.priceMoney; products.push({ id: item.id, name: itemData.name ?? "", description: itemData.description ?? "", category: itemData.categoryId ?? "", price: { amount: priceMoney?.amount ? Number(priceMoney.amount) : 0, currency: priceMoney?.currency ?? "USD", }, tags: itemData.reportingCategory?.id ? [itemData.reportingCategory.id] : [], }); } return products; } catch (error) { if (error instanceof SquareError) { throw new Error(`Square API error: ${error.message}`); } throw error; } } async getProduct(id: string): Promise<Product | null> { try { const response = await this.client.catalog.object.get({ objectId: id, }); const item = response.object; if (!item || !isItemObject(item)) return null; const itemData = item.itemData; if (!itemData) return null; const variation = itemData.variations?.find(isItemVariationObject); const priceMoney = variation?.itemVariationData?.priceMoney; return { id: item.id, name: itemData.name ?? "", description: itemData.description ?? "", category: itemData.categoryId ?? "", price: { amount: priceMoney?.amount ? Number(priceMoney.amount) : 0, currency: priceMoney?.currency ?? "USD", }, tags: itemData.reportingCategory?.id ? [itemData.reportingCategory.id] : [], }; } catch (error) { if (error instanceof SquareError) { throw new Error(`Square API error: ${error.message}`); } throw error; } } } export function createSquareClient(config: Config): SquareCatalogService { const client = new SquareClient({ token: config.SQUARE_ACCESS_TOKEN }); return new SquareCatalogService(client); }

import { CacheEngine, InMemoryAdapter, type CacheConfig, type CacheResult, type CacheEntry, } from "@reaatech/llm-cache"; import { QdrantAdapter } from "@reaatech/llm-cache-adapters-qdrant"; import type { Config } from "./config.js"; import type { FastembedEmbedder } from "./cache-embedder.js"; export class CatalogCacheService { private cache: CacheEngine; private adapter: QdrantAdapter; private embedder: FastembedEmbedder; private config: CacheConfig; constructor( adapter: QdrantAdapter, embedder: FastembedEmbedder, config: CacheConfig, ) { this.adapter = adapter; this.embedder = embedder; this.config = config; this.cache = new CacheEngine({ storage: new InMemoryAdapter(), vectorStorage: adapter, embedder, config, }); } async get( prompt: string, options?: { useCase?: string; model?: string }, ): Promise<CacheResult> { return this.cache.get(prompt, { useCase: options?.useCase, model: options?.model, modelVersion: options?.model, }); } async set( prompt: string, response: Record<string, unknown>, options?: { model: string; useCase: string }, ): Promise<CacheEntry> { return this.cache.set(prompt, response, { model: options?.model, modelVersion: options?.model, useCase: options?.useCase, }); } async invalidateProductCache(productId: string): Promise<void> { await this.cache.invalidate({ useCase: "product-search", promptHash: productId }); } } export async function createCacheService( config: Config, embedder: FastembedEmbedder, ): Promise<CatalogCacheService> { const adapter = new QdrantAdapter({ url: config.QDRANT_URL, collectionName: "llm-cache", vectorSize: 384, apiKey: config.QDRANT_API_KEY, }); await adapter.connect(); const cacheConfig: CacheConfig = { similarity: { threshold: 0.85, metric: "cosine", maxResults: 5 }, ttl: { default: 3600, factual: 1800, creative: 3600, analytical: 3600, sensitive: 3600, byUseCase: {} }, segmentation: { enabled: true, defaultUseCase: "product-search" }, storage: { adapter: "memory" }, vectorStorage: { adapter: "qdrant" }, cost: { enabled: false, currency: "USD" }, embedding: { provider: "openai", model: "text-embedding-ada-002", dimensions: 384, batchSize: 256, maxRetries: 3 }, observability: { metrics: false, tracing: false, logging: "error" }, }; return new CatalogCacheService(adapter, embedder, cacheConfig); }

import { QdrantClient } from "@qdrant/js-client-rest"; import type { IndexingStatus } from "../types.js"; import type { Config } from "./config.js"; import type { SquareCatalogService } from "./square-client.js"; import type { LocalEmbedder } from "./embedder.js"; export class ProductIndexerService { private qdrantClient: QdrantClient; private squareService: SquareCatalogService; private embedder: LocalEmbedder; private collectionName: string; constructor( qdrantClient: QdrantClient, squareService: SquareCatalogService, embedder: LocalEmbedder, collectionName: string, ) { this.qdrantClient = qdrantClient; this.squareService = squareService; this.embedder = embedder; this.collectionName = collectionName; } async ensureCollection(): Promise<void> { const collections = await this.qdrantClient.getCollections(); const exists = collections.collections.some( (c) => c.name === this.collectionName, ); if (!exists) { await this.qdrantClient.createCollection(this.collectionName, { vectors: { size: 384, distance: "Cosine" }, }); } } async runIndex(): Promise<IndexingStatus> { try { await this.ensureCollection(); const products = await this.squareService.listProducts(); const descriptions = products.map((p) => p.description); const ids: string[] = []; const points: Array<{ id: string; vector: number[]; payload: Record<string, unknown>; }> = []; for await (const batch of this.embedder.embedDocuments(descriptions)) { for (let i = 0; i < batch.length; i++) { const product = products[points.length]; points.push({ id: product.id, vector: batch[i], payload: { name: product.name, description: product.description, category: product.category, price: product.price, tags: product.tags, }, }); ids.push(product.id); } } if (points.length > 0) { await this.qdrantClient.upsert(this.collectionName, { points: points.map((p) => ({ id: p.id, vector: p.vector, payload: p.payload, })), }); } return "completed"; } catch { return "failed"; } } async getIndexStats(): Promise<{ pointsCount: number }> { const collection = await this.qdrantClient.getCollection( this.collectionName, ); return { pointsCount: collection.points_count ?? 0 }; } } export function createIndexer( config: Config, squareService: SquareCatalogService, embedder: LocalEmbedder, ): ProductIndexerService { const qdrantClient = new QdrantClient({ url: config.QDRANT_URL, apiKey: config.QDRANT_API_KEY, }); return new ProductIndexerService( qdrantClient, squareService, embedder, config.QDRANT_COLLECTION_NAME, ); }

import { createRouter, type RouterRouteSummary, } from "@reaatech/llm-router-engine"; import type { ModelDefinition } from "@reaatech/llm-router-core"; import { GoogleGenAI } from "@google/genai"; import type { Config } from "./config.js"; export class GeminiRouterService { private ai: GoogleGenAI; private router: ReturnType<typeof createRouter>; constructor(config: Config) { this.ai = new GoogleGenAI({ enterprise: true, project: config.GOOGLE_CLOUD_PROJECT, location: config.GOOGLE_CLOUD_LOCATION, }); const geminiFlash: ModelDefinition = { id: "gemini-2.5-flash", provider: "google", costPerMillionInput: 0.15, costPerMillionOutput: 0.6, maxTokens: 1000000, capabilities: ["general", "code"], }; const geminiPro: ModelDefinition = { id: "gemini-2.5-pro", provider: "google", costPerMillionInput: 1.25, costPerMillionOutput: 5.0, maxTokens: 1000000, capabilities: ["general", "code", "reasoning", "complex-reasoning"], }; const modelMap = new Map<string, ModelDefinition>(); modelMap.set(geminiFlash.id, geminiFlash); modelMap.set(geminiPro.id, geminiPro); this.router = createRouter({ executeModel: async (model, request) => { const resp = await this.ai.models.generateContent({ model: model.id, contents: request.prompt, config: { maxOutputTokens: request.maxTokens ?? 4096 }, }); return { content: resp.text ?? "", inputTokens: resp.usageMetadata?.promptTokenCount ?? 0, outputTokens: resp.usageMetadata?.candidatesTokenCount ?? 0, }; }, }); const registry = this.router.registry; registry.register(geminiFlash); registry.register(geminiPro); } async routeQuery(query: string): Promise<RouterRouteSummary> { return this.router.route({ prompt: query, strategy: "cost-optimized", }); } } export function createRouterService(config: Config): GeminiRouterService { return new GeminiRouterService(config); }

import { z } from "zod"; import { type NextRequest, NextResponse } from "next/server"; import type { SearchQuery } from "../../../src/types.js"; const SearchQuerySchema = z.object({ query: z.string().min(1, "query is required"), sessionId: z.string().optional(), maxResults: z.number().optional(), }); let services: Awaited<ReturnType<typeof initServices>> | null = null; async function initServices() { const { parseConfig } = await import("../../../src/services/config.js"); const { createEmbedder } = await import("../../../src/services/embedder.js"); const { createCacheEmbedder } = await import( "../../../src/services/cache-embedder.js" ); const { createCacheService } = await import( "../../../src/services/cache-service.js" ); const { createRouterService } = await import( "../../../src/services/router-service.js" ); const { createSessionService } = await import( "../../../src/services/session-service.js" ); const { ProductSearchService } = await import( "../../../src/services/search-service.js" ); const { QdrantClient } = await import("@qdrant/js-client-rest"); const config = parseConfig(); const embedder = await createEmbedder(); const cacheEmbedder = createCacheEmbedder(embedder); const cacheService = await createCacheService(config, cacheEmbedder); const routerService = createRouterService(config); const sessionService = createSessionService(); const qdrantClient = new QdrantClient({ url: config.QDRANT_URL, apiKey: config.QDRANT_API_KEY, }); const searchService = new ProductSearchService( cacheService, routerService, sessionService, embedder, qdrantClient, config.QDRANT_COLLECTION_NAME, ); return { searchService, sessionService }; } export async function POST(request: NextRequest): Promise<NextResponse> { try { if (!services) { services = await initServices(); } let body: SearchQuery; try { body = SearchQuerySchema.parse(await request.json()); } catch { return NextResponse.json( { error: "query is required" }, { status: 400 }, ); } const response = await services.searchService.search(body); return NextResponse.json(response); } catch (error) { const message = error instanceof Error ? error.message : "Internal server error"; return NextResponse.json({ error: message }, { status: 500 }); } }

import { type NextRequest, NextResponse } from "next/server"; let sessionService: Awaited<ReturnType<typeof initSession>> | null = null; async function initSession() { const { createSessionService } = await import( "../../../src/services/session-service.js" ); return createSessionService(); } export async function POST(request: NextRequest): Promise<NextResponse> { try { if (!sessionService) { sessionService = await initSession(); } const body = (await request.json()) as { userId?: string }; const session = await sessionService.createSession( body.userId ?? "anonymous", ); return NextResponse.json({ sessionId: session.id, userId: session.userId }, { status: 201 }); } catch (error) { const message = error instanceof Error ? error.message : "Internal server error"; return NextResponse.json({ error: message }, { status: 500 }); } } export async function GET(request: NextRequest): Promise<NextResponse> { try { if (!sessionService) { sessionService = await initSession(); } const sessionId = request.nextUrl.searchParams.get("sessionId"); if (!sessionId) { return NextResponse.json( { error: "sessionId query parameter is required" }, { status: 400 }, ); } const messages = await sessionService.getContext(sessionId); return NextResponse.json({ sessionId, messages }); } catch (error) { const message = error instanceof Error ? error.message : "Internal server error"; return NextResponse.json({ error: message }, { status: 500 }); } } export async function DELETE(request: NextRequest): Promise<NextResponse> { try { if (!sessionService) { sessionService = await initSession(); } const body = (await request.json()) as { sessionId?: string }; if (!body.sessionId) { return NextResponse.json( { error: "sessionId is required" }, { status: 400 }, ); } await sessionService.endSession(body.sessionId); return NextResponse.json({ ok: true }, { status: 200 }); } catch (error) { const message = error instanceof Error ? error.message : "Internal server error"; return NextResponse.json({ error: message }, { status: 500 }); } }

Endpoint	Purpose
`POST /api/search`	Natural-language product search with RAG
`POST /api/index`	Trigger full product re-index
`GET /api/index`	Get index stats (point count)
`POST /api/session`	Create a new conversation session
`GET /api/session?sessionId=...`	Get session message history
`DELETE /api/session`	Delete a session

import { describe, it, expect, vi } from "vitest"; vi.mock("langfuse", () => ({ Langfuse: class { trace = vi.fn().mockReturnValue({ update: vi.fn() }); shutdownAsync = vi.fn().mockResolvedValue(undefined); }, })); describe("observability", () => { it("initObservability returns no-op stub when keys missing", async () => { const { initObservability } = await import("../../src/lib/observability.js"); const config = { SQUARE_ACCESS_TOKEN: "test", SQUARE_LOCATION_ID: "test", QDRANT_URL: "http://localhost:6333", QDRANT_COLLECTION_NAME: "product-catalog", GOOGLE_CLOUD_PROJECT: "test", GOOGLE_CLOUD_LOCATION: "us-central1", }; const result = initObservability(config); expect(result).toBeDefined(); expect(typeof result.trace).toBe("function"); expect(() => result.trace({ name: "test" })).not.toThrow(); }); it("initObservability returns client when keys present", async () => { const { initObservability } = await import("../../src/lib/observability.js"); const config = { SQUARE_ACCESS_TOKEN: "test", SQUARE_LOCATION_ID: "test", QDRANT_URL: "http://localhost:6333", QDRANT_COLLECTION_NAME: "product-catalog", GOOGLE_CLOUD_PROJECT: "test", GOOGLE_CLOUD_LOCATION: "us-central1", LANGFUSE_PUBLIC_KEY: "pk-test", LANGFUSE_SECRET_KEY: "sk-test", LANGFUSE_HOST: "https://cloud.langfuse.com", }; const result = initObservability(config); expect(result).not.toBeNull(); }); it("traceSearch and traceIndexing do not throw", async () => { const { initObservability, traceSearch, traceIndexing } = await import("../../src/lib/observability.js"); const config = { SQUARE_ACCESS_TOKEN: "test", SQUARE_LOCATION_ID: "test", QDRANT_URL: "http://localhost:6333", QDRANT_COLLECTION_NAME: "product-catalog", GOOGLE_CLOUD_PROJECT: "test", GOOGLE_CLOUD_LOCATION: "us-central1", LANGFUSE_PUBLIC_KEY: "pk-test", LANGFUSE_SECRET_KEY: "sk-test", LANGFUSE_HOST: "https://cloud.langfuse.com", }; const langfuse = initObservability(config); const searchResponse = { results: [], answer: "test answer", query: "test", cacheHit: false, modelUsed: "gemini-2.5-flash", }; expect(() => { traceSearch("test query", searchResponse, langfuse); }).not.toThrow(); expect(() => { traceIndexing("completed", 2, langfuse); }).not.toThrow(); }); });

Google Gemini RAG Product Search for Square Online SMB Stores

The problem

Built from

Intro

Prerequisites

Step 1: Bootstrap the Next.js project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Bootstrap the Next.js project and install dependencies

Step 2: Configure environment variables with Zod

Step 3: Define shared types

Step 4: Create the Square catalog client

Step 5: Create the fastembed embedding service

Step 6: Create the cache adapter for semantic deduplication

Step 7: Create the Qdrant vector indexer

Step 8: Create the Gemini model router with cost-aware routing

Step 9: Create the session continuity service

Step 10: Create the RAG search service

Step 11: Create the observability layer

Step 12: Create the API route handlers

POST /api/search — the main search endpoint

POST /api/index — trigger product re-indexing

POST /api/session, GET /api/session, DELETE /api/session — session management

Step 13: Write tests

Route handler test (POST /api/search)

Observability test

Run the tests

Next steps