Semantic search: Qdrant + qwen3-embedding:4b + Local Ollama

Building a Local Semantic Search Engine with Qdrant, Qwen3-Embedding (4b), and Spring Boot

Today we will help Dipper manage his notes and search through the vector database Qdrant. Embeddings will be evaluated using local Ollama with model qwen3-embedding:4b. A Spring Boot WebFlux application will enable creating new notes and later finding relevant entries for a given request. How semantic search works Some LLMs process prompts, while others help generate embeddings. An embedding is just a vector — a sequence of numbers evaluated based on your input (note content in our example) ...

June 16, 2026 · 13 min · Alexey Y.
Part 1: Running Local LLM for Java Tests — Ollama + Gemma 4b + Devoxx Genie

Part 1: Running Local LLM for Java Tests — Ollama + Gemma 4b + Devoxx Genie

Maintaining high code coverage is essential but often tedious — especially when you need to cover controllers, services, repositories, and edge cases individually. AI‑powered test generation can automate most of this work, cutting hours of manual effort (in some cases, up to 40–50% of testing time). However, many enterprises — especially in finance, healthcare, or regulated industries — are not willing to share their codebase with third‑party LLM providers like OpenAI or Anthropic. Security policies, intellectual property concerns, and compliance requirements (e.g., GDPR, SOC2) often mandate that no data leaves the corporate network. ...

June 10, 2026 · 8 min · Alexey Y.