RAG Everywhere: Late Chunking, Verba Updates, Advanced Strategies, and More
Hello Weaviate Community,
This week, we're diving into late chunking for better context retention, our Q3 Community Survey, upcoming events, and introducing new team members.
Let's get started!
Late Chunking: Balancing Precision and Cost in Long Context Retrieval
Late chunking, a novel method introduced by Jina.ai, enhances context retention and retrieval performance. It is now available in Weaviate with minimal code changes!
It offers ColBERT-like functionality but maintains storage costs comparable to standard embedding models.
💡 How late chunking works:
1️⃣ Embed the entire document into multiple token embeddings.
2️⃣ Mark the positions of your chunks (e.g., sentences).
3️⃣ Create late chunk embeddings by pooling token embeddings at chunk positions.
➡️ Result: "Late chunks" retain more contextual information from surrounding text than traditional chunking methods.
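To make the three steps concrete, here is a minimal sketch of the pooling step in plain Python. It assumes mean pooling and uses tiny hand-written vectors in place of real model output; the `late_chunk` helper and the span format are hypothetical names for illustration, not part of the Weaviate or Jina APIs.

```python
from typing import List, Tuple

Vector = List[float]


def late_chunk(token_embeddings: List[Vector],
               spans: List[Tuple[int, int]]) -> List[Vector]:
    """Mean-pool token embeddings over each [start, end) token span.

    Hypothetical helper: the token embeddings are assumed to come from a
    single forward pass over the whole document, so each token vector
    already carries context from the surrounding text.
    """
    chunks = []
    for start, end in spans:
        if not 0 <= start < end <= len(token_embeddings):
            raise ValueError(f"invalid span ({start}, {end})")
        window = token_embeddings[start:end]
        dim = len(window[0])
        # Average each dimension across the tokens in this chunk.
        chunks.append(
            [sum(tok[i] for tok in window) / len(window) for i in range(dim)]
        )
    return chunks


# Step 1 (stand-in): five token embeddings for one document.
token_embeddings = [[1.0, 0.0], [3.0, 2.0], [0.0, 4.0], [2.0, 2.0], [4.0, 0.0]]
# Step 2: chunk boundaries as token index ranges, e.g. two sentences.
spans = [(0, 2), (2, 5)]
# Step 3: one pooled embedding per chunk.
chunk_embeddings = late_chunk(token_embeddings, spans)
print(chunk_embeddings)
```

The output has one vector per chunk, which is why storage stays comparable to a standard embedding model: only the pooled chunk vectors need to be stored, not every token vector.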
Learn more
Stay tuned! Late chunking will soon be a native feature in Weaviate's embedding service.
AI [in Prod] Seattle
One week away! Our in-person roadshow is landing at the AWS offices in Seattle on September 19th.
We know you want to attend the tech talks 💡 But there are a few other reasons you might want to stop by:
Show off your Seattle pride with our limited-edition t-shirts
Get a training certificate by completing Zain Hasan’s afternoon workshop
🫶🏻 Make new friends who are just as excited about GenAI as you are
Space is limited, so request a ticket here.
AI data tips, tricks, and tech
How-to
Enriching and Ingesting Data into Weaviate with Aryn by Shukri and Erika
Demo on how to ingest PDFs into Weaviate using Aryn.
Evolving AI use cases for large-scale enterprises by Ieva
Watch GenAI talks by Morningstar and Innovative Solutions from our Chicago Roadshow.
Verba: Chat with your data
Edward will guide you through Verba's latest updates, which include:
Custom metadata
Async ingestion powered by the latest Weaviate client
Vector visualization
And much more!
Learn how to run free, open-source RAG models and chat with your data in minutes.
Try out our Verba Live demo, featuring Weaviate documentation, blog posts, and videos.
Upcoming events
Online
Tuesday, September 17th | Weaviate Office Hours with Duda
Wednesday, September 18th | Dive into Chunking Strategies for RAG with Zain
Thursday, September 19th | GlassFlow Webinar: How to improve AI response with feedback loops in real-time
Thursday, September 19th | Introduction to Weaviate with Duda
In-person
London | Tuesday, September 17th | LLM Meetup with Daniel
Austin | Tuesday, September 17th | Current 2024
Seattle | Thursday, September 19th | AI [in Prod]
Berlin | Tuesday, September 24th | [hands-on] AI Workshop: building AI-native applications
Remember to check out the provided links for all the details on how to sign up. We can't wait to meet you!
Give us feedback: Q3 Community Survey
We'd love to hear your feedback again!
Our Q3 Weaviate Experience Survey is open until September 30th, 2024. This time, we're eager to learn about:
• How you evaluate models
• Your preferred deployment options
• The use cases you're tackling with Weaviate
Let's build and improve Weaviate together!
Take the survey to shape Weaviate's future and enhance our community experience.
Welcoming new faces
Mary joins us from Germany 🇩🇪 as a Machine Learning Engineer
Rodrigo comes from Spain 🇪🇸 and joins us as a QA Engineer
Luke is joining us from Georgia, US 🇺🇸 as a Commercial Account Executive
Michael joins us from Tennessee, US 🇺🇸 as our Marketing Operations and Lifecycle Manager
Danny joins us from Germany 🇩🇪 as a Full Stack Engineer
Weaviate open roles
Weaviate has many exciting opportunities available. Join our exceptional team and be part of a company recognized for its strong culture. We're proud to be included in Will Reed's Top 100 list!
Discover even more roles and opportunities on our career page! ✨
Thank you for reading
Do you have questions about Weaviate, vector databases, documentation, or other topics? We'd love to hear from you! Say hello in our Community Slack or join our Weaviate Forum to engage in community conversations. We're excited to see your participation in the coming weeks!
Weaviate is open source. Come by our GitHub repo, and don't forget to give us a star while you're there ⭐
Until next time,
Femke