Embedditor is a software application designed to optimize vector search in Language Model (LLM) related applications. It features a user-friendly interface equipped with advanced Natural Language Processing (NLP) cleansing techniques like TF-IDF, normalize, and enrich your embedding tokens. Notably, Embedditor offers a means to intelligently split or merge content based on its structure, adding void or hidden tokens to make the chunks semantically coherent.

In addition, it's designed to be cost-effective, filtering out from embedding irrelevant tokens like stop-words, punctuations, and frequently low-relevant words. That way, users can save up to 40% on the cost of embedding and vector storage while increasing search results efficiency. It's deployable locally on PCs or enterprise cloud/on-premises environments, offering users control over their data.

The platform is designed to enhance the relevance of the content retrieved from a vector database—aiming to streamline efficiency and accuracy for users in the artificial intelligence (AI) and LLM-related applications industry. Its remarkable feature is the optimized way it handles embedding pre-processing, even for users with no data science background or technical skills.


  • Cleansing Techniques: Offers advanced NLP for efficient and accurate vector searches.
  • Content Optimization: Contains intelligent splitting or merging of content for more coherent chunks.
  • Cost-effectiveness: Filters out irrelevant tokens reducing up to 40% cost on embedding and vector storage.


  • Embedditor is an open-source application available for all at no cost. Users can directly install it from the repository or as a Docker image.

Popular Use Cases

Videos (Official, Reviews, How-To's)



Notion AI

Find Out How How Best To Utilise AI Tools

Our newsletter comes with exclusive discounts, trials and practical insights from within the industry