Hello loggers! I’d like to share a few thoughts on the ongoing AI feature and privacy.
To provide some context, several users have expressed apprehension regarding the in-progress feat/ai-lab branch. We didn’t anticipate that people would be so closely monitoring the source code and caring about a project that has yet to be released. However, we do appreciate the privacy concerns (which are crucial to us) and welcome any feedback or criticism to enhance both the product and our communication.
- Relevant links:
What is the purpose of the AI experiment?
- The aim of Logseq is to establish a better environment for both learning and collaboration, enabling us to form a network that connects our ideas and enhances the collective knowledge of humanity.
- AI possesses the ability to inquire and acquire knowledge from a vast amount of information on the network, allowing us to learn anything.
The plan
-
Features:
- Comprehensive semantic search throughout your entire graph, including assets like PDFs
- Text generation such as summarization and translation
- Image generation
- Audio transcription
- Chat
-
AI solutions:
- Local AI
- A plugin named LogMind, designed to offer local AI features, currently undergoing internal testing. Our engineer Junyi has been working diligently to ensure its performance and flexibility, allowing it to serve as a foundation for other plugins.
- Vector Database
- A runtime for any custom model
- Transformer embedding
-
LangChain HTTP Proxy (TODO)
- A local HTTP bridge for LangChain server to make it easy to develop new AI solutions in any language (not just Python and Javascript).
- A plugin named LogMind, designed to offer local AI features, currently undergoing internal testing. Our engineer Junyi has been working diligently to ensure its performance and flexibility, allowing it to serve as a foundation for other plugins.
- Cloud services
- The initial integration will be with OpenAI
- Local AI
-
For plugin developers:
- We’re working on APIs to make it easier for you guys to hook in. We’re also working on local AI models that will not transfer any data over the internet nor give us access to people’s data in any way.
I’ll use this post to answer some questions, feel free to leave a comment if you have any additional questions on the AI integration.
-
How to preserve privacy-first?
- A user has to do nothing by default as AI features are disabled by default.
- Each function can be individually enabled or disabled. For instance, you can choose to disable chat while still utilizing text generation.
- As fans of local-first technology, we anticipate the arrival of fully-featured local AI, and we’re putting more effort into local AI instead of cloud services such as the OpenAI integration.
- Regarding local AI:
- All data, including chat conversations, will be stored exclusively on your local device. If you use Logseq Sync, your information will be encrypted before being transmitted to our server.
- Concerning OpenAI integration:
- For those who prefer not to use this feature, no additional action is required, as it is disabled by default. If you choose to use it, the general guideline is to avoid sending sensitive information to OpenAI. Logseq will not transmit your entire graph to OpenAI; instead, you can select the data (specific blocks or pages) to be sent and review it before sharing it with OpenAI.
- We welcome any suggestions or ideas to further safeguard our privacy!
-
Why incorporate it into the core product instead of a plugin?
- A seamless user experience is crucial when integrating AI with other features like whiteboards, flashcards, and queries.
-
Will there be a charge for it?
- There will be no charge for OpenAI integration using your API token.
- We might offer a service in the future for users who lack an OpenAI API token but still wish to utilize it.
-
Will it be open-sourced?
- Yes, the OpenAI integration can be found in the branch feat/ai-lab, and the local AI plugin will be released following the testing phase.
-
How do you plan to enhance communication with the community?
- We will establish a monthly Q&A thread on this forum, followed by a Discord session where we address the most urgent and popular questions of the month.