2024's Premier Selection: 39 Leading Personal and Voice Assistant Apps and Frameworks
This post provides a guide to personal, virtual, and voice assistant apps for 2024, highlighting over 30 top applications for the upcoming year.
In an era where time is of the essence and productivity is paramount, Personal Assistant, Virtual, and Voice Assistant Apps have become indispensable tools.
These apps, powered by advancements in Artificial Intelligence (AI), serve as digital companions that can handle a wide array of tasks and responsibilities, saving us precious time and effort. From setting reminders and making reservations to conducting research and managing emails, these apps are designed to streamline our lives and increase our efficiency.
The importance of these assistants in the realm of AI cannot be overstated. They serve as tangible applications of AI technology, showcasing its potential in everyday settings. Furthermore, as AI continues to evolve and improve, so too do these assistants, becoming more intuitive, versatile, and reliable.
As we move in 2024, let's take a look at over 30 of the best Personal Assistant, Virtual, and Voice Assistant Apps that stand at the forefront of this exciting intersection of AI and productivity.
However, it is important to note that many projects started as an experiment, and they are updated occasionally by their respected developers.
1. Kalliope
Kalliope is a framework for creating personal assistants, allowing users to connect input signals to actions, or neurons. It supports creating bots with existing neurons without coding, but also allows for custom module creation.
It can run on all Linux Debian based distributions, including Raspberry Pi, and supports multiple languages. It is designed for home automation and is always-on voice controlled.
2. Jarvis
Jarvis is a personal assistant for Linux, MacOS, and Windows that operates on the command line. It can provide voice interaction, weather updates, and location-based recommendations, among other features.
3. AIPA
AIPA (Artificial Intelligence Personal Assistant) is an intelligent personal assistant based on speech, vision, and IoT, inspired by JARVIS. It can interact through voice or text, execute commands such as playing music or conducting online searches, and control IoT devices.
AIPA can learn new things through a TensorFlow-based system and can remember user interactions. It also features multi-view face recognition.
It comes with security module, brain module, IoT module, ad interaction module. It is available under the MIT license
4. Mycroft
Mycroft Core is a hackable, open-source voice assistant platform. It is highly modular and works on many Linux distros as well as Raspberry Pi.
Developers and users can add many intents, and skills easily.
5. Yoda
Yoda is a versatile personal assistant offering features like Docker setup, on-demand module loading, mock external services in tests, custom commands, currency conversion, weather updates, expense management, gif creation, site checking, IP lookup, grep, horoscope, drink and recipe suggestions, along with regular bug fixes and improvements.
6. Robert
Robert is a personal assistant with a text and speech interface, task and alarms manager, and a code snippets manager. It comes with three themes and can be operated using hotkeys.
Its unique features include accessing Wikipedia, opening software, checking the weather, playing music, and a power mode. It also includes a task manager for noting down tasks.
Other Features
- Text and speech interface
- Task and alarms manager
- Code snippets manager
- Three themes
- Operable using hotkeys
- Access to Wikipedia
- Ability to open software
- Weather checking
- Music playing
- Power mode
- Inbuilt system for retrieving, creating, deleting, and editing code snippets
- Translation feature
- Settings for color customization
- Command interface through hotkeys or system tray icon
- Code snippets
7. Virtual BLU
BLU is a virtual assistant for Windows PC that features advanced Qt graphics. Additionally, BLU is an open-source voice assistant.
It can be install also on Fedora and Debian-based systems.
8. Jasper
Jasper is an open-source platform for creating voice-controlled applications. It's always on, listens for voice commands, and can be built with readily available hardware.
Jasper runs on Raspberry Pi and allows developers to create their own modules using a simple interface.
9. Naomi Project
The Naomi Project is an open-source platform for developing always-on, voice-controlled applications. It allows users to control various functions, is always listening for commands, and can be built with off-the-shelf hardware.
Currently, it supports Raspberry Pi and Linux, with cross-platform support coming soon. The project maintains thorough, up-to-date documentation.
10. Stephanie
Stephanie is an open-source platform designed for voice-controlled applications and automation of daily tasks, functioning like a virtual assistant. It can be controlled by voice to perform various tasks such as information retrieval, social network updates, weather updates, and more.
Stephanie can be configured for different levels of listening to maintain user privacy and can be used with any audio recording device.
11. Ame
Ame is a multi-modal, open-source virtual assistant framework powered by LLaMA, designed to operate on both enterprise and consumer-grade hardware.
However, its development is slow due to a single developer working on it, and audio file communication between the client and server is yet to be implemented.
12. Leon
Leon is an open-source personal assistant that lives on your server, capable of performing tasks upon request. It supports both voice and text communication, with an option for offline interaction to ensure privacy.
Leon's structure allows for the creation and sharing of custom skills, reducing the need for individual projects. It utilizes AI concepts and prioritizes privacy, with the ability to function offline.
13. Dragonfire
Dragonfire is an open-source virtual assistant for Ubuntu-based Linux distributions. It uses Mozilla DeepSpeech for voice commands and Festival Speech Synthesis System for text-to-speech tasks.
It processes commands through built-in commands, learning using advanced NLP and database management techniques, an open-domain question answering engine, and a deep conversation system trained with the Cornell Movie-Dialogs Corpus.
14. NOVA (NodeJS)
NOVA, an open-source personal assistant, can be hosted on a server and communicated with orally or in writing. However, it has performance issues, particularly on a Raspberry, likely due to the use of a chromium-based browser.
A similar project in Python is being developed to improve efficiency.
15. Dicio (Android)
Dicio is a free, open-source voice assistant for Android that supports multiple skills and languages, and provides speech and graphical feedback. It uses Vosk for speech-to-text and supports English, French, German, Greek, Italian, Russian, Slovenian, and Spanish.
Dicio can answer questions about search, weather, lyrics, open apps, and perform basic calculations.
Download
16. W.I.L.L (Retired)
The W.I.L.L Project, an open-source personal assistant that runs on a Python-based plugin framework, has been retired and development has ceased. It was designed to be free, user-friendly, and expandable, with access through a JSON API across various platforms.
17. Cortana
Cortana is an AI-powered python library that enables chatting with GPT via command line, speech to text conversion with openai-whisper, text to speech conversion with elevenlabs, creation of a personal assistant using whisper, GPT, and elevenlabs, and voice alteration using whisper and elevenlabs.
Cortana uses whisper to do speech to text, and then uses GPT to generate a response. It then uses elevenlabs to do text to speech, and plays the audio.
The assistant mode has a hotword detection system, so that you can say your desired to activate the assistant. It then listens for a command, and then responds. It'll ignore any commands that don't include the hotword.
18. Eva
Eva, an Emacs-based virtual assistant, is designed to track personal data, present it back to the user, and assist in task management. It's not a monolith, but an extensible toolbox that can be customized with new functions. It includes automatic loggers for idleness and buffer information.
Eva can also ask for user input on various topics at different times throughout the day, adjusting its frequency based on user responses.
19. Jarvis-Desktop-Voice-Assistant
The Jarvis Desktop Voice Assistant is a Python-built tool capable of performing various tasks such as Wikipedia searches, playing music, and opening browsers through voice commands.
While not as advanced as its cinematic counterpart, it can automate many daily tasks on personal computers and laptops.
What can Jarvis do?
- Greet user
- Tell current time and date
- Launch applications/softwares
- Open any website
- Tells about any person (via Wikipedia)
- Can search anything on Google
- Plays music
- Take important note in text file
- Can take screenshot and save it with custom filename
20. Helix - Personal Assistant for Linux/GNU
Helix is a personal Assistant made for Linux/GNU. He supports a wide range of commands and functions, optional you can also add an WolframAlpha API and OpenAI API for more features.
Features
- Voice compression and generation
- Music or video playing
- Conversation initiation with AI services
- Website opening
- Current time, date, or a random joke provision
- Information search about a person or thing on Wikipedia
- Web search conducting
- Complex calculations performance
- Mathematical problems solving
- Units conversion
- Derivative of a function finding
- Integral of a function finding
- Original content creation
- Complex topics explanation
- Suggestions provision
- Structured plans creation
21. pyRobBot
PyRobBot is a python package that uses OpenAI's GPT large language models to create a personal assistant with voice chat capabilities.
It offers continuous voice input and output, doesn't require button pressing to listen, and can search the internet for answers not in its training data.
Features
- Voice Chat with continuous voice input and output
- No need for button pressing, the assistant listens until you've stopped talking
- Internet access for information not in training data
- Web browser user interface with a demo app
- Integrated text interface for a traditional chatbot experience
- Automatic/editable conversation summary title
- Autosave & retrieve chat history
- Ability to resume text & voice conversations started outside the web interface
- Chat via terminal
- Fully configurable with a large number of supported languages
- Support for multiple Large Language Models (LLMs) via OpenAI API
- Choice of preferred Text-to-Speech (TTS) and Speech-To-Text (STT) engines
- Control over parameters passed to OpenAI API
- Ability to pass base directives to the LLM
- Dynamically modifiable AI parameters in each chat separately
- Chat context handling using embeddings
- Estimated API token usage and associated costs
- OpenAI API key is never stored on disk
22. Flint (Whatapp)
Flint is an open-source AI personal assistant that can be interacted with anytime over Whatsapp via Twilio.
23. NEMO
NEMO is an AI and machine learning-driven virtual assistant that responds to voice commands and performs various tasks in a human-like way. Its scope includes voice-controlled query resolution, application control, and e-messaging.
Features
- Voice Controlled Optimized Human-like Query Resolver using Artificial Intelligence
- Voice Controlled Application Control
- Voice Controlled E-messaging: email, whatsapp
- Voice Controlled Media Control
- Voice Controlled Web/Website Searching
- Voice Controlled Direction and Navigation
- Narrated Notes, Task/Appointment Scheduling
- Utility management: time, date, weather forecast
- Contact/Email Management
- Interact and Chat in a Human-like manner
- Listen to Jokes
- Get Date and Time Update
- Get Weather Forecast Update
- Google Search Support
- Search and Play anything on Youtube
- Play any Song
- Control your browser
- Open and Close any App or Application
- Visit any Website (e.g wikepedia, gmail, ) that has .in/.com/.org
- Open any Social Media sites (e.g facebook, twitter, instagram)
- Send Emails
- Send Whatsapp message
- Get Current Geo Location
- Get Direction or Navigation to Places
- Capture Screenshot
- Schedule your day
- Create To-do list
- Email and Contact management
24. Amigo
Amigo is a voice assistant similar to Siri, Alexa, Google Assistant, and Cortana. It can perform various tasks, including searching on Wikipedia, opening YouTube, Spotify, and WhatsApp, and executing custom commands.
25. The Chrome Assistant
Google Chrome's development has led to the concept of extensions that enhance browser features. However, some features like voice-speech search and offline pages are still missing.
The proposed solution is a native Google Chrome extension that acts as a smart personal assistant, integrating necessary features via code injection. This project will leverage tasks developed during the creation of Jarvis Personal Assistant, using a simple HTTP request to the hosted web application.
However, it requires some development skills to install, as it does not have any version ready to install from the Google Chrome web extension store.
26. Aria (Zotero AI Assiatnt)
Aria is a Zotero plugin powered by Large Language Models (LLMs) designed to assist in AI research. It offers features like drag-and-drop referencing for Zotero items and collections, autocompletion for creators, tags, items, and visual analysis.
27. CatAI
CatAI is an AI assistant that runs locally on your computer, inspired by Node-Llama-Cpp and Llama.cpp.
It features programming language auto-detection, real-time text streaming, and fast model downloads. Installation requires Node.js and the commands 'npm install -g catai', 'catai install vicuna-7b-16k-q4_k_s', 'catai up', and 'catai'.
28. Glowby
Glowby Basic is a voice-based AI assistant offering an autonomous mode, GPT-4 Turbo with Vision, DALL·E-powered image generation, multilingual translations, interactive adventure, Pulze support, and a hosted inference API for Text2Text generation models.
Features
- Experimental Autonomous Mode
- GPT-4 Turbo with Vision
- Image Generation powered by DALL·E
- Multilingual Translations
- Interactive Adventure
- Pulze support
- Hosted Inference API for Text2Text Generation Models
- Powerful, customizable voice-based AI assistant
- Pre-set questions and answers using the Glowbom builder
- Voice input and output
- Customizable prompts
- Ability to switch between different prompts
- Support for multiple languages
29. Ai Assisted Task Executor
The Task-Driven Autonomous Agent System is an advanced AI solution that utilizes GPT-4, Pinecone vector search, and the LangChain framework to efficiently handle, generate, and prioritize tasks.
It also offers interfaces for code, source control, file systems, documentation, and more.
Agents
- 📋 Task Manager Agent: Responsible for generating, managing, and prioritizing tasks in the task list.
- 🏃 Task Executor Agent: Processes and completes tasks using GPT-4 and the LangChain framework.
- 🧠 Memory Manager Agent: Manages memory storage of the system – short-term and long-term memory.
- ⚙️ Execution Context Agent: Handles external execution environments – file systems, APIs, unit tests.
- 🔒 Security and Safety Agent: Monitors and filtering throughout the process ensuring safe operation.
30. Dwata
Dwata is a desktop application that allows users to chat with AI models and extract insights from data. The app can connect to databases, APIs, or CSV files, and uses AI models to understand human language and interact with computers.
Users can add data or structure to the chats, and the AI will typically respond with SQL or Python code to extract the needed insights. Dwata can verify the AI-generated SQL and execute queries directly from the app, with Python code execution planned for future development.
Data sources
- Databases like PostgreSQL, MySQL or MongoDB
- SaaS products like Stripe or Shopify (API or CSV)
- Your own custom software (CSV or API)
- CSV files/folders
- Email (IMAP)
31. CustomChar
CustomChar is a project aimed at creating AI characters for use in games, simulations, virtual assistants, and other applications, independent of specific platforms, cloud services, or specialized hardware.
Applications include interactive game characters, personalized virtual assistants, online education experiences with virtual companions, and cloud services for creating personal virtual characters.
32. Amy
The Amy project, a personal assistance system developed by students from the University of Stuttgart, is deprecated and will no longer be maintained. It's a research project with potential harmful errors and untested functionality. A web interface for the project is being developed concurrently.
33. Melissa
Melissa is a virtual assistant for OS X, Windows, and Linux systems that uses Google Chrome's speech-to-text engine, OS X's say command, Linux's espeak command, or Ivona TTS. It was developed by Tanay Pant and a team, with a web UI designed by Nakul Saxena.
34. Marvin
Marvin is a virtual assistant capable of opening websites, conducting research, opening programs, playing music, setting reminders, and adding new voice commands without the need for coding.
It uses a voice recognition library to automate actions from a commands.json file, allowing for dynamic task execution. The assistant features modules for opening websites and programs, playing music, and setting reminders.
35- Tsuyu
Tsuyu is a lightweight virtual assistant framework based on Ame's design, providing a functional, fast alternative for users with less powerful systems or those who prefer not to work with Python.
It is easier to install and offers unique features like streaming and vision. However, it's currently in a preliminary state and may contain bugs.
Features
- Lightweight design based on Ame's framework
- Functional and fast, ideal for less powerful systems
- Non-reliance on Python
- Easier installation process
- Unique features including streaming and vision
- Simpler interface for quick prototype creation
- Lighter resource usage and faster generations
- Compatibility with ordinary consumer hardware
- JavaScript interfacing for users who prefer not to write Python
- Functional server/client interface
- Streaming capabilities in controller, web API, and web UI
- Crude vision support using ViT + GPT-2
- High levels of user support
- No licensing required, public domain software
36. Magick
Magick is a no-code Artificial Intelligence Development Environment for data pipelines and multimodal agents. It features real-time agents with unified memory and self, and social connectors to platforms like Discord, Twitter, and Twilio, with more connectors coming soon.
Features
- Realtime agents that interact with users and other agents
- Social connectors to Discord, Twitter and Twilio
- Ability to search Google, Wikipedia and the Semantic Web
- Included power tools including voice and image generation and vector search
- Powerful graph-based IDE for complex data pipelines
- Ability to embed graphs in subgraphs and share them for rapid community development
- Ability to automate, build, and bring ideas to life with AI
- No need to know how to code everything in a command line interface
- Visual node-based environment to connect to popular services and explore pre-built nodes and connectors
- Spells and nodes for data transformation
- Flexibility in creating nodes and re-arranging them
- Different node types including Input Node, Prompt Template, Code Node, Generator Node, and Wait For All Node
37. Friday.ai
38. LinTo
39. Python Assistant (PA)
Python Assistant (PA) is a voice command based assistant service written in Python 3.9+. It can recognize human speech or voice, talk to user and execute basic commands.
Features
- Voice command recognition
- Time, date, month, or day telling
- Web page opening
- General question answering
- YouTube music playing
- Master volume control
- Internet speed checking
- Weather updates
- MS Office applications opening
- Alarm setting
- Daily news briefing
- Note writing and reading
- Wikipedia searching
- Guidance on various topics
- WhatsApp messaging
- Email sending
- Distance calculation between cities
- Recycle Bin cleaning
- Quote telling
- COVID-19 case updates worldwide and for specific countries
- Joke telling
- Arithmetic operations
- Currency conversion
- Screenshot taking
- Poem telling
- Application opening and closing
- Computer shutdown or restart
- Internet searching