OpenAI Whisper Applications

CHATTYRENTAL AI Room Assistant

FusionAI

Our app is a unique platform that offers both content creators and users an innovative way to generate and access various types of content. The app has two interfaces: Explorer and Creator, where visitors can access various types of content, including videos, articles, audios, and tweets while creators can upload, edit and use AI tools to generate content. Our app aims to solve the problem of time-consuming content creation and fragmented content discovery. By offering multiple types of content in a single platform, we aim to increase user engagement and retention while offering creators an opportunity to monetize their content. Market: The global content creation and discovery market is expected to reach $892.5 billion by 2027, with an annual growth rate of 16.8%. The increasing demand for video content, podcasts, and other forms of digital media presents a significant opportunity for our app to succeed in the market. Competitive analysis: Our app faces competition from established content creation and discovery platforms such as YouTube, Medium, and Spotify. However, our unique value proposition of offering multiple types of content in a single platform, along with AI generative tools for creators, sets us apart from competitors. Marketing strategy: Our app will be marketed primarily through social media, paid advertising, and partnerships with content creators and publishers. We will also offer referral programs to incentivize users to invite their friends and family to use the app. Revenue model: We plan to generate revenue through a freemium model, where the app is free to access for users, but creators pay for premium tools and features. We will also offer subscription plans for users to access premium content and an advertising model, where advertisers can display ads on the app.

FusionAI

WhisperChatGPTDALL-E-2

Chattyrental

ChattyRental is an innovative AI-powered software platform designed to revolutionize the room rental experience. By integrating cutting-edge AI technology into the booking process, ChattyRental enables room rental agencies to enhance their commercial systems and streamline their operations, resulting in significant cost savings and improved customer satisfaction. The platform's primary features include AI-driven conversational booking, personalized recommendations, and intelligent search capabilities, allowing customers to find and book rooms with ease. ChattyRental's mission is to provide a seamless and efficient booking experience, fostering loyalty among renters and driving growth for room rental agencies. Initially targeting agencies in Madrid, ChattyRental aims to expand its reach globally, offering its transformative solution to agencies worldwide. The platform's unique value proposition lies in its ability to optimize key processes, reducing sales department costs by up to 36%, while simultaneously delivering a more personalized and enjoyable experience for renters. ChattyRental's SaaS solution comprises three layers: the main dashboard, maintenance and issues, and the commercial dashboard. By focusing on value, innovation, collaboration, and transparency, ChattyRental aims to become the go-to platform for room rental agencies, providing tailored solutions to meet the evolving needs of their customers and empowering them to excel in the industry.

RedisChatGPTLangChainWhisperVercel

PrepQuest

PrepQuest is the ultimate interview preparation app that helps job seekers of all levels and industries prepare for their next interview. With a massive database of cutting-edge AI-generated interview questions powered by OpenAI’s state-of-the-art Chat GPT, using custom prompts the app generates interview questions for virtually any topic. The app offers a range of features to help users improve their interview skills. One of the standout features of PrepQuest is the AI-powered mock interviews. The app simulates real-life interview scenarios using Chat GPT with custom prompts and using whisper API the app takes users voice as input to provide a very realistic setting of an interview. Users can choose the interview role and level as per their requirements and using custom prompts the app creates an immersive interview experience tailored to users requirements.

Team Vision

AI Alliance 4 Voice Analytics

Call centers handle an immense volume of customer interactions every day, and it's crucial for businesses to evaluate the quality of these interactions to maintain high customer satisfaction rate. Traditionally, quality and assurance auditing has been a time-consuming and manual process, where human auditors listen to and evaluate customer calls. This approach is prone to human error, inconsistency, and scalability challenges. The Voice Analytics with AI aims to revolutionize the quality & assurance auditing process of call centers by transcribing and analyzing audio recordings using GPT-3 and Whisper models. The proposed solution leverages Automated Speech Recognition (ASR) and Large Language Models (LLM) to automate and streamline the quality and assurance auditing process. First, the system summarizes key information in call recordings, such as operator's name, issues, and solutions, and other relevant data points. After that, the solution conducts sentiment analysis to evaluate the tone and mood of the conversation using NLP and LLM. In addition, LLM evaluates customer experience and satisfaction levels and provides scores for each. Last but not least, the model ends the report of each call with feedback and insights about the performance of operator and suggests areas for improvement. Overall, the proposed solution has the potential to transform the call center industry, providing businesses with valuable accurate insights into their customer interactions and enabling them to take proactive steps to train their operators and improve their overall customer experience.

AI Alliance

ChatGPTWhisperGPT-3

VOICE OUT AI Translation

Voice Out is a revolutionary AI translation assistant that aims to solve the problem of inaccurate and ineffective communication between people from different linguistic backgrounds. In today's globalized world, communication is essential, and language barriers often cause misunderstandings and hinder productivity. Traditional translation software requires the user to speak with high accuracy and clarity to ensure a correct translation, which can be difficult for non-native speakers or in noisy environments. Voice Out uses cutting-edge deep learning algorithms to analyze the nuances of a speaker's voice, identify commonly used phrases, and provide accurate translations in real-time. This enables users to express themselves naturally, even if their grammar or pronunciation is not perfect. Additionally, Voice Out's intelligent learning capabilities allow it to adapt to a user's unique voice and vocabulary, making communication more seamless over time. Voice Out's user-friendly interface displays translations in real-time, allowing users to adjust their speech or ask follow-up questions. It also has the ability to translate both spoken and written language, making it a versatile tool for a wide range of communication needs. With Voice Out, individuals and businesses alike can communicate more effectively and efficiently across language barriers, unlocking new opportunities for collaboration, understanding, and growth.

Voice Out

Verbify

In today's globalized world, language barriers is a major obstacle in communication and information sharing, leading to inequality and exclusion. This is where Verbify steps in as a powerful speech-to-speech translator that seamlessly translates any speech into any language, while preserving the original speaker's tone and style. With Verbify, users can expand their reach and connect with people from all corners of the world. For instance, learners who struggle to understand a foreign language can now access any form of information, regardless of its language. Creators can now effortlessly reach a wider audience, transcending linguistic and cultural boundaries. Built as a Flask web app, Verbify leverages the cutting-edge technologies of OpenAI's whisper, gpt-3.5-turbo, and Eleven labs to provide users with a smooth and accurate translation experience. Simply upload your audio or video file or paste a YouTube or audio link, choose your desired language, and let Verbify do the rest. Experience the power of seamless communication and information sharing with Verbify today!

Verbify

Multilingual Voice Assistant

In short, it’s a multilingual voice assistant, that can help Not only to reduce the language barrier in using cutting edge technologies but tries to make everyday life a bit easier. It also increases accessibility of the technology that can be helpful for people who have some kind of disability. With the use of ChatGPT & Whisper API from Open API and by using Google Translate library from Python and Text to Speech API from Google Cloud I have created a web app that can take voice input from user and perform multiple tasks based on User preference, such as language Translation and/or Communication with chatGPT to get information.

Zion

Smart Notes Learn Better Fast

NoteMaster

Our project aims to improve the academic performance of undergraduate students in their first cycles, helping them to adapt to the academic environment and develop effective study skills. To achieve this, we have created an innovative platform that integrates artificial intelligence technologies, such as GPT-4 and Whisper, to offer a series of tools that facilitate the learning process. The tools offered by our platform include automatic transcription of lectures, which allows students to access the information presented in a more accessible way; summary generation, which helps them understand and retain essential information from lectures; and personalized learning paths, which guide students to the most appropriate resources and activities for their individual needs. In addition, our platform provides research and writing assistance, which facilitates the completion of high-quality academic papers. All of this helps to reduce the stress and frustration associated with adjusting to the university environment and improve study skills, which in turn translates into better academic performance. In the $5.76 billion grade app market, our solution has great potential to revolutionize the way students deal with academic challenges. Moreover, being aligned with Sustainable Development Goal 4, our project also contributes to improving the quality of education and promoting academic success in a broader context. We are currently seeking support and investment to incorporate GPT-4 and Whisper into our platform and optimize it, thus ensuring that our solution is at the forefront of artificial intelligence technology and has an even greater impact on the lives of university students.

WhisperGPT-4

Clear Speak

The SaaS (Software as a Service) product is a powerful speech analysis tool that utilizes state-of-the-art speech-to-text technology to transcribe recorded audio and analyze it for various speech patterns. The tool identifies areas where the user may need to improve their speech, including reducing stuttering and improving clarity and coherence. Once the audio is transcribed, the tool provides users with in-depth statistics on their speech patterns, including word frequency, common mistakes, and areas that need improvement. These statistics are displayed in an easy-to-understand format that allows users to quickly identify their strengths and weaknesses. One of the most unique aspects of the tool is its ability to give real-time feedback on speech patterns. As users speak, the tool analyzes their speech in real-time and offers suggestions on how to improve their communication skills. This feedback is critical for users who want to improve their speech in real-life situations and gain confidence in their ability to communicate effectively. The tool also provides personalized suggestions for each user based on their speech patterns and areas for improvement. For example, if the tool identifies that a user tends to stutter on certain words or phrases, it will offer personalized exercises and techniques to help them reduce their stuttering and speak more fluently. Overall, the speech analysis SaaS product is an invaluable tool for anyone looking to improve their communication skills, reduce stuttering, and gain confidence in their ability to speak effectively. With its cutting-edge speech-to-text technology and real-time feedback, the tool offers a powerful solution for anyone looking to improve their speech and become a more effective communicator.

Coders Legion

WhisperChatGPTOpenAI gymGPT-3

No code customer care bot

We have created a no-code platfrom that caters to the 'customer care' department of the ecommerce websites, wherein the user can chat with our bot and the bot will figure out the problem/issue with the order and take action accordingly. Like for example, the user starts chatting with the bot and tells that the product does not fit his size, then the bot will traverse through the dataset and extract the order details and see if the product is elligible for return, if it is, then it will accordingly inform the user that the product can be returned and will place an order for the same. This no - code platform caters to the needs of ecommerce and similar online businesses that need 'customer care' service but cannot afford one. By means of this bot we are enabling that feature and making the platform no-code so that anyone can use it.

Clippy

Speechify

An AI-based audio to video converter is a tool that can convert an audio recording into a video file. This tool uses advanced machine learning algorithms to analyze the audio recording and create a video that matches the content of the recording. In addition, this tool can add text in terms of captions to the video based upon the speech in the audio recording. The AI system listens to the audio file and uses speech recognition technology to transcribe the spoken words into text. Then, it synchronizes the text with the video frames to create captions that appear on the screen at the appropriate times. The resulting video file can be used for a variety of purposes, such as creating video content for social media platforms, generating marketing videos, creating instructional or educational videos, and more. The AI-based audio-to-video converter makes it easy for users to quickly and efficiently create high-quality video content from their audio recordings,

Night Owls

Room Booking AI Assistant

CHATTYRENTAL AI Room Rentals

ChattyRental is a revolutionary chatbot powered by AI, designed to simplify and enhance the room rental experience for both renters and rental companies. Our chatbot is equipped with advanced natural language processing capabilities, enabling users to book a room through a simple, conversational interface. Additionally, ChattyRental generates personalized recommendations for rental rooms based on a user's preferences and past behavior. With our intelligent search feature, users can find the right room that meets their needs quickly and easily. By leveraging the power of ChattyRental, room rental companies can cut their sales department costs by an average of 36%, while simultaneously improving customer satisfaction and user experience.

RedisWhisperGPT-3

MyQuiz AI

Have you ever wanted to test your knowledge on a specific topic, but found traditional methods of studying and taking quizzes to be tedious and unengaging? Look no further, because we have the solution! Introducing MyQuiz.AI, a trivia game that utilizes the power of AI to generate questions tailored to your interests and abilities. With just your voice, you can embark on a fun and challenging quiz journey that will leave you wanting more. So sit back, relax, and get ready to put your knowledge to the test with MyQuiz.AI! Our team has developed a cutting-edge speech-based game that incorporates advanced technologies to deliver a highly engaging and personalized user experience. The system utilizes the Whisper API for speech recognition, Redis for data storage, and ChatGPT to generate questions and validate user answers. The quiz asks unique questions every time, tailored to the user's level of knowledge and abilities. The system's machine learning capabilities ensure that the difficulty level of the questions is appropriate and challenging, and according to age. The Whisper API's advanced speech recognition capabilities provide an immersive and interactive experience, allowing users to use their voices to mention their age and category for quiz. This feature also makes the quiz accessible to users with disabilities or those who prefer voice-based interactions. The Redis database stores questions, answers, and user responses. Overall, our speech-based quiz or game represents a significant step forward in the field of educational technology. With its advanced algorithms and machine learning capabilities, the system offers a new and innovative way for users to learn and engage with the material. The quiz's personalized approach, speech-based interface, and advanced features make it a powerful educational tool that has the potential to revolutionize the way people learn and retain knowledge.

Space Cats

ChatGPTOpenAI gymRedisWhisperGPT-3

Smart Lecture

Our app is designed to address some common problems that students and learners face when trying to engage with lectures Difficulty taking comprehensive notes: Many students struggle to capture all of the key points and details of a lecture while also actively listening and processing the information being presented. This can result in incomplete or inaccurate notes that make it harder to study and review later. Time-consuming manual transcription: In order to review lectures later, students may need to manually transcribe the audio recordings, which can be time-consuming and tedious. Limited ability to identify important information: Even with comprehensive notes or transcripts, it can be challenging to distill the most important information from a lecture, especially if there is a lot of extraneous detail or repetition. Our app aims to address these problems by automating the process of creating summaries, notes, and questions from lecture audio. By using WhisperAI to transcribe the audio to text and ChatGPT to generate a summary, notes, and questions, the app streamlines the process of reviewing lectures and helps learners more easily identify and retain key information. Here is a possible flow for the app: The user opens the app and selects the lecture they want to review. The app uses WhisperAI to transcribe the lecture audio to text. The text is passed to ChatGPT, which generates a summary, notes, and questions based on the content of the lecture. The user can review the summary, notes, and questions generated by the app, edit them as needed, and save them for future reference. Overall, this app has the potential to be a valuable tool for learners who want to optimize their engagement with lectures and maximize their retention of important information.

The bad batch

Health BOt

an innovative and user-friendly health application that uses artificial intelligence to provide Ugandan citizens with access to vital health information. The app is designed to address the challenges that many Ugandans face in accessing quality healthcare services, particularly in rural areas where health facilities are scarce. The application is built using state-of-the-art technology, including ChatGPT API for natural language processing and speech-to-text capabilities, Streamlit for the user interface, and the Reddit API to access relevant health information. These tools work together seamlessly to provide a comprehensive and user-friendly health platform that meets the unique needs of Ugandan citizens. Through the app, users can access reliable and up-to-date information on common illnesses, including symptoms, causes, and treatments. They can also receive personalised recommendations based on their symptoms and medical history, as well as find nearby health facilities and book appointments. The app can also provide educational resources on topics such as sexual health, maternal and child health, and HIV/AIDS. The app's user-friendly interface and speech-to-text capabilities make it accessible to all Ugandan citizens, regardless of their level of education or literacy. This is particularly important in rural areas where illiteracy rates are high. Additionally, the app's use of local languages such as Luganda and Runyakitara ensures that it is inclusive and accessible to all Ugandans. Overall, "Health Solutions Uganda" is a powerful tool that has the potential to revolutionise healthcare in Uganda by providing access to vital health information and services to all citizens, regardless of their location or socioeconomic status.

BadaRama

ChatGPTWhisperRedis

QuizTube

QuizTube is a web application that generates multiple-choice questions based on the audio content of a YouTube video. Users can enter the link to a YouTube video, and QuizTube will download the audio from that video, use the Whisper API to transcribe the video then submit a request to chatGPT to generate multiple choice quiz based on the content in the transcription. The questions are designed to test the user's understanding of the content, and the app can be used for educational purposes, language learning, or just for fun. With QuizTube, users can turn any YouTube video into an interactive quiz!

chatgpt5

ChatGPTWhisperDALL-E-2Cohere GenerateCohere ClassifyRedis

WeCare Caretaker Assistant

We have built a solution for agencies which provide the caretaker services for parents who are in search of babysitters for their child. When users call the agency after business hours or when agents are not available for assistance, we are routing them to leave a voicemail with their babysitter requirement and contact number. With this solution, agents can focus on more complex tasks rather than manually retrieving voicemails, analysing them and coming up with a resolution. When the caller dials the agency phone number during office closed hours or peak hours when agents are not available to serve them, we route the caller to the voicemail menu where we ask them to leave a voicemail with babysitting requirements and their contact details, etc. Once the voicemail is available, we extract it and convert this speech to text using OpenAI’s whisper API which gives us the voicemail transcription. After that, we meticulously perform the prompt engineering for ChatGPT API to provide us all the required information from voicemail like intent, sentiment, babysitting date and time, etc in JSON format. Using this information, we query the EmployeeSchedule table which is in the H2 database. Once we have the information about availability of babysitters, we query RedisJSON to get the employee profile information like employee name, contact details, date of birth, languages spoken, image, etc. We then build a PDF document using itext library. This PDF containing available babysitter information will be sent on the caller’s WhatsApp. After this, we send an SMS to the agency as an alert notification about the customer enquiry and ask them to get in touch with the customer. Github link - https://github.com/technocouple/technocouple-caretaker-assistant Video link - https://drive.google.com/drive/folders/1NBew2U0Xgtm04ubQszjLvZV92fowR6-D?usp=sharing Presentation - https://drive.google.com/file/d/1TBMSU5Ohyn1v2P2u_RqbZOpuCvWv1Crq/view?usp=share_link DEMO is at the end of the video.

TechnoCouple

Interview assistant

When you're interviewing, it's important to focus on the process and listen carefully to the interviewee and get into the process. But when you're constantly distracted by taking notes and looking at a list of questions, you lose your effectiveness and maybe forget to ask something. Our app is designed to save the interviewer from unnecessary activities and help him or her focus on what's important. Now the interviewer can take notes and try to write down the interview, because our app will do it for him. It will also review the entire resume and answer specific queries. This is just the basic functionality we managed to implement in 48 hours. In the future, the app can work in real time, toss questions and give hints, also will save the processed interviews

Nathnenne

Curated Club

Curated Club is a subscription-based service that offers monthly deliveries of curated products related to the customer's individual interests and preferences. The service uses a personalized algorithm to analyze customer data, combined with ChatGPT API to understand natural language, and customer feedback to curate a selection of high-quality products that are tailored to each customer's specific needs and preferences. The service offers a wide range of themes to choose from, such as food and snacks, books, pet care, fitness, and more. It is designed to offer a fun and convenient way for customers to discover new products and hobbies, while also providing a personalized and seamless experience that keeps them coming back for more.

TalkyAI

ChatGPTWhisperCodex

NotAlone

People with dyslexia often find it hard to read and write, primarily because it’s “hard for them to mentally lock in” as described by the person suffering from Dyslexia. This makes studying a struggle for them however it is seen that in most cases people with this condition find it easier to read on electronic devices rather than the real document itself. Although it’s helpful to have a guide to help out when difficulties arise. But what if a person doesn’t have access to a guide? This is where NotAlone comes in to help. Dyslexia can be an obstacle but with LLMs and Deep Learning making people’s life easier let’s take a step to make it more accessible to everyone so that barriers like these don’t dominate a person’s will to learn and write. NotAlone is specifically designed to empower individuals with dyslexia by providing a seamless learning-rich writing environment tailored to their unique needs. Our goal is to ensure that no one feels left behind in today's fast-paced world. People with dyslexia often prefer speaking over writing, hence many take the help of an ASR app to help them do so. Inspired by this, we provide a Whisper-based STT feature to help them type by speaking. ChatGPT-based writing assistance is another essential feature of NotAlone. This feature provides personalized guidance and support by helping users overcome challenges in writing and reading by:- 1. Helping them write about anything they want 2. Grammar Correction 3. Rewriting 4. Explaining a word/phrase 5. Summarize a paragraph 6. Suggest Synonyms 7. Chat-based assistance Even though we help them read better but some words are complicated even for us to understand this is where Text-to-Speech service can help them read a word or a paragraph whenever they feel stuck. Beyond these core features, we provide an interface that allows users to adjust settings, such as line height, word spacing, and background color. All this along with custom fonts that cater specifically to dyslexic people.

UncomplicateIT

Intelligent Health Assistant

Intelligent Health Assistant is a groundbreaking app that utilizes AI technology to help patients with their symptoms before they see a doctor. The app records symptoms in a 10-second timeframe and transcribes them using Whisper API to store them in a file. The app then uses ChatGPT API to check the history of previous inputs, outputs, and current input, and then analyzes the symptoms and compares them to a vast database of medical information to identify potential illnesses and conditions. The app is designed to guide people who do not prioritize going to a doctor or who feel worried about their symptoms, and advise them on the urgency of their symptoms. This app can help reduce the number of patients who ignore medical symptoms, as well as help identify potential illnesses and conditions in their early stages.

MASTERS

WhisperChatGPTGPT-3

Translating Voices to Signs

Deaf and hard-of-hearing individuals face a multitude of challenges in their daily lives, with one of the biggest being the difficulty in communicating with hearing individuals who do not understand sign language. This communication barrier can lead to social isolation, limited access to education and employment opportunities, and a lack of participation in various social activities. Our project aims to address this challenge by developing a real-time speech-to-sign language translation solution that can bridge the communication gap between the deaf and hard-of-hearing and the hearing individuals. This solution has the potential to enhance accessibility and inclusivity for the deaf and hard-of-hearing community and improve their quality of life.

Pac-Man

OpenAI gymWhisper

AI Study Buddy

The problem I want to solve with this program is to increase my efficiency when studying for exams. Often I feel overwhelmed with my different forms of study material like lecture recordings, notes, slides, pdf's, or voice recordings. With this program I am attempting to import all of my material in one place and then, with the help of AI, create summaries based on my content. I can then enter a interactive world where my program creates tailored exercises for me in order to prepare for my exam. Perhaps I can even feed my program with a past exam at one point and it then generates another exam in the style of the previous exam. I believe this project has great potential and many use cases for students, but also for any individual that is trying to test their knowledge on their chosen topic.

Solorider881909

AI Rap Song Creation

Generate a rap song for any YouTube video Music is easier for people to accept information than articles. If there is a news or introduction video on YouTube, we only need to enter the YouTube link, and a rap song will be automatically generated. The process is as follows: Enter the YouTube URL Enter the background music of the YouTube instrumentals Enter the rap lyrics style First, the video will be converted into text using the Whisper API, and then the GPT3.5 API will condense and highlight the key points of the text. These key points will be written into lyrics in a certain style. Next, we use Python to download the background music of the YouTube instrumentals, preprocess the lyrics, capture the rhythm of the music, match the words with the heavy beats, and then use gTTS to speak the words and automatically adjust the audio position. Then your rap song will be generated!

ProjectK AI

WhisperDALL-E-2GPT-3

AI video translator

Clients go to our website, where they can paste the URL or video itself and select the desired language for translation. The model using GPT-3 automatically determines the language for generation, translates into the required language. Also, the model automatically determines intonation, pronunciation speed, age, gender of the speaker and could make sample of own voice. The client receives the video in the language selected at the beginning. For the future, we want to link our site to virtual assistants so that videos are accessible to people with disabilities. And also connect the ability to translate into all languages of the world

Tengri AI

PollyGlotica

Polly was made possible using GPT 3.5, Whisper, and Google's Text-to-Speech. Putting these components together enables us to communicate in many forms of media and practice in different ways. The current implementation uses Whisper and Google's Text-to-Speech pair to receive and output voice messages to enhance interactive learning. We implemented six languages to choose from as we experimented with how each implementation is performed. There are many improvements within this field; this is a new way to practice and brush up your skills in Deutsche, French, English, etc. Polly is a project we intend to pursue further and in multiple different use cases.

MagnaLingua

ChatGPTWhisperRedisGPT-3

Aura the GPT assistant

AURA-the chatbot is a chatgpt-integrated speech assistant that converts user audio input into text and feeds it to chatgpt.Chatgpt then discovers a solution to our query which is converted back into audio files by Auro for the users. The two main tools used for this project are whisper and chatgpt API. Whisper can accurately recognize speech in a multitude of languages, accents, and environments. It can handle technical language and background noise and perform at par with human capabilities. While ChatGPT is a specialized variant of the GPT-3 language model designed to generate human-like responses in conversational contexts. We combine both of these cutting-edge technologies to develop an API that is useful to everyone.

GptHelpLine

CohereQdrantWhisperChatGPT

CryptoCrypt

Introducing our innovative Streamlit application, which harnesses the power of OpenAI GPT-3 to generate multi-layer encryption and decryption codes for secure communication. This application is designed to help users easily encrypt and decrypt their messages using state-of-the-art encryption techniques, making it nearly impossible for unauthorized parties to access their sensitive information. To use this application, users can input their speech message through OpenAI Whisper, which transcribes the message accurately. The application then uses GPT-3 to generate a multi-layer encryption code, which can be customized by the user according to their specific requirements. Once the encryption code is generated, it is applied to the speech message, making it indecipherable to anyone without the decryption code. Users can choose from a variety of encryption algorithms and key lengths, and can also input their own unique encryption key for added security. The application also allows users to save and retrieve their encryption codes for future use, making it easy to communicate securely with their contacts. In addition to its powerful encryption capabilities, the application is also highly user-friendly, with a clean and intuitive interface that allows users to easily navigate and customize their encryption settings. With its cutting-edge technology and ease of use, this Streamlit application is the perfect solution for anyone looking to communicate securely and confidently in today's digital world.

team phoeniks

OpenAI gymWhisperGPT-3

Joan Holloway

Currently, the most popular corporate knowledge management system is Confluence by Alatasian. It is known for a lack of search capabilities and makes most corporate knowledge inaccessible, especially in fast-growing companies where regular structure and responsibilities change. Some independent vendors fill this gap by offering carefully tuned solar-based search engines for Confluence, but not real semantic search. Confluence is a proprietary cloud-based solution, and it would be difficult to MVP a search extension in a hackathon. The most advanced open-source alternative is wiki.js, which already supports external search engines. So the current goal is to implement an external search engine for wiki.js using Cohere's LLM-powered Multilingual Text Understanding model and Qdrant's vector search engine. At the second stage of the project (most likely outside the hackathon scope), we plan to add the capability to upload and index videos in our knowledge management system. Recordings of presentations and meetings are the richest source of knowledge, but they were left outside knowledge management due to technical difficulties. Simple transcription and semantic search of that content could significantly boost corporate knowledge accessibility.

wiki search

Dripper News

a personalized news feed focused on the tech industry, powered by artificial intelligence (AI). Our news aggregator is specifically designed for busy CEOs, providing them with the latest and most relevant news in the tech sector. Through the use of AI, our platform curates and filters news articles from reputable sources, presenting only the most important and timely news stories to our users. This allows CEOs to stay informed on the latest trends, industry developments, and competitor updates in a quick and efficient manner. Additionally, our news aggregator provides a personalized experience for each user. By analyzing the user's reading habits and interests, our AI technology tailors the news feed to provide a custom selection of articles that are most relevant to their business and industry. Overall, our personalized news feed offers a comprehensive solution for CEOs who want to stay informed on the latest developments in the tech industry without the hassle of sorting through countless news sources. With our platform, CEOs can stay ahead of the curve and make informed decisions for their companies.

Dripper News

Shinyonaika

Shinyonaika, gamifies Cognitive Behaivioural Therapy into Storylines using Whisper + Cohere API to process User Emotion based on input given by user. It uses Unity for Game Development, C# for Scripting and powerful AI Models like Whisper+Cohere. CBT has 3 Steps: 1.Identify Negative Emotions 2.Identify Triggering Situations 4.Reshaping Negative Emotion It's goals are: Mental Health Awareness Making CBT Interactive and Graphical Self Therapy The product is new and has no competion. It can be made available to all Users, since, it uses technology that is common in the market. Note: Psychotherapists were consulted during development.

MAVERICKS

CohereWhisper

Vi chat

Vi-chat is an innovative AI assistant aimed at helping mothers connect with their autistic children by converting their voice into images easily understood by autistic children as they are have difficulty processing spoken language but prefer pictures. we used openai model with their whisper and dall beta embedded to transform voice into images. this solution is never offered before to autistic children but it will help them communicate and boost their learning process. we plan to make this app go both ways from voice to image and from image to voice in near future and make it customized to every child and his preferences. We are very proud and honored to help autistic children and their mothers get connected together

Clawcode

WhisperDALL-E-2

AudioQuest

Get ready to embark on an epic adventure through sound and story with AudioQuest, the thrilling new text-based adventure game! In AudioQuest, you'll take on the role of a hero on a mission to uncover the secrets of a mysterious and magical world, using nothing but your wits and your trusty set of headphones. With each new stage, you'll be immersed in a rich and detailed soundscape, filled with clues and puzzles that will challenge your mind and test your skills. As you explore this fantastical world, you'll encounter a host of memorable characters, each with their own unique stories and motivations. From fierce dragons to cunning thieves, you'll need to use your intuition and your cunning to navigate the many challenges that lie in your path. With multiple stages to explore, each more challenging than the last, AudioQuest is the perfect way to escape into a world of adventure and excitement. So why wait? Start your journey today and experience the thrill of AudioQuest! AudioQuest uses Whisper to understand what you say and lets you play using the ChatGPT API. We also added SoundCloud API support to include the optimal background music for each situation. We wrapped up everything using a Flask Web Application to bring you the best voice-commanded text-based adventure possible.

Hackstreet Boys

RedisCodexWhisperDALL-E-2ChatGPTStable DiffusionGPT-3

Miraa

Our app provides a fully digitalized package for our clients. We offer a range of services, including the creation of a logo, ads that can be used on social media platforms such as Facebook and Instagram, a website, and marketing videos. In order to enhance the quality of our videos, we use a technology called DeepFake. This technology generates faces which are then placed onto the video to create a more engaging advertisement. To create the ads, we use two different technologies called dalle and gpt3. Dalle is used to generate images, while gpt3 is used for text. The logo is also created using dalle for the image and gpt3 for the text under the image. For the website, we will use dalle for images and gpt3 to code the website itself. Additionally, we will be adding automation to our app to streamline the entire process. Impact:: Our app offers a comprehensive range of services that can potentially have a significant impact on the market. The fields in which our app can be used includes branding, digital marketing, web development, and video production.One potential way to use client data and requests of images for further work is to analyze the data to identify trends and patterns in the type of images that clients are requesting. This can help us to tailor our services to meet the specific needs and preferences of your clients. For example, if we notice that clients are frequently requesting certain types of images or logos, we could focus on developing more options in that style., our app has the potential to make a significant impact on the market and attract a wide range of clients.

DeepDream

AIYu

Supercharge your business operation by using AI technology. We support small business by introducing smart automation into their daily business operation. We leverage different open AI stack and Redis in our implementation. We achieved 10x faster operation and manage to demo our product to our potential first customer.

AI.Yu: Supercharge your business operation

Reinforcement LearningRedisWhisperGPT-3

Liquid LMS

The Problem: Traditional education has not changed much in the last century, and it fails to meet the diverse needs of students. One-size-fits-all teaching methods, outdated curricula, and limited access to resources often result in disengaged students who are unprepared for the workforce of tomorrow. The Solution: We propose a revolutionary approach to education that integrates AI and new technology. By leveraging the power of AI, we can create personalized learning experiences that cater to each student's unique needs, interests, and abilities. The Implementation: Our approach is built on three pillars: a. Adaptive Learning: Our AI-powered algorithms will analyze each student's performance data to create a customized learning path. This will help students learn at their own pace and achieve better learning outcomes b. Immersive Learning: We will use virtual and augmented reality to create immersive learning experiences. This will enable students to explore complex concepts in a more engaging and interactive way. c. Collaborative Learning: We will facilitate collaborative learning by leveraging AI-powered tools that enable students to work together on projects and assignments in real-time. The Benefits: Our approach to education will offer several benefits, including: a. Improved Learning Outcomes: Personalized and engaging learning experiences will help students achieve better learning outcomes and prepare them for the workforce of tomorrow. b. Cost-Effective: Our AI-powered approach to education will be cost-effective as it will reduce the need for physical classrooms and expensive resources. c. Accessible: Our approach will be accessible to all students regardless of their location, socioeconomic status, or learning abilities. Our approach to education will revolutionize the way we teach and learn. By leveraging the power of AI and new technology, we can create personalized, engaging, and cost-effective learning experiences that prepare students for tomorrow.

PlayFine

WhisperDALL-E-2GPT-3

TaskMate

TaskMate is a solution that can be integrated into any website, providing AI-powered speech interaction with the website. AI plays a significant role in making the solution better because of natural language processing. Speech interaction can address the problems we have identified by providing hands-free interaction, increasing accessibility, improving productivity, and reducing cognitive load. Overall, speech interaction can make it easier to use your phone in a variety of situations and improve accessibility and productivity for all users. We believe that TaskMate has the potential to be a game-changer in the way people interact with websites.

TunisFeldberg

WhisperCohere ClassifyCohere EmbedRedis

LearnIt

Watching the right content and understanding and drawing conclusions from them is very important on this content-populated internet. It is a time-consuming process to go through lengthy tiring lecture videos and research papers. In this project, we take input in 3 different formats: youtube video link, pdf link, and pdf uploaded by the user. From the youtube video link, we first download the video and then extract its audio. The audio is then transcribed using WhiperAPI. Finally, we save the transcribed text from the audio and it is summarized using GPT-3. As for the pdf link and pdf uploaded from the local device, the text is extracted from the pdf and again with the use of GPT-3, we summarize the pdf. The summary LearnIt provides gives an overview of what those lengthy tiring videos and research papers were about. This gives the user an idea of what they can expect from the video and paper. Also based on the summary, they can save time to understand the video and papers quickly and sort them based on their interest.

Tetranator

ChatGPTCodexDALL-E-2CohereAI21 LabsWhisperYOLOv7GPT-3

I AM AI personalized ChatGPT

I-AM-AI is your personalized chatbot companion that can be integrated with Telegram and Discord. Trained exclusively on your chosen content and offering secure and private access, I-AM-AI provides tailored conversations, personalized insights, and relevant recommendations based on your interests and preferences. With its powerful AI capabilities, I-AM-AI makes it easy to access and organize information, learn new things, and stay engaged with the world around you. I-AM-AI now offers integration with Telegram and Discord chatbots, making it even more accessible and convenient. Whether on the go or at your desk, you can access I-AM-AI from your favorite messaging platform and get instant access to the information you need. With the ability to pre-train with your latest data, I-AM-AI can create a customized knowledge base optimized for your specific needs and provide fast and accurate answers to your queries. From onboarding to customer support, I-AM-AI can be used for various business cases, including documentation updates, research article summaries, product recommendations, marketing campaign assistance, second brain, financial advice, employee training, HR support, sales support, news overview, and market overview. Experience the power of I-AM-AI today and see how it can transform how you work and learn.

Data Dreamers

MediFix

MediFix is an AI-powered assistant that utilizes the latest technologies such as GPT 3.5, Whisper, and gTTS to provide users with valuable healthcare information. With its advanced capabilities, MediFix is able to analyze symptoms mentioned by users and provide them with preventive measures to help them stay healthy. One of the key features of MediFix is its ability to support both voice and text input. This means that users can either speak to the assistant or type their symptoms, making it accessible to a wide range of users. When users input their symptoms, MediFix uses GPT 3.5 technology to analyze the information and provide relevant information on the causes of the symptoms and possible preventive measures. The assistant is trained on a vast amount of medical data, allowing it to provide users with accurate and reliable information. In addition, MediFix also utilizes Whisper technology to provide a personalized experience for each user. By understanding the user's context and history, MediFix is able to provide customized recommendations and preventive measures that are specific to their needs. Finally, gTTS technology is used to deliver the information to the user in a clear and easy-to-understand manner. This ensures that users are able to comprehend and follow the recommendations provided by MediFix. Overall, MediFix is a powerful healthcare assistant that leverages the latest AI technologies to provide users with accurate and personalized healthcare information. With its support for both voice and text input, MediFix is accessible to a wide range of users, making it an invaluable tool for anyone looking to take control of their health.

SeaSky

WhisperChatGPTGPT-3

Spectra Mirror

Spectra Mirror solves this problem by combining the latest in AI voice technology with one of the oldest and most widely used technologies to this day, the Mirror. Everyone owns a mirror, therefore anyone with a mirror can access Voice AI assistance thanks to Spectra Mirror. Spectra Mirror is a module application that is designed to integrate with MagicMirror, an open-source smart mirror platform. It allows users to interact with OpenAI's powerful language model, GPT, by hardcoding a prompt, sending it to the OpenAI API, and then displaying the result on the smart mirror. The seamlessness of Spectra Mirror allows baby-boomers to access information without the need of a cellphone or computer (after installation), simply through their voice they can access information and complete simple day-to-day tasks by speaking to Spectra.

HackStreet Boys