
Closed
Posted
Paid on delivery
JS Freelancer needed — Browser-based Voice Recognition (Web Speech API) Context: I'm developing a PWA web application (vanilla HTML/CSS/JS, Firebase). A "hands-free" mode allows users to validate items by voice, without touching the screen. The problem: Web Speech API on Android Chrome is unstable in noisy environments: - Sessions die silently after a period of silence - Short trigger words ("hop", "ok", "next") frequently missed or misrecognized - Erratic behavior across Android versions - Conflicts between speech synthesis (TTS) and recognition (STT) What I already have: - Working voice mode with short sessions + automatic restart - interimResults, maxAlternatives, expanded trigger word list - Accent stripping in transcript comparison What I'm looking for: Someone who has already solved these issues in production — not theory. Ideally with one of the following approaches: - Advanced Web Speech API optimization (VAD, fine-grained state management, heuristics) - Whisper integration via [login to view URL] in a mobile context - Any other proven browser-side approach, no server required Stack: vanilla HTML/JS, Android Chrome, Firebase Realtime Database. No framework. Please apply with references from similar cases.
Project ID: 40310046
78 proposals
Remote project
Active 1 mo ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
78 freelancers are bidding on average €21 EUR for this job

Hello, As one of the largest web service providers in Pakistan, my team and I at Our Software have the right blend of skills and expertise to help you optimize the Web Speech API for your PWA. We have a proven track record of developing and designing websites using the latest technologies, including vanilla HTML/CSS/JS as you require for your project. When it comes to the mobile context, particularly Android Chrome, we understand firsthand the challenges you're facing with Web Speech API instability. Drawing from our experience dealing with similar cases, we can offer valuable insights and implement effective solutions to address issues such as session drops, miss or misrecognition of short trigger words, disparate behavior across Android versions, and TTS-STT conflicts. Moreover, we deeply appreciate the significance of your "hands-free" mode for validating items by voice and understand that making this feature work flawlessly even in noisy environments is crucial for your users' experience. Rest assured, we won't approach this project theoretically; instead, we'll leverage our practical know-how to ensure that our optimizations enhance the stability and reliability of Web Speech API on Android Chrome in various circumstances. We are driven by our clients' satisfaction and strive to receive positive feedback by delivering excellent customer service consistently. With us on your side, you can expect not only technical solutions b Thanks!
€25 EUR in 1 day
8.5
8.5

Hi there I have worked on a similar voice-based feature where users had to trigger actions using short commands on mobile browsers, and we faced issues like missed words, sessions stopping randomly, and conflicts when audio was playing in background. I handled it by controlling the recognition flow more tightly, restarting sessions at the right time, adding checks around silence, and improving how short words are matched so they don’t get missed easily. I also separated TTS and voice input properly so they don’t interfere with each other. I can take your current setup and improve it step by step, focusing on stability across devices rather than changing everything. If needed, I can also explore a lightweight alternative like on-device models depending on performance. One thing I wanted to check, are you targeting a specific range of Android versions/devices, or should it work across all? Thanks, Rahul A.
€20 EUR in 1 day
8.4
8.4

i am ready to start now kindly message me so that we can start right away you can check my profile for portfolio and past projects https://www.freelancer.com/u/DestinyGuider
€30 EUR in 1 day
7.3
7.3

Hello Sir, Would you be interested in a customized demo of a Web Speech API optimization solution that can resolve your voice recognition issues before making any commitment? I have extensive experience in optimizing voice recognition systems, specifically in addressing challenges like silent session drops and recognition errors in noisy environments. I would love to discuss how we can enhance your application and invite you to collaborate on a detailed plan along with the demo showcasing potential improvements. Regards, Smith
€19 EUR in 7 days
6.5
6.5

As a seasoned Full Stack Developer with over 6 years of experience and a diverse skill set, I've tackled and overcome numerous challenges throughout my career similar to the one you're facing. I have a proven record of delivering high-performance, robust applications particularly in Java and JavaScript, making me a perfect fit for your proficiency requirements. My familiarity with Android Chrome and HTML/JS will ensure seamless integration of the PWA on your platform. In addition to my fluency in JavaScript, one of your core requirements, I've developed several projects leveraging the Web Speech API. I'm well aware of the inconsistencies and limitations present in it, especicially visible in Android Chrome's unstable environments as you pointed out. Extensively adapting this API in numerous mobile apps, I have learned effective approaches to optimize its performance for erratic sessions as well as tackle missed or misrecognized trigger words - making your "hands-free" mode more effective. Moreover, my understanding of Firebase Realtime Database harmonizes perfectly with your stack thereby ensuring smooth development process without relying heavily on servers. My past experiences are not just theoretical but they were crafted through robust implementation. I'm confident that choosing me for this task will result in an optimized Web Speech API solution which satisfactorily addresses all your pain points. Let's get started on this project today!
€10 EUR in 2 days
6.0
6.0

Hello, I understand you are developing a PWA with a hands-free voice validation mode using the Web Speech API, and you’re facing key challenges like session instability, missed trigger words, version inconsistencies, and conflicts between TTS and STT on Android Chrome. Your existing setup includes automatic session restarts, expanded trigger word lists, and accent stripping — a solid foundation. With extensive experience optimizing Web Speech API in production environments, I have successfully implemented advanced voice activity detection, state management heuristics, and integrated browser-side speech recognition enhancements without server dependencies. I am familiar with the limitations on Android Chrome and have tackled issues of silent session deaths and trigger word misrecognition in noisy contexts. I am confident my hands-on solutions can bring reliability and accuracy to your voice mode, ensuring seamless interaction for your users across devices and Android versions, all while maintaining your vanilla JS stack and Firebase backend. Let’s discuss your current implementation in detail so we can tailor an effective optimization strategy. I look forward to helping you deliver a robust hands-free experience. Best regards.
€30 EUR in 10 days
5.7
5.7

With a profound passion for innovation and an extensive track record in web development, I believe I am the ideal candidate to optimize your Web Speech API project. Not only have I worked on browser-based voice recognition projects before, but I've also successfully tackled various browser compatibility issues to create reliable and efficient solutions. Your concerns about the API's stability and misrecognition in noisy environments align perfectly with the problems I've resolved for previous clients. In terms of my technical prowess, I'm well-versed in vanilla HTML, CSS, and JS - the precise stack you require for this PWA web development - equipped with the skills to utilize them seamlessly within Android Chrome environment. Moreover, my experience with Firebase Realtime Database ensures that I'll be able to complement your existing project framework efficiently.
€8 EUR in 1 day
5.2
5.2

Hello My client. Pleasure is all mind to send my proposal on your post. I am Muamer and ready to support you with this wonderful project. I see you need stable browser-based voice recognition on Android Chrome for your PWA, handling short trigger words, noisy environments, and conflicts between STT and TTS. I’ve solved similar issues in production using advanced Web Speech API optimizations, VAD, fine-grained session management, and even lightweight browser-side Whisper integrations—keeping everything client-side with no server dependency. Your voice mode will handle automatic restarts, erratic speech, and accents robustly. Quick questions: do you want multiple trigger words recognized simultaneously, or just one active at a time? Any preference for using interimResults for live feedback? Plan: • Audit current voice mode + session logic • Improve recognition stability + trigger reliability • Handle TTS/STT conflicts gracefully • Test across Android versions • Deliver code + simple usage doc I can start immediately and make your hands-free mode rock-solid. Looking forward to hearing from you, Sincerely with your best luck in everything you do always. Thanks for your careful concern.
€10 EUR in 2 days
5.1
5.1

Hi, Client. I am very interested in your project and confident that my core skills and extensive experience align perfectly with your requirements. After carefully reviewing the project details, I am certain that I can deliver high-quality results within a short timeframe. I am available to begin work immediately and will maintain clear, consistent communication throughout the process. I look forward to the opportunity to collaborate with you. Best regards, Huy
€30 EUR in 1 day
4.5
4.5

Hello, I have worked on browser-based voice flows and I understand the real issue here is not just recognition itself, but keeping STT stable on Android Chrome when silence, background noise, and TTS interruptions start fighting the session. I can help you harden the current hands-free mode with better session control, restart logic, transcript filtering, and command-matching heuristics, and if needed I can also test a browser-side Whisper route to compare reliability without adding a server. Since you already have the base flow working, I would focus on making it production-safe rather than rebuilding everything, especially around missed short commands, silent session drops, and TTS and STT conflicts. Are you looking first for a targeted fix to the current Web Speech implementation, or do you want the freelancer to evaluate both Web Speech and Whisper and choose the more reliable option for Android Chrome? Let’s discuss detail via chat.
€19 EUR in 7 days
3.8
3.8

I can help you optimize your browser-based voice recognition using the Web Speech API so it feels fast, accurate, and reliable across different environments. I’ve worked with JS-heavy frontends where speech input, streaming transcription, and real-time feedback are core to the user experience. Previously, I’ve implemented speech recognition flows with Web Speech API and fallbacks, tuned for different browsers, noise conditions, and accents, and integrated them into larger JS applications. I’m comfortable profiling performance, handling edge cases, and improving UX around partial results, errors, and permissions. My approach would be to review your current implementation, identify bottlenecks, improve recognition configuration and event handling, and then refine UX/UX copy and error states. I’ll document the setup so it’s easy to maintain. I would love to chat more about your project! Regards
€19 EUR in 7 days
4.2
4.2

Hi, Have you encountered any specific errors when implementing the Web Speech API in your app? I can help you stabilize the voice recognition feature in noisy environments. Based on your context, advancing the optimization of the Web Speech API can be a great first step. Fine-tuning state management and employing voice activity detection (VAD) will definitely improve accuracy. Also, Whisper integration could enhance performance further, especially for short trigger words. I have worked on similar voice recognition projects and have successfully implemented solutions that reduced misrecognition under different scenarios. I’m experienced in vanilla HTML/JS and have used Firebase for real-time databases, so I’m confident I can help refine your app. Looking forward to discussing this further!
€8 EUR in 1 day
3.8
3.8

Hi, I will enhance your PWA’s voice recognition capabilities by addressing the stability issues with the Web Speech API on Android Chrome. My experience with advanced Web Speech API optimizations, including VAD and state management, makes me confident in delivering a robust solution that minimizes silent session drops and improves recognition accuracy in noisy environments. I have successfully implemented similar features in production environments, ensuring smooth integration between speech recognition and synthesis, while overcoming challenges like erratic behavior across Android versions. My approach will focus on optimizing your existing setup, confirming that trigger words are consistently recognized and that the system remains responsive even in challenging conditions. Given your current implementation, I’d like to understand the specific environments where the issues are most prevalent and if there are any constraints with your current Firebase setup that I should be aware of. Let's work together to elevate the hands-free user experience in your application. Thank you.
€20.65 EUR in 7 days
2.6
2.6

As an experienced Fullstack Developer, I have spent years working on web and mobile applications similar to your project. I've specifically developed ERP applications for different industries like mining, textiles, and chemical sales which required seamless integration of complex functionalities - a skill that will prove invaluable for the optimization you need in your PWA for voice recognition. Continuing the tradition, I have kept myself up-to-date with all the relevant latest technologies and trends. This includes having a strong understanding of both server and client-side programming languages like PHP, Python, Javascript, and Pascal. Most importantly, I've contributed significantly to projects involving voice integration with different devices and third-party APIs. Moreover, my extensive experience with databases (MySQL, PostgreSQL, MongoDB) combined with the mastery over runtime environments such as Node.js and Docker would mitigate any potential conflicts between speech synthesis and recognition that you've mentioned in your description. Rest assured that choosing my services ensures a smooth project execution and impactful results.
€19 EUR in 2 days
2.7
2.7

Hi, I just applied after read your job posting carefully and I believe that I am good fit to your project. I have thoroughly reviewed your requirements and I am confident in my ability to deliver excellent results. I'm a serious bidder. I will satisfy you with my high skills! I am an expert which have 8+ years of experience on Java, JavaScript, Mobile App Development, Android, CSS, HTML, Mobile Development, Speech Synthesis I will work on your project hard with full time. I am looking forward to meet you to discuss the further detail about this project. Looking forward to hearing from you. Warm Regards
€25 EUR in 7 days
2.8
2.8

Hey , I just went through your job description and noticed you need someone skilled in JavaScript, Speech Synthesis, Java, HTML, Mobile App Development, Mobile Development, Android and CSS. That’s right up my alley. You can check my profile —I’m Software engineer working at large-scale apps as a lead developer with U.S. and European teams. I’ve handled several projects using these exact tools and technologies. Before we proceed, I’d like to clarify a few things: Are these all the project requirements or is there more to it? Do you already have any work done, or will this start from scratch? What’s your preferred deadline for completion? Why Work With Me? 1) Over 230 successful projects completed. 2) I have not received a single bad feedback since the last 5-6 years. 3) You will find 5 star feedback on the last 100+ major projects which shows my clients are happy with my work. 4) Long-term track record of happy clients and repeat work. I prioritize quality, deadlines, and clear communication. Availability: 9am – 9pm Eastern Time (Full-time freelancer) I can share recent examples of similar projects in chat. Let’s connect and discuss your vision in detail. Kind Regards, Imran Haider
€8 EUR in 2 days
2.4
2.4

Hey — saw your post about optimizing a browser-based voice recognition app with the Web Speech API. A lot of these projects struggle with inconsistent recognition across browsers and noisy environments, which can quietly kill the user experience. Quick question before I suggest an approach: Are you tied to the native Web Speech API only, or open to a hybrid setup (e.g. fallback to another STT service) for edge cases? I’ve worked on JS-heavy, browser-based audio/voice features before, including tuning wake-word logic, handling partial results, and dealing with Chrome/Safari quirks. If you can share your current repo link, a short spec, or a screen recording of the current behavior, I can review it and tell you what’s realistically fixable and where you’ll get the biggest gains.
€19 EUR in 7 days
2.0
2.0

Hello, how are you? I’ve worked on browser-based voice systems and understand the exact limitations you’re facing with the Web Speech API—especially on Android Chrome. These issues (silent session drops, poor short-word detection, and TTS/STT conflicts) require practical handling, not just configuration tweaks. I can improve your current setup by implementing stronger state management with continuous session recovery, custom VAD-like silence detection, and smarter buffering/validation for short trigger words. I’ll also isolate TTS and STT flows to prevent conflicts and stabilize behavior across devices. If needed, I can integrate a lightweight Whisper-based approach (client-side) as a fallback for better accuracy in noisy environments. I’ll work directly within your vanilla JS + Firebase setup and focus on real, tested fixes. Thanks! Dennis
€19 EUR in 7 days
2.0
2.0

Hi, I can start your project right away. I am very familiar with JavaScript, HTML, CSS, Web Speech API, Speech Synthesis, Android, Firebase. I'm confident I can deliver a top-notch solution within your budget and timeline. For your voice feature, I can improve stability by implementing better session control with silence detection handling, resolve conflicts between TTS and STT, and enhance trigger word recognition using optimized matching logic and restart strategies. I can also explore a lightweight Whisper integration in-browser if needed to improve accuracy in noisy environments. Looking for your reply. Thank you
€15 EUR in 7 days
2.0
2.0

Hello, I am excited to help improve your PWA’s voice recognition using the Web Speech API. With strong JavaScript experience, I can optimize recognition accuracy, handle noisy environments, and stabilize performance using techniques like VAD and better state management. I’ve also worked with Whisper integrations for more reliable results. I focus on practical, efficient solutions without unnecessary complexity. Please start the chat to discuss in detail. Best regards, Someder Singh
€50 EUR in 7 days
2.2
2.2

Paris, France
Payment method verified
Member since Apr 6, 2018
€8-30 EUR
€8-30 EUR
€8-30 EUR
€8-30 EUR
€8-30 EUR
₹600-1500 INR
₹75000-150000 INR
$250-750 AUD
$250-750 USD
₹1000000-2500000 INR
$10-30 USD
₹400-750 INR / hour
$2-8 AUD / hour
₹750-1250 INR / hour
₹600-1000 INR
£250-750 GBP
₹12500-37500 INR
₹75000-150000 INR
₹12500-37500 INR
₹12500-37500 INR
$10-200 USD / hour
$2-8 USD / hour
$8-15 USD / hour
$3000-5000 USD
£20-250 GBP