updated speech recognition with permission and language selector state management #673

yodaljit · 2024-12-12T21:47:39Z

Overview
This document outlines the improvements made to the speech recognition feature, focusing on better permission handling, visual feedback, and user experience.

Key Changes

1. Permission State Management

Added new permission states: 'granted', 'denied', 'prompt', 'unsupported'
Implemented real-time permission monitoring
Added browser-specific settings navigation
Enhanced error handling for permission-related issues

2. Visual Feedback Improvements

Permission Denied: Red microphone icon with slash
Unsupported: Gray microphone icon with slash
Recording Active: Pulsing red dot indicator
Normal State: Standard microphone icon
Added tooltips with contextual help messages

3. Browser-Specific Handling

// Chrome
chrome://settings/content/microphone

// Firefox
about:preferences#privacy

// Other browsers
Generic settings instructions
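
As a rough illustration of the mapping above, the following sketch picks a settings location from a user-agent string. The helper names (`detectBrowser`, `getMicrophoneSettingsHint`) and the regular expressions are assumptions made for this example, not code taken from the PR. Browsers commonly block web pages from navigating directly to internal URLs such as chrome://, so the sketch returns the address as text that a tooltip could display.

```typescript
type BrowserName = 'chrome' | 'firefox' | 'other';

// Hypothetical helper: classifies a user-agent string.
// Edge and Opera also contain "Chrome/" in their user agent, so they are
// excluded explicitly and fall through to 'other'.
function detectBrowser(userAgent: string): BrowserName {
  if (/Firefox\//.test(userAgent)) {
    return 'firefox';
  }
  if (/Chrome\//.test(userAgent) && !/Edg\/|OPR\//.test(userAgent)) {
    return 'chrome';
  }
  return 'other';
}

// Hypothetical helper: returns the settings address (or generic
// instructions) to show to a user whose microphone permission is denied.
function getMicrophoneSettingsHint(userAgent: string): string {
  switch (detectBrowser(userAgent)) {
    case 'chrome':
      return 'chrome://settings/content/microphone';
    case 'firefox':
      return 'about:preferences#privacy';
    default:
      return 'Open your browser settings and allow microphone access for this site.';
  }
}
```

In a browser, the argument would typically be `navigator.userAgent`.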

Component Changes

useSpeechRecognition Hook

Added permission state tracking
Implemented permission query and monitoring
Enhanced error handling with user-friendly messages
Added language support configuration

SpeechRecognitionButton

New visual states based on permission status
Interactive tooltips with helpful messages
Direct links to browser settings
Improved accessibility features

BaseChat Integration

Updated to use new permission states
Improved error handling and user feedback
Better integration with streaming state

User Experience Improvements

Visual States

Denied Permission
- Red microphone icon
- Tooltip with instructions to enable
- Click opens browser settings
Unsupported Browser
- Gray disabled icon
- Informative tooltip
- Disabled state
Active Recording
- Stop icon when recording
- Clear recording state visibility
- Stop recording on stream start

Error Handling

Network errors: Connection status messages
Permission errors: Clear instructions for resolution
Recognition errors: User-friendly error messages
Browser compatibility: Support status messages
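
One way to produce the messages listed above is to translate the error codes reported by the Web Speech API's `error` event into display text. The codes in the sketch below are the standard `SpeechRecognitionErrorEvent.error` values; the function name `describeSpeechError` and the message wording are illustrative assumptions, not the strings used in this PR.

```typescript
// Hypothetical helper: maps a Web Speech API error code to a message
// suitable for a tooltip or toast. Unknown codes get a generic message.
function describeSpeechError(code: string): string {
  switch (code) {
    case 'network':
      return 'Network error. Check your connection and try again.';
    case 'not-allowed':
    case 'service-not-allowed':
      return 'Microphone access is blocked. Enable it in your browser settings.';
    case 'audio-capture':
      return 'No microphone was found. Check that one is connected.';
    case 'no-speech':
      return 'No speech was detected. Please try again.';
    case 'language-not-supported':
      return 'The selected language is not supported by this browser.';
    case 'aborted':
      return 'Speech recognition was stopped.';
    default:
      return 'Speech recognition failed. Please try again.';
  }
}
```

A recognition instance could call this from its `onerror` handler, passing `event.error`.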

Technical Implementation

Permission Monitoring

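const checkPermission = async () => {
  try {
    // Read the current microphone permission without triggering a prompt.
    const result = await navigator.permissions.query({ name: 'microphone' });
    setPermissionState(result.state);

    // Keep the UI in sync when the user changes the permission later.
    result.addEventListener('change', () => {
      setPermissionState(result.state);
    });
  } catch (error) {
    // The query is unavailable or rejected; fall back to asking on first use.
    setPermissionState('prompt');
  }
};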
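
The snippet above does not show where the 'unsupported' state comes from. A minimal sketch of one way to derive all four states follows. It assumes 'unsupported' means the speech recognition API is missing. The names `MicPermissionState`, `PermissionEnv`, and `resolvePermissionState` are hypothetical, and the browser capabilities are passed in as plain values so the logic can run outside a browser.

```typescript
type MicPermissionState = 'granted' | 'denied' | 'prompt' | 'unsupported';

// Hypothetical description of what the current environment offers.
interface PermissionEnv {
  // True when SpeechRecognition (or the webkit-prefixed variant) exists.
  hasSpeechRecognition: boolean;
  // Wrapper around the Permissions API query; omitted when unavailable.
  queryMicrophone?: () => Promise<{ state: string }>;
}

async function resolvePermissionState(env: PermissionEnv): Promise<MicPermissionState> {
  // Without the recognition API there is nothing to ask permission for.
  if (!env.hasSpeechRecognition) {
    return 'unsupported';
  }
  // No Permissions API: the browser will prompt when recording starts.
  if (!env.queryMicrophone) {
    return 'prompt';
  }
  try {
    const { state } = await env.queryMicrophone();
    if (state === 'granted') {
      return 'granted';
    }
    if (state === 'denied') {
      return 'denied';
    }
    return 'prompt';
  } catch {
    // Same fallback as checkPermission: treat a failed query as 'prompt'.
    return 'prompt';
  }
}
```

In the browser, `hasSpeechRecognition` could be computed as `'SpeechRecognition' in window || 'webkitSpeechRecognition' in window`, and `queryMicrophone` could wrap the `navigator.permissions.query` call shown earlier.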

Visual Feedback

const getButtonIcon = (permissionState: PermissionState, isListening: boolean) => {
  switch (permissionState) {
    case 'denied':
      return <div className="text-red-500" />;
    case 'unsupported':
      return <div className="text-gray-400" />;
    default:
      return isListening ? <div className="animate-pulse" /> : <div />;
  }
};

Recent Changes (2024-12-13)

UI Improvements

Icon Updates
- Added microphone-slash icon for denied permissions state
- Added red color indication for stop and denied states
- Using UnoCSS i-ph prefix for icon integration
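
To make the icon choice concrete, here is a small sketch that maps a button state to UnoCSS class names. The exact icon names (`i-ph:microphone`, `i-ph:microphone-slash`, `i-ph:stop`) and the `text-red-500` color class are assumptions based on the Phosphor icon set and the colors used elsewhere in this description; the PR may use different variants. `ButtonVisualState` and `getIconClass` are hypothetical names.

```typescript
type ButtonVisualState = 'normal' | 'recording' | 'denied';

// Hypothetical helper: returns the class string for the button icon.
// Icon and color class names are assumed, not confirmed by the PR.
function getIconClass(state: ButtonVisualState): string {
  switch (state) {
    case 'denied':
      return 'i-ph:microphone-slash text-red-500';
    case 'recording':
      return 'i-ph:stop text-red-500';
    default:
      return 'i-ph:microphone';
  }
}
```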

Functionality Improvements

Language Persistence
- Added localStorage support to persist language selection
- Language preference now survives page reloads
- Fallback to default language (en-US) if no saved preference
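
The persistence behavior described above could look roughly like the following sketch. The storage key `speechRecognitionLanguage` and the names `loadLanguage`, `saveLanguage`, and `defaultStore` are assumptions for illustration. The store is passed as a parameter so the same code works during server-side rendering, where `localStorage` does not exist, by passing `null`.

```typescript
const STORAGE_KEY = 'speechRecognitionLanguage'; // assumed key name
const DEFAULT_LANGUAGE = 'en-US';

// The subset of the Storage interface this sketch relies on.
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

// Returns localStorage when it is available, or null otherwise (for
// example during SSR or when storage access is blocked).
function defaultStore(): KeyValueStore | null {
  try {
    const candidate = (globalThis as { localStorage?: KeyValueStore }).localStorage;
    return candidate ?? null;
  } catch {
    return null;
  }
}

// Reads the saved language, falling back to en-US when nothing is saved.
function loadLanguage(store: KeyValueStore | null = defaultStore()): string {
  if (!store) {
    return DEFAULT_LANGUAGE;
  }
  try {
    return store.getItem(STORAGE_KEY) || DEFAULT_LANGUAGE;
  } catch {
    return DEFAULT_LANGUAGE;
  }
}

// Saves the selected language; failures (quota, privacy mode) are ignored.
function saveLanguage(language: string, store: KeyValueStore | null = defaultStore()): void {
  if (!store) {
    return;
  }
  try {
    store.setItem(STORAGE_KEY, language);
  } catch {
    // Persistence is best-effort; the in-memory selection still applies.
  }
}
```

A hook could call `loadLanguage()` once for its initial state and `saveLanguage(code)` whenever the user picks a different entry in the language menu.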

Component Structure

Button States
- Normal state: Shows microphone icon
- Recording state: Shows red stop icon
- Denied state: Shows red slashed microphone icon
- Each state has appropriate hover and focus styles
Language Selection
- Language menu accessible via dropdown
- Supports multiple languages including:
  - English (US/UK)
  - Spanish
  - French
  - German
  - Italian
  - Portuguese
  - Russian
  - Chinese (Simplified)
  - Japanese
  - Korean
  - Hindi
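
The Web Speech API selects a language through a BCP 47 tag assigned to the recognizer's `lang` property, so the menu above needs a tag for each entry. The list below is a sketch: the specific tags (for instance `pt-BR` rather than `pt-PT` for Portuguese) and the names `LanguageOption`, `LANGUAGE_OPTIONS`, and `isSupportedLanguage` are assumptions, since the description does not state which regional variants the PR uses.

```typescript
interface LanguageOption {
  code: string; // BCP 47 tag assigned to recognition.lang
  label: string; // Text shown in the dropdown
}

// Assumed tags for the languages listed in the description.
const LANGUAGE_OPTIONS: readonly LanguageOption[] = [
  { code: 'en-US', label: 'English (US)' },
  { code: 'en-GB', label: 'English (UK)' },
  { code: 'es-ES', label: 'Spanish' },
  { code: 'fr-FR', label: 'French' },
  { code: 'de-DE', label: 'German' },
  { code: 'it-IT', label: 'Italian' },
  { code: 'pt-BR', label: 'Portuguese' },
  { code: 'ru-RU', label: 'Russian' },
  { code: 'zh-CN', label: 'Chinese (Simplified)' },
  { code: 'ja-JP', label: 'Japanese' },
  { code: 'ko-KR', label: 'Korean' },
  { code: 'hi-IN', label: 'Hindi' },
];

// Guards against stale or hand-edited values before they reach the recognizer.
function isSupportedLanguage(code: string): boolean {
  return LANGUAGE_OPTIONS.some((option) => option.code === code);
}
```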

Technical Implementation

Hook Updates
- Enhanced useSpeechRecognition hook with language persistence
- Added SSR compatibility checks
- Improved error handling and state management
Styling
- Using UnoCSS for consistent styling
- Responsive design for all screen sizes
- Proper spacing and alignment with other chat components

Browser Support

Chrome: Settings redirect to chrome://settings/content/microphone
Firefox: Settings redirect to about:preferences#privacy
Other browsers: Standard permission prompts

Future Improvements

Implement noise cancellation
Support for custom wake words
Implement automatic punctuation

Browser Support

Chrome: Full support
Firefox: Full support
Safari: Partial support
Edge: Full support

Notes

Permission states are persisted across sessions
Real-time permission updates without page reload
Graceful fallback for unsupported browsers
Accessible keyboard navigation support


thecodacus · 2025-01-05T21:44:12Z

I'd like to test this. Can you resolve the conflicts and ping me once done?

Digitl-Alchemyst · 2025-01-10T22:42:47Z

I will also test this improvement if the conflicts are resolved.
Can we also update the title to match our new naming convention?

updateed speech recognition with permission state, language selector …

389b5a2

…and language state

yodaljit changed the title ~~updateed speech recognition with permission state, language selector with state~~ updated speech recognition with permission state, language selector with state Dec 12, 2024

yodaljit changed the title ~~updated speech recognition with permission state, language selector with state~~ updated speech recognition with permission and language selector state management Dec 13, 2024

Digitl-Alchemyst requested review from thecodacus and Digitl-Alchemyst January 10, 2025 22:41
