DCT

1:24-cv-00728

SoundClear Tech LLC v. Amazon.com Inc

Key Events
Complaint
complaint

I. Executive Summary and Procedural Information

  • Parties & Counsel:
  • Case Identification: 1:24-cv-00728, E.D. Va., 05/01/2024
  • Venue Allegations: Plaintiff alleges venue is proper in the Eastern District of Virginia because Defendant Amazon maintains a regular and established place of business in the district, its "HQ2" in Arlington, and because certain features of the accused products are allegedly developed there.
  • Core Dispute: Plaintiff alleges that Defendant’s Amazon Echo products and the associated Amazon Alexa service infringe three patents related to audio noise reduction, communication status notification using light-emitting devices, and adaptive volume control methods.
  • Technical Context: The technology at issue resides in the domain of audio processing and user interface feedback for voice-activated smart devices, a significant and highly competitive consumer electronics market.
  • Key Procedural History: The complaint notes that the Patents-in-Suit were originally developed by engineers at JVC Kenwood (“JVCK”), a major audio processing company, and were subsequently acquired by Plaintiff SoundClear.

Case Timeline

Date Event
2011-09-15 ’259 Patent Priority Date
2012-02-20 ’374 Patent Priority Date
2015-05-12 ’259 Patent Issue Date
2015-06-30 ’374 Patent Issue Date
2015-12-07 ’819 Patent Priority Date
2017-10-31 ’819 Patent Issue Date
2024-05-01 Complaint Filing Date

II. Technology and Patent(s)-in-Suit Analysis

U.S. Patent No. 9,031,259 - “Noise Reduction Apparatus, Audio Input Apparatus, Wireless Communication Apparatus, and Noise Reduction Method”

  • Issued: May 12, 2015 (Compl. ¶30)

The Invention Explained

  • Problem Addressed: The patent’s background section states that conventional noise-canceling functions do not perform well in high-noise environments, failing to satisfy the demand for high-quality voice sound in communications devices (’259 Patent, col. 1:26-34).
  • The Patented Solution: The invention proposes a system using at least two microphones to improve noise reduction. The system first determines if a captured sound is a "speech segment." If so, it then detects the incoming direction of the voice based on signals from both microphones. Finally, an adaptive filter performs a noise reduction process using the inputs from both microphones, guided by the speech segment and voice direction information (’259 Patent, Abstract; col. 2:1-16). Figure 1 provides a block diagram of this architecture, showing the flow from microphones (11, 12) to the speech segment determiner (15) and voice direction detector (16), which control an adaptive filter (18).
  • Technical Importance: This approach aimed to enhance voice clarity for mobile and hands-free communication devices by intelligently adapting the noise cancellation based on whether a user is speaking and from which direction, a key challenge in noisy, real-world settings (Compl. ¶34).

Key Claims at a Glance

  • The complaint asserts infringement of at least independent Claim 1 (Compl. ¶61).
  • Essential elements of Claim 1 include:
    • A speech segment determiner configured to determine whether or not a sound picked up by at least either a first microphone or a second microphone is a speech segment and to output speech segment information.
    • A voice direction detector configured, when receiving the speech segment information, to detect a voice incoming direction... based on a first sound pick-up signal obtained based on a sound picked up by the first microphone and a second sound pick-up signal obtained based on a sound picked up by the second microphone.
    • An adaptive filter configured to perform a noise reduction process using the first and second sound pick-up signals based on the speech segment information and the voice incoming-direction information.

U.S. Patent No. 9,070,374 - “Communication Apparatus and Condition Notification Method for Notifying a Used Condition of Communication Apparatus by Using a Light-Emitting Device Attached to Communication Apparatus”

  • Issued: June 30, 2015 (Compl. ¶37)

The Invention Explained

  • Problem Addressed: For handset microphones used in noisy environments, the changing position relative to the user's mouth can create poor input conditions for noise cancellation, leading to the transmission of unclear voice sounds without the user’s awareness (’374 Patent, col. 1:36-52).
  • The Patented Solution: The invention provides a method to notify the user of the device's operational status using a light-emitting device (LED). The system determines the "communication mode" (e.g., standby vs. transmission) and the "sound pick-up state" of the voice signal. Based on these two inputs, a control unit adjusts the LED's state (e.g., off, on, or blinking) to provide real-time feedback to the user on the quality of their voice transmission (’374 Patent, Abstract; col. 2:1-9).
  • Technical Importance: The technology provides users of communication devices with immediate visual feedback, enabling them to adjust how they are using the device to ensure clear transmission, which is particularly valuable in environments where audio quality is critical (Compl. ¶41, ¶46).

Key Claims at a Glance

  • The complaint asserts infringement of at least independent Claim 9 (Compl. ¶79).
  • Essential elements of Claim 9 include:
    • A first pick-up unit configured to pick up a voice sound.
    • A transmitter unit configured to transmit the voice sound... as a first speech signal.
    • A communication-mode switching unit configured to switch a communication mode between a standby mode... and a transmission mode.
    • A sound pick-up state determination unit configured to determine a pick-up state of the voice sound.
    • A light emission device configured to emit light.
    • A control unit configured to control the light-emitting device... based on the communication mode... and the pick-up state of the voice sound.

U.S. Patent No. 9,804,819 - “Receiving Apparatus and Control Method”

  • Issued: October 31, 2017 (Compl. ¶47)

The Invention Explained

  • Problem Addressed: The patent addresses the problem of outputting audio at an "unintentional volume level as soon as the locked state is canceled" (’819 Patent, col. 1:35-40).
  • The Patented Solution: The invention describes a volume control system with a "locked state" where volume is fixed and a "non-locked state" where it is variable. A predetermined operating part can temporarily switch the system to the non-locked state to adjust volume, after which the system returns to the locked state, updating the lock value based on the adjustment (’819 Patent, Abstract).

Key Claims at a Glance

  • Asserted Claims: The complaint asserts infringement of at least independent Claim 8 (Compl. ¶101).
  • Accused Features: The complaint accuses Amazon's "Whisper Mode" and "Adaptive Volume" features, which allegedly permit the volume of Alexa's responses to automatically deviate from a user-set level (the "locked state") to a whisper or a louder level to overcome ambient noise (the "non-locked state") (Compl. ¶106, ¶107).

III. The Accused Instrumentality

Product Identification

  • The complaint identifies a broad range of "Amazon Echo Products" (including various generations of Echo, Echo Dot, Echo Show, and others) and the "Amazon Alexa" voice service (collectively, "the Amazon Products and Services") (Compl. ¶15).

Functionality and Market Context

  • The accused products are voice-activated smart devices that utilize a multi-microphone array and "beamforming technology" to isolate a user's voice from ambient noise (Compl. ¶65). The system is designed for "far-field voice recognition" and employs algorithms such as a "signal-to-interference ratio beam selector" to determine the direction of a user's voice (Compl. ¶65, ¶71). The devices perform "adaptive noise cancellation" to improve clarity (Compl. ¶74). They also provide user feedback via a prominent LED light ring, which indicates device status such as "listening" or "microphone is turned off" (Compl. ¶70, ¶92). Further, the devices include features like "Adaptive Volume" and "Whisper Mode," which automatically adjust the playback volume in response to ambient noise levels or the user's speaking volume (Compl. ¶107). The complaint positions these products as part of Amazon's large consumer electronics business (Compl. ¶20).

No probative visual evidence provided in complaint.

IV. Analysis of Infringement Allegations

’259 Patent Infringement Allegations

Claim Element (from Independent Claim 1) Alleged Infringing Functionality Complaint Citation Patent Citation
a speech segment determiner configured to determine whether or not a sound picked up... is a speech segment... The system's "keyword spotting" algorithm or Wake Word Detection ("WWD") functionality, which determines if a sound is a "wake word" or subsequent voice speech. ¶63 col. 4:41-48
a voice direction detector configured... to detect a voice incoming direction... based on a first sound pick-up signal... and a second sound pick-up signal... The "SIR Beam Selector" algorithm, which uses signals from the microphone array to learn and predict the locations of audio sources, thereby detecting the direction of the user's voice. ¶68, ¶71 col. 17:11-15
an adaptive filter configured to perform a noise reduction process... based on the speech segment information and the voice incoming-direction information The adaptive noise cancellation ("ANC" or "S-ANC") process, which uses an adaptive filter and is performed after the system has selected the "beam(s) of interest" based on direction. ¶73, ¶74 col. 2:11-16
  • Identified Points of Contention:
    • Scope Questions: A central question may be whether the detection of a single "wake word" (e.g., "Alexa") by the accused products falls within the scope of determining a "speech segment," a term the patent specification links to the analysis of vowel sounds and formants (’259 Patent, col. 4:60-62).
    • Technical Questions: The analysis may examine whether the accused "SIR Beam Selector" algorithm, which allegedly "learns" and "predicts" audio source locations, performs the same function as the claimed "voice direction detector," which is described as detecting direction based on phase differences or power information from microphone signals (’259 Patent, col. 17:1-6).

’374 Patent Infringement Allegations

Claim Element (from Independent Claim 9) Alleged Infringing Functionality Complaint Citation Patent Citation
a first pick-up unit configured to pick up a voice sound The MEMS microphones in the Echo devices that are configured to capture a user's spoken wake word and voice requests. ¶81 col. 7:4-7
a transmitter unit configured to transmit the voice sound... as a first speech signal The component within the Echo device that transmits the captured voice data (wake word and/or voice request) to Amazon's secure cloud for processing. ¶82 col. 7:12-16
a communication-mode switching unit configured to switch a communication mode between a standby mode... and a transmission mode... The Alexa Voice Assistant software, which switches the device from an idle or listening state ("standby mode") to a state where it streams voice input for processing upon detection of a wake word ("transmission mode"). ¶84, ¶86 col. 8:36-40
a sound pick-up state determination unit configured to determine a pick-up state of the voice sound A combination of hardware and software components, including acoustic echo cancellation (AEC), adaptive noise cancellation (S-ANC), SIR beam selector, and keyword spotting algorithms, which together determine the characteristics of the captured voice sound. ¶87, ¶88 col. 2:16-17
a light emission device configured to emit light The LED ring light integrated into the Amazon Echo products. ¶90 col. 7:42-43
a control unit configured to control the light-emitting device... based on the communication mode... and the pick-up state... The control unit that changes the LED ring light's color and animation (e.g., cyan spotlight for listening, red for microphone off) based on the device's operational mode and whether it is picking up voice. ¶91, ¶92 col. 8:16-24
  • Identified Points of Contention:
    • Scope Questions: The dispute may turn on whether the operational states of a modern, cloud-connected smart speaker (e.g., idle vs. streaming voice data) can be mapped to the claimed "standby mode" and "transmission mode," terms which the patent's context suggests are rooted in simplex push-to-talk radio technology (’374 Patent, Fig. 1).
    • Technical Questions: It raises the question of whether the complaint's aggregation of multiple, distinct software modules (AEC, S-ANC, beam selector, keyword spotting) satisfies the singular limitation of "a sound pick-up state determination unit," or if the claim requires a more integrated component.

V. Key Claim Terms for Construction

  • Term from ’259 Patent, Claim 1: "speech segment"

    • Context and Importance: This term's construction is fundamental. The infringement allegation hinges on Amazon's "wake word" detection qualifying as the determination of a "speech segment." A narrow definition could restrict the claim to continuous speech, while a broader one could encompass single keywords.
    • Intrinsic Evidence for a Broader Interpretation: The plain language of Claim 1 refers to determining "whether or not a sound... is a speech segment," without imposing further structural or durational requirements (’259 Patent, col. 58:20-22).
    • Intrinsic Evidence for a Narrower Interpretation: The specification repeatedly links the "speech segment determination technique" to the analysis of "a vowel sound that is a main component of a voice sound" and the detection of its "frequency spectra" or "formants" (’259 Patent, col. 4:60-62). This could support an argument that the term requires more than just identifying a keyword.
  • Term from ’374 Patent, Claim 9: "sound pick-up state determination unit"

    • Context and Importance: This is a functional limitation defined by what it does: "determine a pick-up state of the voice sound." Practitioners may focus on this term because the complaint alleges that a collection of separate software components collectively meets this single limitation (Compl. ¶87). Its construction will determine whether such an aggregation is permissible.
    • Intrinsic Evidence for a Broader Interpretation: The claim uses broad functional language ("unit configured to determine"), which may suggest that any combination of components performing the recited function could satisfy the element.
    • Intrinsic Evidence for a Narrower Interpretation: The specification describes this unit in the context of "evaluat[ing] speech quality of a speech signal" and its "noise-cancellation effect" (’374 Patent, col. 2:32-35; Fig. 4). This could support a narrower construction requiring the "unit" to perform specific quality evaluations, rather than merely determining the presence of a voice command.

VI. Other Allegations

  • Indirect Infringement: The complaint does not provide sufficient detail for analysis of indirect infringement. While the "Claims for Relief" sections mention acts by "Defendants (or those acting on their behalf)," no specific facts are pleaded to establish the knowledge and intent required for induced infringement or the elements of contributory infringement (Compl. ¶61, ¶79, ¶101).
  • Willful Infringement: The complaint's prayer for relief seeks "enhanced damages for willful infringement" (Compl. p. 28). However, the body of the complaint does not allege any specific facts to support this claim, such as pre-suit knowledge of the patents by the Defendant.

VII. Analyst’s Conclusion: Key Questions for the Case

  • A core issue will be one of technological translation: can claim terms and system architectures rooted in the context of traditional communication devices (e.g., "speech segment," "transmission mode," "PTT unit") be construed to read on the distinct operational paradigms of a modern, cloud-based, "always-on" smart device ecosystem?
  • A key evidentiary question will be one of structural and functional correspondence: does the accused system's distributed collection of software modules for keyword spotting, beamforming, and noise cancellation collectively constitute the singular "determiner" and "determination unit" elements recited in the claims, or is there a fundamental mismatch in the claimed versus accused architectures?
  • A central question for the ’819 patent will be one of definitional scope: does Amazon's "Adaptive Volume" feature, which temporarily overrides a user-set volume, operate within the claimed framework of switching between a "locked state" and a "non-locked state" before updating and returning to a "lock value"?