DCT

1:22-cv-01344

Nielsen US LLC v. ACRCloud Ltd

Key Events

Complaint

complaint

I. Executive Summary and Procedural Information

Parties & Counsel:
- Plaintiff: The Nielsen Company (US), LLC and Gracenote, Inc. (Delaware)
- Defendant: ACRCloud, Ltd. (Hong Kong SAR, China)
- Plaintiff’s Counsel: Potter Anderson & Corroon LLP; Kelley Drye & Warren LLP
Case Identification: 1:22-cv-01344, D. Del., 10/12/2022
Venue Allegations: Venue is alleged to be proper under 28 U.S.C. § 1391(c)(3) on the basis that the defendant is a foreign entity not residing in the United States.
Core Dispute: Plaintiffs allege that Defendant’s audio recognition platform infringes four U.S. patents related to generating audio signatures for content identification and architecting systems for audience measurement.
Technical Context: The lawsuit concerns Automatic Content Recognition (ACR) technology, which enables the automated identification of media content, a foundational capability for modern audience analytics and media monitoring services.
Key Procedural History: The complaint does not mention any prior litigation, Inter Partes Review (IPR) proceedings, or licensing history related to the asserted patents.

Case Timeline

Date	Event
2000-08-23	’503 Patent Priority Date
2003-10-17	’700 Patent Priority Date
2004-08-18	’889 Patent Priority Date
2004-08-31	’823 Patent Priority Date
2009-11-24	’823 Patent Issue Date
2010-08-24	’889 Patent Issue Date
2011-03-08	’503 Patent Issue Date
2011-11-22	’700 Patent Issue Date
2022-10-12	Complaint Filing Date

II. Technology and Patent(s)-in-Suit Analysis

U.S. Patent No. 7,783,889 - "Methods and Apparatus for Generating Signatures"

The Invention Explained

Problem Addressed: The patent describes prior art methods for generating digital spectral signatures from audio as relying on "interframe processing," which involves processing data from multiple frames of audio (Compl. ¶25). This approach was described as adding computational and implementation complexity, particularly for large-scale systems with extensive media libraries (Compl. ¶25, ¶28).
The Patented Solution: The invention discloses a more computationally efficient "intraframe" method for generating signatures (’889 Patent, col. 2:56-61). The process involves identifying two frequency components and their respective power levels within a single frame of media samples, determining a "descriptor" based on a comparison of those two power levels, and then generating a signature from that descriptor (Compl. ¶18, ¶26). This avoids the complexity of combining data across multiple frames (Compl. ¶28).
Technical Importance: This approach is presented as providing a robust, discriminative, and computationally efficient method for generating audio fingerprints, which is a critical requirement for operating large-scale media identification systems (Compl. ¶20, ¶28-31).

Key Claims at a Glance

The complaint asserts independent claims 1, 8, and 14 (Compl. ¶17, ¶66).
Independent Claim 8 includes the following essential elements:
- Performing a spectral transform on a first frame of media samples to identify first and second frequency components with first and second spectral power values.
- Determining a first descriptor based on a comparison of the first and second spectral power values.
- Identifying third and fourth spectral power values in a second, consecutive and overlapping frame of media samples.
- Determining a second descriptor based on a comparison of the third and fourth spectral power values.
- Generating first and second signatures based on the first and second descriptors.
The complaint reserves the right to assert dependent claims (Compl. ¶66).

U.S. Patent No. 8,065,700 - "Methods and Apparatus for Identifying Audio/Video Content Using Temporal Signal Characteristics"

The Invention Explained

Problem Addressed: The patent states that prior art techniques for content identification were limited or could lead to incorrect identification (’700 Patent, col. 1:62-2:2; Compl. ¶38).
The Patented Solution: The invention discloses a method for identifying content using a signature based on temporal characteristics of an audio signal (’700 Patent, Abstract). Specifically, the signature is based on "a plurality of time intervals between audio features," such as peak signal values or zero crossings (Compl. ¶34, ¶35, ¶39). This approach uses variable time intervals defined by the audio's intrinsic features, rather than preset, fixed-length intervals (Compl. ¶39).
Technical Importance: The creation of signatures based on time intervals tied to discriminative audio features is alleged to enhance the discrimination and robustness of content identification systems, allowing for more precise detection even in the presence of noise (Compl. ¶37, ¶41, ¶42).

Key Claims at a Glance

The complaint asserts independent claims 1, 8, and 15 (Compl. ¶34, ¶73).
Independent Claim 1 includes the following essential elements:
- Receiving a signature associated with audio content presented at a monitored site.
- The signature being based on a plurality of time intervals between audio features of the audio content.
- Identifying the content presented at the monitored site based on the received signature.
The complaint reserves the right to assert dependent claims (Compl. ¶73).

Multi-Patent Capsule: U.S. Patent No. 7,904,503

Patent Identification: U.S. Patent No. 7,904,503, “Method of Enhancing Rendering of Content Item, Client System and Server System,” issued March 8, 2011.
Technology Synopsis: The ’503 Patent addresses a problem where prior art systems required embedding wasteful auxiliary data (like watermarks or metadata) into content to retrieve related information (Compl. ¶47-49). The patented solution involves a server system receiving a portion of the content itself, computing a hash value from that content portion, and using the hash value as an identifier to search a database and retrieve enhanced content (e.g., song titles, artist names) (Compl. ¶45, ¶46).
Asserted Claims: The complaint asserts independent claim 1 (Compl. ¶46, ¶79).
Accused Features: The complaint alleges that when audio content is received at the ACRCloud platform, "identifiers such as hash values can be generated from the audio and thereafter used to access enhanced content in databases" (Compl. ¶60).

Multi-Patent Capsule: U.S. Patent No. 7,623,823

Patent Identification: U.S. Patent No. 7,623,823, “Detecting and Measuring Exposure to Media Content Items,” issued November 24, 2009.
Technology Synopsis: The ’823 Patent addresses challenges in prior art media measurement systems related to handling large loads and geographic diversity, which could cause delays in identifying user exposure (Compl. ¶57). The patented solution describes a system with a "stream collection server selector" that selects one of multiple data stream collection servers to receive user content data, with the selection based on factors such as load balancing and the geographic location of the user's device (Compl. ¶53, ¶57).
Asserted Claims: The complaint asserts independent claim 6 (Compl. ¶85).
Accused Features: The complaint alleges that ACRCloud uses servers and databases associated with Amazon Web Services ("AWS") in the United States and around the world to provide its audio recognition platform, which may suggest the use of a geographically distributed and load-balanced architecture (Compl. ¶59).

III. The Accused Instrumentality

Product Identification

Defendant ACRCloud’s "audio recognition platform," including its associated software, systems, and methods (Compl. ¶58).

Functionality and Market Context

The accused platform performs "audio fingerprint-based content identification" (Compl. ¶58). Functionally, it receives either audio content or pre-generated fingerprints from user client devices (Compl. ¶60). When raw audio is received, the platform generates fingerprints (Compl. ¶60). These fingerprints are then compared against a background audio database to identify the media, and the identification results are returned to the client (Compl. ¶61, ¶62). The complaint alleges that ACRCloud’s technical methods are described in a paper titled “Quad-Based Audio Fingerprinting Robust to Time and Frequency Scaling” (Compl. ¶61).
The platform is marketed for services including audience measurement, broadcast monitoring, and live channel detection (Compl. ¶58). The complaint alleges the platform relies on Amazon Web Services ("AWS") for its server and database infrastructure in the United States and globally (Compl. ¶59).

No probative visual evidence provided in complaint.

IV. Analysis of Infringement Allegations

The complaint references claim chart exhibits (Exhibits M, N, O, P) that are not provided in the filed document. The infringement theory is therefore summarized in prose based on the complaint's narrative allegations.

’889 Patent Infringement Allegations

The complaint alleges that ACRCloud's platform infringes by generating spectral signatures using an intraframe method similar to that claimed in the ’889 Patent (Compl. ¶17-18, ¶62). The primary evidence cited is a third-party technical paper that allegedly describes ACRCloud's methods (Compl. ¶61). According to the complaint's interpretation of this paper, the accused system processes audio files by computing a spectrogram, which indicates the power levels of frequency components within a frame (Compl. ¶62). It then allegedly compares these power levels within a single frame to identify a "spectral peak," which is used as a descriptor to generate a signature (Compl. ¶62). This narrative attempts to map the accused functionality to the patent's core concept of generating signatures from comparisons made entirely within one audio frame.

Identified Points of Contention:
- Scope Questions: A central question may be whether identifying a "spectral peak" as described in the cited paper constitutes the claimed steps of "identifying first and second frequency components" and "determining a first descriptor ... based on a comparison of the first and second spectral power values." The defense may argue that finding a local maximum is technically distinct from the direct comparison of two pre-selected frequency components as claimed.
- Technical Questions: What evidence does the complaint provide, beyond citation to a third-party paper, that ACRCloud’s commercial platform actually implements the specific steps described? The connection between the academic paper and the accused commercial product will likely be a key factual dispute.

’700 Patent Infringement Allegations

The complaint's narrative for the ’700 Patent is less detailed. It alleges that ACRCloud's infringing products and methods include all elements of the asserted claims (Compl. ¶74), but it does not specify how the accused system generates signatures "based on a plurality of time intervals between audio features" such as peaks and zero crossings, which is the central inventive concept of the ’700 Patent. The infringement theory appears to rest on the general function of the accused system in identifying content, without detailing the specific technical mechanism that allegedly maps to the patent's claims.

Identified Points of Contention:
- Scope Questions: The complaint's description of the accused technology focuses on spectral analysis within a frame (for the '889 patent), while the ’700 Patent claims a method based on temporal analysis between features. This raises the question of whether a single accused process can infringe both patents, or if there is a fundamental mismatch in the asserted technical theories.
- Technical Questions: A primary evidentiary question will be whether Plaintiffs can demonstrate that the accused ACRCloud system, in fact, measures and uses time intervals between signal peaks or zero crossings to form its audio fingerprints as required by the claims of the ’700 Patent. The complaint does not present specific factual allegations on this point.

V. Key Claim Terms for Construction

The Term: "descriptor" (’889 Patent, Claim 8)
Context and Importance: This term is central to the inventive step of the ’889 Patent. The infringement analysis will depend on whether the "spectral peak" allegedly identified by the accused system (Compl. ¶62) qualifies as a "descriptor...based on a comparison" of two spectral power values. Practitioners may focus on this term because its construction will determine if the accused method falls within the claim scope.
Intrinsic Evidence for Interpretation:
- Evidence for a Broader Interpretation: The specification may define a "descriptor" broadly as any data derived from an intraframe analysis of spectral components, which could support Plaintiffs' infringement theory (’889 Patent, col. 2:37-41).
- Evidence for a Narrower Interpretation: The complaint notes that the claimed approach can be a "sign-based approach based on comparisons of power levels," for example, where a descriptor is based on whether one power level is greater or less than another (Compl. ¶21, ¶30). This could support a narrower construction where the "descriptor" must be the direct binary result of a comparison, potentially distinguishing it from a "spectral peak" value.
The Term: "signature...based on a plurality of time intervals between audio features" (’700 Patent, Claim 1)
Context and Importance: This phrase defines the core technical mechanism of the ’700 Patent. The viability of the infringement claim hinges on whether the accused ACRCloud fingerprints are constructed using this specific temporal measurement technique.
Intrinsic Evidence for Interpretation:
- Evidence for a Broader Interpretation: The term "audio features" is not explicitly defined in the claim itself, potentially allowing for a broad reading that covers various signal characteristics beyond those explicitly mentioned in the specification.
- Evidence for a Narrower Interpretation: The specification repeatedly uses "peak values and zero crossings" as primary examples of "audio features" (Compl. ¶35, ¶41). Dependent claims also specify these features (’700 Patent, Claims 2, 9, 16). A defendant may argue these examples limit the scope of "audio features" to these specific types, potentially excluding other signal characteristics that might be used by the accused system.

VI. Other Allegations

Willful Infringement: The complaint alleges willful infringement for all four asserted patents. The basis for this allegation is Defendant's knowledge of the patents "as of the service date of this Complaint," indicating a theory of post-suit willfulness only (Compl. ¶68, ¶75, ¶81, ¶87).

VII. Analyst’s Conclusion: Key Questions for the Case

A core issue will be one of evidentiary mapping: Can Plaintiffs produce evidence demonstrating that ACRCloud’s commercial platform actually uses the specific technical methods described in the cited third-party academic paper, and that these methods meet every limitation of the asserted ’889 Patent claims?
A key technical question will be one of competing infringement theories: Does the complaint allege two fundamentally different signature generation methods (intraframe spectral comparison for the ’889 Patent versus temporal intervals between features for the ’700 Patent) against a single accused system? The court will need to determine if the accused system can plausibly infringe claims from both patents or if the theories are mutually exclusive in this context.
A central architectural question for the ’823 Patent will be one of functional scope: Does the use of a generic, geographically distributed cloud computing platform like AWS, without more specific allegations, satisfy the claim requirement for a "stream collection server selector" that actively chooses servers based on "load and geographic proximity"?