DCT

1:22-cv-00680

VideoLabs Inc v. Meta Platforms Inc

Key Events

Amended Complaint

I. Executive Summary and Procedural Information

Parties & Counsel:
- Plaintiff: VideoLabs, Inc., and VL Collective IP LLC (Delaware)
- Defendant: Meta Platforms, Inc.; Instagram, Inc.; WhatsApp LLC; Facebook Technologies, LLC (Delaware)
- Plaintiff’s Counsel: Reichman Jorgensen Lehman & Feldberg LLP; Farnan LLP
Case Identification: 1:22-cv-00680, D. Del., 06/06/2024
Venue Allegations: Venue is based on Defendants’ incorporation in the State of Delaware, which establishes residency within the District.
Core Dispute: Plaintiff alleges that video processing, secure transmission, and object detection functionalities within Defendants’ social media and communication platforms—including Facebook, Instagram, WhatsApp, and Oculus/Portal devices—infringe five U.S. patents.
Technical Context: The technologies at issue relate to foundational aspects of modern digital media, including video compression standards (H.264), secure real-time communication protocols (SRTP/WebRTC), and AI-based computer vision for object detection.
Key Procedural History: The complaint alleges that Plaintiff initiated licensing discussions with Meta representatives in October 2019, provided notice of the patent portfolio’s relevance to Meta’s products, and made multiple follow-up attempts, but that Meta stated in June 2020 it was not interested in good faith licensing discussions.

Case Timeline

Date	Event
2000-01-18	Priority Date for U.S. Patent No. 7,266,682
2002-04-15	Priority Date for U.S. Patent Nos. 8,139,878; 7,769,238; 7,970,059
2004-05-27	Priority Date for U.S. Patent No. 7,436,980
2007-09-04	U.S. Patent No. 7,266,682 Issued
2008-10-14	U.S. Patent No. 7,436,980 Issued
2010-08-03	U.S. Patent No. 7,769,238 Issued
2011-06-28	U.S. Patent No. 7,970,059 Issued
2012-03-20	U.S. Patent No. 8,139,878 Issued
2019-10-02	Plaintiff allegedly first contacted Meta regarding licensing
2020-06-26	Meta allegedly stated it was not interested in licensing discussions
2024-06-06	First Amended Complaint Filed

II. Technology and Patent(s)-in-Suit Analysis

U.S. Patent No. 8,139,878 - "Picture Coding Method and Picture Decoding Method"

Patent Identification: U.S. Patent No. 8,139,878, "Picture Coding Method and Picture Decoding Method," issued March 20, 2012 (Compl. ¶35).

The Invention Explained

Problem Addressed: The patent describes that prior art video compression techniques were inefficient because they used the same coding tables for different types of content and prediction, and compression efficiency would decrease as video quality increased (’878 Patent, col. 1:33-44; Compl. ¶55).
The Patented Solution: The invention is a picture coding (encoding) method that improves compression efficiency by using a technique called Context-based Adaptive Variable Length Coding (CAVLC) (’878 Patent, col. 1:49-52). It determines a predictive value for the number of non-zero coefficients in a current block of video data based on the coefficients of neighboring, already-coded blocks. This predictive value is then used to select an optimal variable length code table for encoding the current block, adapting the compression scheme to the local context of the image (’878 Patent, col. 9:34-42; Compl. ¶55).
Technical Importance: The compression techniques described are used in the H.264 video codec, which represented a significant advance over prior standards like MPEG-2 and became ubiquitous in video streaming (Compl. ¶59).

Key Claims at a Glance

The complaint asserts at least independent claim 1 (Compl. ¶122).
Essential elements of claim 1 include:
- A transmitting apparatus with an audio processing unit, a picture coding unit, and a multiplexing unit.
- The picture coding unit codes a block image by generating a residual block from an original block and a predictive block.
- A coefficient number coding unit codes the total number of non-zero coefficients in the current block.
- A determining unit determines a predictive value for this number based on non-zero coefficients in a coded block on the "periphery" of the current block.
- A selecting unit selects a variable length code table based on the predictive value.
- A variable length coding unit performs the coding using the selected table.
The complaint does not explicitly reserve the right to assert dependent claims.

U.S. Patent No. 7,769,238 - "Picture Coding Method and Picture Decoding Method"

Patent Identification: U.S. Patent No. 7,769,238, "Picture Coding Method and Picture Decoding Method," issued August 3, 2010 (Compl. ¶36).

The Invention Explained

Problem Addressed: The ’238 Patent shares a specification with the ’878 Patent and thus addresses the same problems of inefficient video compression in prior art systems (’238 Patent, col. 1:33-44; Compl. ¶52, n.20).
The Patented Solution: This patent claims the corresponding decoding method for the CAVLC invention. A receiving apparatus decodes a video stream by determining a predictive value for the number of non-zero coefficients in a current block based on already-decoded neighboring blocks. This predictive value allows the decoder to select the correct variable length code table to properly interpret the coded bitstream and reconstruct the video data (’238 Patent, col. 4:1-12; Compl. ¶52).
Technical Importance: This decoding method is the necessary counterpart to the encoding invention of the ’878 Patent and is an essential component of H.264-compliant decoders (Compl. ¶59).

Key Claims at a Glance

The complaint asserts at least independent claim 1 (Compl. ¶155).
Essential elements of claim 1 include:
- A receiving apparatus with a demultiplexing unit, an audio processing unit, and a picture decoding unit.
- The picture decoding unit includes a block decoding unit that reproduces a block image from a residual block and a predictive block.
- A coefficient number decoding unit decodes data to obtain the number of non-zero coefficients.
- A determining unit determines a predictive value for this number based on non-zero coefficients in a decoded block on the "periphery" of the current block.
- A selecting unit selects a variable length code table based on the predictive value.
- A variable length decoding unit performs decoding on the coded stream using the selected table.
The complaint does not explicitly reserve the right to assert dependent claims.

U.S. Patent No. 7,970,059 - "Variable Length Coding Method and Variable Length Decoding Method"

Patent Identification: U.S. Patent No. 7,970,059, "Variable Length Coding Method and Variable Length Decoding Method," issued June 28, 2011 (Compl. ¶37).
Technology Synopsis: The patent describes an advance in video compression using Context Adaptive Binary Arithmetic Coding (CABAC), which relies on probability tables. The invention involves selecting and switching between probability tables in a predetermined, unidirectional manner based on the absolute value of data being coded compared to a threshold, which is alleged to increase coding efficiency by adapting to the data's context (’059 Patent, col. 2:52-56; Compl. ¶¶56-58).
Asserted Claims: At least independent claim 3 (Compl. ¶196).
Accused Features: Defendants' products that use H.264 entropy coding with the CABAC algorithm (Compl. ¶¶96, 195).

U.S. Patent No. 7,266,682 - "Method and System for Transmitting Data from a Transmitter to a Receiver and Transmitter and Receiver Therefore"

Patent Identification: U.S. Patent No. 7,266,682, "Method and System for Transmitting Data from a Transmitter to a Receiver and Transmitter and Receiver Therefore," issued September 4, 2007 (Compl. ¶60).
Technology Synopsis: The patent addresses security flaws in real-time data transmission protocols like RTP. The invention provides for transmitter-to-receiver authentication at the application layer by inserting authentication data at the end of an RTP packet payload. This allows the receiver to verify the transmitter is known before accepting and processing the packet, improving security and performance without the delays of other protocols (’682 Patent, col. 3:45-54; Compl. ¶¶64, 66).
Asserted Claims: At least independent claim 1 (Compl. ¶236).
Accused Features: Defendants' products that implement Web Real-Time Communication (WebRTC) or Secure Real-Time Transport Protocol (SRTP) for services like video calls (Compl. ¶¶103, 238).

U.S. Patent No. 7,436,980 - "Graphical Object Models For Detection And Tracking"

Patent Identification: U.S. Patent No. 7,436,980, "Graphical Object Models For Detection And Tracking," issued October 14, 2008 (Compl. ¶76).
Technology Synopsis: The patent addresses challenges in computer-based object detection within images and video. The invention uses a two-layer, spatio-temporal graphical model to represent an object (e.g., a person) and its constituent components (e.g., head, arms, legs). By determining the probability of an object's presence based on this component-based model, the system can more accurately and efficiently detect objects, including those that are partially occluded or articulated (’980 Patent, col. 1:41-42, col. 4:18-24; Compl. ¶¶80-82).
Asserted Claims: At least independent claim 1 (Compl. ¶258).
Accused Features: Defendants' object identification functionalities, including the Detectron, Detectron2, and D2Go software libraries used for computer vision tasks (Compl. ¶¶117, 260).

III. The Accused Instrumentality

Product Identification

The complaint identifies three overlapping groups of accused products and services across the Meta, Instagram, WhatsApp, and Facebook Technologies (Oculus/Portal) platforms (Compl. ¶¶84, 103, 117).

Coding Accused Products: Functionalities that use H.264 entropy coding (CAVLC or CABAC) for video compression, such as Facebook Live, Instagram Stories, WhatsApp video calls, and Oculus video (Compl. ¶¶84, 85, 87, 89).
'682 Accused Products: Functionalities that implement Web Real-Time Communication (WebRTC) or Secure Real-Time Transport Protocol (SRTP), including Facebook Messenger Video Chat, Instagram Live Video Chat, and VR Chat (Compl. ¶¶103, 104, 106, 108).
'980 Accused Products: Functionalities that perform object identification, specifically those using the Detectron, Detectron2, and D2Go libraries for computer vision tasks (Compl. ¶117).

Functionality and Market Context

The accused functionalities are core features of Defendants’ services, which enable the efficient streaming of video, secure real-time voice and video calls, and AI-powered content analysis. The complaint alleges these platforms are among the world's most popular, with services like Facebook Messenger supporting over 400 million voice and video calls per month using the accused WebRTC technology (Compl. ¶¶8, 114). The complaint alleges that Detectron2 is a widely adopted open-source platform used for object detection in production use cases at Facebook (Compl. ¶119). A screenshot of an ISO Media File illustrates how multiplexed audio and video tracks are stored for processing by the accused H.264 codecs (Compl. ¶127).

IV. Analysis of Infringement Allegations

8,139,878 Infringement Allegations

Claim Element (from Independent Claim 1)	Alleged Infringing Functionality	Complaint Citation	Patent Citation
A transmitting apparatus which transmits multiplexed data which is obtained by multiplexing coded audio data and coded picture data...	Accused products include encoders that multiplex coded audio and video into a single stream, such as in the ISO Media File Format.	¶127	col. 2:19-27
a picture coding unit configured to code picture data to obtain coded picture data...	Accused products incorporate an H.264 video codec to code picture data into an H.264-compliant bitstream.	¶129	col. 10:1-4
generating a residual block image from a block image of the respective blocks and a predictive block image obtained by intra-picture prediction or inter-picture prediction...	An H.264-compliant encoder generates a predictive macroblock using intra- or inter-picture prediction and subtracts it from the original to create a residual block. This process is illustrated with a series of images showing the original, predicted, and residual macroblocks (Compl. ¶134).	¶¶134-135	col. 10:10-14
a determining unit configured to determine a predictive value for the number of non-zero coefficients included in the current block based on the number of non-zero coefficients included in a coded block located on a periphery of the current block...	An H.264-compliant encoder uses previously-processed macroblocks, including those to the left and above the current macroblock, to predict the number of non-zero coefficients in the current block.	¶140	col. 2:1-5
a selecting unit configured to select a variable length code table based on the determined predictive value...	The encoder uses the predictive value (variable nC) to select one of six specified variable length coding tables from the H.264 standard. A diagram illustrates this context-adaptive selection process (Compl. ¶143).	¶142	col. 2:5-9
a variable length coding unit configured to perform variable length coding... by using the selected variable length code table.	A Context-Adaptive Variable-Length Coding (CAVLC) unit uses the selected table to perform variable length coding on the number of non-zero coefficients to generate the bitstream.	¶144	col. 2:9-12

7,769,238 Infringement Allegations

Claim Element (from Independent Claim 1)	Alleged Infringing Functionality	Complaint Citation	Patent Citation
A receiving apparatus which receives multiplexed data... said receiving apparatus comprising: a demultiplexing unit configured to separate the multiplexed data...	Accused products receive and demultiplex streams, such as an ISO Media File, into separate coded audio and coded picture data for respective decoders.	¶¶160-161	col. 2:19-27
a picture decoding unit configured to decode the separated coded picture data...	Accused products incorporate an H.264 video codec to decode an H.264-compliant bitstream.	¶163	col. 2:38-41
said block decoding unit includes... a reproducing unit configured to reproduce a block image of the current block, from the obtained residual block image and a predictive block image...	An H.264 decoder forms a prediction identical to the encoder's, adds it to the decoded residual, and reconstructs the macroblock for display.	¶177	col. 12:47-56
a determining unit configured to determine a predictive value for the number of non-zero coefficients included in the current block based on the number of non-zero coefficients included in a decoded block located on a periphery of the current block...	An H.264-compliant decoder uses previously-processed macroblocks (left and above) to predict the number of non-zero coefficients in the current block to be decoded.	¶179	col. 4:1-5
a selecting unit configured to select a variable length code table based on the determined predictive value...	The decoder uses the predictive value (variable nC) to select the same variable length coding table from the H.264 standard that was used by the encoder.	¶181	col. 4:5-9
a variable length decoding unit configured to perform variable length decoding on a coded stream... by using the selected variable length code table.	The H.264-compliant CAVLC decoder uses the selected table to decode the syntax element (coeff_token) representing the number of non-zero coefficients.	¶182	col. 4:9-12

Identified Points of Contention

Scope Questions (Coding Patents): The infringement allegations for the '878, '238, and '059 patents rely heavily on Defendants' compliance with the H.264 standard. A central question will be whether compliance with the standard necessarily requires practicing the specific methods claimed in the patents, or if non-infringing implementations of H.264 exist that Defendants may use.
Technical Questions ('682 Patent): The infringement analysis for the '682 patent will turn on whether SRTP, as used in WebRTC, functions "as an application protocol on an application layer." A diagram showing the SRTP protocol layer in the WebRTC stack is provided (Compl. ¶243). The defense may argue that SRTP operates at a different layer of the OSI model, raising a technical dispute over the protocol's classification relative to the claim language.
Functional Mapping ('980 Patent): For the '980 patent, a question will be whether the processes used in Meta's Detectron2 library map onto the claimed steps. For example, does a convolutional neural network's processing of an image constitute "measuring the object as a collection of components," and is the "score" or "confidence" output by the model legally and technically equivalent to "determining a probability" as required by the claim? An image with bounding boxes labeled with percentage scores is used to support this allegation (Compl. ¶271).

V. Key Claim Terms for Construction

Term ('878 and '238 Patents): "a block located on a periphery of the current block"

Context and Importance: The infringement theory for the primary coding patents depends on the encoder/decoder using data from neighboring blocks to select a coding table. The definition of "periphery" determines which neighboring blocks are relevant, directly impacting the scope of the infringement analysis.
Intrinsic Evidence for Interpretation:
- Evidence for a Broader Interpretation: The patent abstracts refer more generally to "coded blocks located in the neighborhood of a current block" ('878 Patent, Abstract), which may support a construction not strictly limited to immediately adjacent blocks.
- Evidence for a Narrower Interpretation: The detailed description and figures referenced in the complaint's infringement allegations show the use of blocks immediately above, to the left, and diagonally above the current block, which may support a narrower construction limited to these specific relative positions (Compl. ¶140, citing H.264 Standard, Fig. 6-14).

Term ('682 Patent): "as an application protocol on an application layer"

Context and Importance: This phrase is central to the patent's claimed point of novelty over prior art security protocols that operated at lower layers of the network stack. The entire infringement case for this patent may depend on whether SRTP, as used in the accused products, falls within this definition. Practitioners may focus on this term because the patentee appears to have acted as its own lexicographer.
Intrinsic Evidence for Interpretation:
- Evidence for a Broader Interpretation: The patent specification explicitly defines "application layer" as "the layers above the transport layer (layers above 4)" and "application protocol" as the protocol for communication between transmitter and receiver application layers situated above the transport layer (’682 Patent, col. 2:6-11). Plaintiff will argue this specific definition controls.
- Evidence for a Narrower Interpretation: A defendant may argue that, despite the patent's definition, a person of ordinary skill in the art would understand SRTP to be a profile of RTP that operates at the transport layer or a security sub-layer, not the application layer, and that the patent's definition is ambiguous or inconsistent with the disclosed embodiments.

VI. Other Allegations

Indirect Infringement

The complaint alleges inducement of infringement for all five patents. The allegations are based on Defendants providing and marketing the accused products and publishing documentation, help center articles, and tutorials that allegedly instruct and encourage end-users to perform the infringing methods, such as making video calls or uploading videos (Compl. ¶¶149-151, 187-189, 230-232, 252-254, 278-281).

Willful Infringement

The complaint alleges willful infringement for all five patents. The basis for willfulness is alleged pre-suit knowledge stemming from licensing negotiations Plaintiff initiated with Meta in October 2019. The complaint alleges that during these communications, Plaintiff identified its patent portfolio as relevant to Meta's products and that Meta subsequently declined to engage in licensing discussions, allegedly demonstrating willful blindness or wanton disregard of Plaintiff's patent rights (Compl. ¶¶145-148, 183-186, 226-229, 248-251, 274-277).

VII. Analyst’s Conclusion: Key Questions for the Case

A central issue will be one of standard versus invention: Does compliance with industry standards like H.264 and WebRTC/SRTP, as alleged by the Plaintiff, inherently result in infringement of the specific methods claimed in the '878, '238, '059, and '682 patents, or can a party implement those standards in a non-infringing manner?
A key question will be one of definitional scope: Can the term "application protocol on an application layer," as defined by the '682 patent's specification, be construed to cover the Secure Real-Time Transport Protocol (SRTP) as implemented in modern systems, or is there a fundamental mismatch in its position within the network protocol stack?
A core evidentiary question will be one of technical and functional equivalence: For the '980 patent, does the operation of a modern convolutional neural network, which identifies objects and assigns confidence scores within bounding boxes, perform the specific steps of "measuring the object as a collection of components" and "determining a probability" as recited in the claims?