DCT 1:23-cv-05687

Monument Peak Ventures LLC v. Samsara Inc.

Key Events: Complaint

I. Executive Summary and Procedural Information

  • Parties & Counsel: Monument Peak Ventures LLC (Plaintiff) v. Samsara Inc. (Defendant)
  • Case Identification: 1:23-cv-05687, N.D. Ga., 12/11/2023
  • Venue Allegations: Plaintiff alleges venue is proper because Defendant Samsara Inc. maintains an established place of business in the district, conducts business in the district, and has committed the alleged acts of infringement in the district.
  • Core Dispute: Plaintiff alleges that Defendant’s AI-powered fleet and worksite monitoring platform infringes three patents, originally developed by Eastman Kodak, that are directed to analyzing and managing video based on affective, contextual, and event-based information.
  • Technical Context: The technology at issue involves AI-driven analysis of video and sensor data to monitor driver behavior and worksite safety, a critical and growing market for vehicle fleet management, logistics, and industrial operations.
  • Key Procedural History: The asserted patents originate from Eastman Kodak Company. Plaintiff states it has licensed the Kodak portfolio to over thirty other companies. The complaint does not mention any prior litigation or administrative challenges involving the asserted patents.

Case Timeline

Date Event
2002-11-25 '684 Patent Priority Date
2007-06-19 '684 Patent Issue Date
2008-12-05 '311 Patent Priority Date
2010-10-27 '784 Patent Priority Date
2011-09-20 '311 Patent Issue Date
2014-09-16 '784 Patent Issue Date
2015-01-01 Samsara Inc. incorporated
2023-12-11 Complaint Filing Date

II. Technology and Patent(s)-in-Suit Analysis

U.S. Patent No. 7,233,684 - Imaging Method and System Using Affective Information, Issued June 19, 2007

The Invention Explained

  • Problem Addressed: The patent describes the increasing difficulty for users to organize and catalog large volumes of digital images. It notes that traditional categorization methods (e.g., by date or location) fail to help users locate images of "particular importance or value," which are often tied to the user's emotional state at the time of capture (’684 Patent, col. 2:6-21).
  • The Patented Solution: The invention proposes a system that captures an image of a scene while simultaneously collecting "affective information" from the user, such as physiological or behavioral reactions. This affective data is then interpreted to determine a "relative degree of importance" for the image, which is associated with the image file as metadata to facilitate later organization and retrieval (’684 Patent, Abstract; col. 5:4-14); a minimal sketch of this flow appears after this list.
  • Technical Importance: This technology sought to address the "image overload" problem by automating the process of tagging images based on a user's subjective, emotional response, a concept that precedes modern AI-driven sentiment analysis and content-based filtering systems (’684 Patent, col. 2:22-34).
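
For illustration only, the claimed capture-interpret-associate sequence can be rendered as a short Python sketch. Every name below (the camera and biosensor interfaces, the metadata key, the scoring rule in interpret()) is a hypothetical stand-in; nothing is drawn from the patent's embodiments or the accused product.

```python
from dataclasses import dataclass, field

@dataclass
class SceneImage:
    pixels: bytes
    metadata: dict = field(default_factory=dict)

def interpret(signal: dict) -> float:
    # Hypothetical mapping from raw physiological readings (e.g., skin
    # conductance) to a normalized 0-to-1 "relative degree of importance."
    return min(1.0, signal.get("skin_conductance", 0.0) / 10.0)

def capture_with_affective_tag(camera, biosensor) -> SceneImage:
    image = SceneImage(pixels=camera.capture())          # capture an image of a scene
    signal = biosensor.read()                            # collect physiological data at capture
    importance = interpret(signal)                       # interpret it into a relative importance
    image.metadata["affective_importance"] = importance  # associate the result with the image
    return image
```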

Key Claims at a Glance

  • The complaint asserts independent claim 1 (Compl. ¶31).
  • Claim 1 is a method claim with the following essential elements:
    • Capturing an image of a scene;
    • Collecting affective information at capture;
    • Associating the affective information with the scene image;
    • Wherein collecting affective information comprises monitoring the physiology of a user; and
    • Wherein collecting affective information comprises interpreting the collected physiological information to determine the relative degree of importance of the scene image.
  • The complaint does not explicitly reserve the right to assert dependent claims.

U.S. Patent No. 8,024,311 - Identifying Media Assets From Contextual Information, Issued September 20, 2011

The Invention Explained

  • Problem Addressed: The patent addresses the challenge of retrieving specific media assets from large, generally unorganized personal collections, noting that manual annotation by consumers is a "tedious task seldom performed" (’311 Patent, col. 1:21-32).
  • The Patented Solution: The invention describes a method for identifying relevant media assets using contextual information, such as text from a blog post or spoken narration. A key feature is a two-stage filtering process: the system first identifies a broad "superset" of media assets based on an initial piece of contextual information (e.g., a date range), and then refines this to a final "set" of media assets using a second piece of contextual information (e.g., descriptive keywords) that is received later (’311 Patent, Abstract; col. 2:44-50); a minimal sketch of this two-stage flow appears after this list.
  • Technical Importance: The technology outlines an interactive, context-aware search paradigm where a user's ongoing narrative input dynamically refines a media query, a foundational concept for modern intelligent assistants and context-sensitive content management systems (’311 Patent, col. 4:3-15).
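
For illustration only, the two-stage identification can be rendered as a short Python sketch over an in-memory asset list; the field names and sample data are hypothetical.

```python
from datetime import date

def identify_superset(assets, first_context):
    # Stage 1: filter on the first set of contextual information (a date
    # range here) before any second set has been received.
    start, end = first_context["date_range"]
    return [a for a in assets if start <= a["captured"] <= end]

def identify_set(superset, second_context):
    # Stage 2: refine the superset using the later-received second set of
    # contextual information (descriptive keywords here).
    keywords = set(second_context["keywords"])
    return [a for a in superset if keywords & a["tags"]]

assets = [
    {"id": 1, "captured": date(2009, 7, 4), "tags": {"beach", "family"}},
    {"id": 2, "captured": date(2009, 7, 5), "tags": {"fireworks"}},
    {"id": 3, "captured": date(2010, 1, 1), "tags": {"beach"}},
]
superset = identify_superset(
    assets, {"date_range": (date(2009, 7, 1), date(2009, 7, 31))})  # assets 1 and 2
final = identify_set(superset, {"keywords": ["beach"]})             # asset 1 only
```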

Key Claims at a Glance

  • The complaint asserts independent claim 1 (Compl. ¶54).
  • Claim 1 is a method claim with the following essential elements:
    • Receiving contextual information, comprising a first set and a second set, where the second is received after the first;
    • Identifying a chosen event based on analysis of the contextual information;
    • Identifying a set of media assets based on the event, which comprises:
      • Identifying a superset of media assets based on the first set of contextual information (before the second set is received); and
      • Identifying the final set of media assets from the superset based on the second set of contextual information.
    • Associating some of the contextual information with the event or an asset.
  • The complaint does not explicitly reserve the right to assert dependent claims.

Multi-Patent Capsule

  • Patent Identification: U.S. Patent No. 8,836,784, Automotive Imaging System for Recording Exception Events, Issued September 16, 2014 (Compl. ¶¶78, 80).
  • Technology Synopsis: The patent describes an automotive imaging system designed to record and report "exception events" like accidents or theft. The system operates in a low-frequency monitoring mode to conserve memory but, upon detecting an exception event via sensors, it captures and stores relevant video and can transmit the data to a central service (’784 Patent, Abstract); a sketch of this monitoring loop appears after this capsule.
  • Asserted Claims: The complaint asserts independent claim 25 (Compl. ¶81).
  • Accused Features: The complaint accuses Samsara's AI-powered Dash Cam system, which uses features like "Rolling Stop Detection" and unsafe following distance alerts as triggers ("exception events") for recording, analyzing, and storing video clips (Compl. ¶¶84, 85, 87).
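
For illustration only, the low-frequency monitoring loop described in the synopsis might look like the following Python sketch; the camera, sensor, and uplink interfaces, the buffer length, and the post-event capture window are all hypothetical.

```python
from collections import deque

BUFFER_SECONDS = 30  # hypothetical ring-buffer depth

def monitor(camera, sensors, uplink, fps=1):
    # Low-frequency monitoring mode: a bounded ring buffer conserves memory
    # by silently discarding old frames until an exception event occurs.
    buffer = deque(maxlen=BUFFER_SECONDS * fps)
    while True:
        buffer.append(camera.grab_frame())
        if sensors.exception_event():  # e.g., impact, hard braking, theft alarm
            # Capture and store the relevant video, then report it.
            clip = list(buffer) + camera.record(seconds=10)
            store_locally(clip)
            uplink.transmit(clip)  # send the event data to a central service

def store_locally(clip):
    """Persist the captured clip to non-volatile storage (stub)."""
```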

III. The Accused Instrumentality

Product Identification

  • The complaint names Samsara's "Connected Operations Cloud" system, including its "Video-Based Safety," "Site Visibility," and "Samsara Sites App" components (Compl. ¶¶5, 21, 57).

Functionality and Market Context

  • The accused system is an AI-powered platform for monitoring physical operations, primarily for vehicle fleets and worksites (Compl. ¶¶4, 21). A core feature is the use of in-vehicle, AI-powered cameras to monitor driver behavior in real time (Compl. ¶34). The system detects safety events such as "distracted driving, tailgating, and unsafe driving habits" and automatically creates labeled video records for review in a "Safety Inbox" (Compl. ¶¶24, 25). One visual provided in the complaint shows a "Safety Event Review Rubric" interface where events like "Mobile Usage" are archived for coaching (Compl. p. 7). A schematic sketch of this detect-label-queue workflow appears after this list.
  • The "Samsara Sites App" allows users to search for video footage from cameras at a worksite (Compl. ¶57). It provides a user interface with successive filters for date, time, presence of a person, and clothing color to narrow search results (Compl. ¶¶58, 59).
  • The complaint alleges the platform is a key part of Samsara's business, enabling "digital transformation on a massive scale" for its customers (Compl. ¶4).
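
For orientation, the complaint's description of the detect-label-queue workflow could be schematized as follows; this is emphatically not Samsara's code, and every interface and field name is a hypothetical stand-in.

```python
from dataclasses import dataclass

@dataclass
class SafetyEvent:
    label: str     # e.g., "Mobile Usage", "Tailgating"
    severity: str  # e.g., "Low", "Medium", "High"
    clip: bytes    # the associated video record

def review_pipeline(detector, camera, safety_inbox):
    frame = camera.grab_frame()
    detection = detector.classify(frame)  # AI model flags an unsafe behavior
    if detection is not None:
        event = SafetyEvent(label=detection.label,
                            severity=detection.severity,
                            clip=camera.recent_clip())
        safety_inbox.append(event)  # queued in the "Safety Inbox" for coaching review
```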

IV. Analysis of Infringement Allegations

U.S. Patent No. 7,233,684 Infringement Allegations

The complaint maps the elements of independent claim 1 to the accused functionality as follows (complaint and patent citations in parentheses):

  • "An imaging method comprising the steps of capturing an image of a scene": The system uses an inward-facing cabin camera that captures images of the driver, which the complaint defines as an "image of a scene." The complaint includes a visual of the in-cabin camera capturing the driver. (Compl. ¶35; ’684 Patent, col. 5:4)
  • "collecting affective information at capture": The system collects information such as driver head position and activity at the time of capture. (Compl. ¶36; ’684 Patent, col. 5:6)
  • "and associating the affective information with the scene image": The system associates the driver's head activity with the captured images of the driver. (Compl. ¶38; ’684 Patent, col. 5:7-8)
  • "wherein the step of collecting affective information comprises monitoring the physiology of a user...": The system is alleged to monitor driver physiology and attentiveness by analyzing the driver's head position and activity in real time. (Compl. ¶¶36, 38; ’684 Patent, col. 5:49-53)
  • "and, wherein the step of collecting affective information comprises the steps of interpreting the collected physiological information to determine the relative degree of importance of the scene image": The system interprets the collected information to trigger alerts and categorizes events by "Severity" (e.g., Low, Medium, High), which the complaint alleges indicates a "relative degree of importance" of the image. (Compl. ¶¶39, 41; ’684 Patent, col. 5:9-11)
  • Identified Points of Contention:
    • Scope Questions: A primary question will be whether monitoring "driver head position and activity" falls within the scope of "monitoring the physiology of a user" as contemplated by the patent. The defense may argue that "physiology" in the patent refers to biometric signals indicative of emotion or interest (e.g., galvanic skin response (GSR), heart rate), not gross physical movements like head orientation.
    • Technical Questions: Does the accused system's function of detecting and classifying a "distracted driving" event perform the claimed step of determining the "relative degree of importance of the scene image"? The analysis may focus on whether the system is evaluating the image itself or merely categorizing the driver's actions depicted within it.

U.S. Patent No. 8,024,311 Infringement Allegations

The complaint maps the elements of independent claim 1 to the accused functionality as follows (complaint and patent citations in parentheses):

  • "receiving, by the data processing system, the contextual information, wherein the received contextual information comprises a first set of contextual information and a second set of contextual information, the second set being received after the first set": The Samsara Sites App receives a "first set" of information via date and time range filters, and subsequently receives a "second set" via filters for a person's presence or clothing color. The complaint provides a screenshot of the date filter interface. (Compl. ¶¶58, 59; ’311 Patent, col. 2:27-29)
  • "identifying a chosen event based at least upon an analysis of the contextual information": The system identifies an "event" that satisfies the combined filter criteria selected by the user. (Compl. ¶60; ’311 Patent, col. 2:30-32)
  • "identifying a superset of media assets... based at least upon an analysis of the first set of contextual information at a time when the second set... has not yet been received...": The complaint alleges that after a user applies the date/time filters (the "first set"), clicking the "Show events" button identifies a "superset" of recordings before any further filters (the "second set") are applied. (Compl. ¶63; ’311 Patent, col. 2:44-48)
  • "and identifying the set of media assets from the superset... based at least upon an analysis of the second set of contextual information": After the initial results are shown, the system identifies a refined set of recordings based on the additional filters for person detection or clothing color. (Compl. ¶65; ’311 Patent, col. 2:48-50)
  • "associating, in a processor-accessible memory system, at least some of the contextual information with the chosen event, or at least one asset in the set of media assets...": The system displays the final video clips with associated filter tags, which the complaint alleges constitutes associating the filter information (contextual information) with the media assets. (Compl. ¶68; ’311 Patent, col. 2:51-56)
  • Identified Points of Contention:
    • Scope Questions: The case may turn on whether a sequence of user clicks on graphical filter buttons in an app constitutes "receiving contextual information" in the manner described by the patent, which provides examples like analyzing a user's blog post or story.
    • Technical Questions: What evidence demonstrates that the accused app first identifies a "superset" of assets and then refines it? The defense could argue the system simply performs a single, cumulative database query using all applied filters, rather than the distinct two-step "superset-then-set" process required by the claim. The exact function of the "Show events" button will be a key factual issue (Compl. ¶64); the sketch below illustrates the distinction.
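
To make that distinction concrete, the following Python sketch contrasts the two query models with hypothetical filter predicates; it does not purport to describe how the accused app actually works.

```python
def one_step_cumulative(assets, filters):
    # One query applying every filter at once; no intermediate superset.
    return [a for a in assets if all(f(a) for f in filters)]

def two_step_superset_then_set(assets, first_filters, second_filters):
    # The claimed sequence: a superset from the first set of contextual
    # information, then a refined set from the later-received second set.
    superset = [a for a in assets if all(f(a) for f in first_filters)]
    return [a for a in superset if all(f(a) for f in second_filters)]
```

Both functions can return identical results, which is precisely why the dispute turns on the internal sequence of operations rather than on the output.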

V. Key Claim Terms for Construction

For U.S. Patent No. 7,233,684:

  • The Term: "affective information"
  • Context and Importance: This term is the core of the invention. The infringement case hinges on whether the driver data collected by Samsara (e.g., head position) qualifies as "affective information." Practitioners may focus on this term because the patent's examples seem to imply an emotional or interest-based component that may not be present in the accused system's functionality.
  • Intrinsic Evidence for Interpretation:
    • Evidence for a Broader Interpretation: The specification provides a broad definition: "Information that represents user's psychological, physiological, and behavioral reactions to a particular scene" (’684 Patent, col. 5:41-44). This language could be argued to encompass head movement as a "behavioral reaction."
    • Evidence for a Narrower Interpretation: The patent’s title ("...Using Affective Information"), abstract ("characterize the emotional state"), and specific examples focus on emotional states like "happiness, sadness, fear, anger," "preference," and "interest" (’684 Patent, Abstract; col. 5:49-50). This could support a narrower construction limited to data indicative of a user's feelings or emotional state.

For U.S. Patent No. 8,024,311:

  • The Term: "contextual information"
  • Context and Importance: The viability of the infringement claim depends on construing this term to include a user's selections in a graphical user interface. Practitioners may focus on this term because the patent's specification repeatedly uses narrative-based examples like blogs and stories, suggesting a potential mismatch with the accused product's structured, non-narrative input method.
  • Intrinsic Evidence for Interpretation:
    • Evidence for a Broader Interpretation: The patent defines the term to include "gesture data (mouse clicks, hand motions, etc.)" (’311 Patent, col. 2:62-63). Plaintiff will likely argue that selecting filters in the accused app constitutes a series of "mouse clicks."
    • Evidence for a Narrower Interpretation: The patent's detailed examples focus on deriving meaning from natural language: "the typing or speaking of a blog, an email, or other vehicle for telling of a story" (’311 Patent, col. 2:35-37). The defense may argue this context limits the term to semantic, narrative-based inputs rather than structured GUI filter selections.

VI. Other Allegations

  • Indirect Infringement: For all three patents, the complaint alleges induced infringement. The allegations are based on Samsara providing its "Connected Operations Cloud" system along with "technical assistance and... instructional information" that allegedly direct customers to use the system in an infringing manner (Compl. ¶¶44-45, 71-72, 93-94).
  • Willful Infringement: The complaint alleges willful infringement for all three asserted patents, based on knowledge obtained "at least by service of this Complaint" (Compl. ¶¶43, 70, 92). The prayer for relief requests a finding of willfulness and enhanced damages (Compl. p. 37).

VII. Analyst’s Conclusion: Key Questions for the Case

  • A core issue will be one of definitional scope: can terms like "affective information" and "contextual information," which the patents describe in the context of human emotion and natural language narrative, be construed broadly enough to cover the objective, event-driven data (e.g., driver head position, GUI filter clicks) collected by Samsara's accused platform?
  • A key evidentiary question will be one of functional equivalence: does the accused technology perform the specific functions required by the claims? For the '684 patent, this means determining if classifying a "distracted driving" event is the same as determining the "relative degree of importance of the scene image." For the '311 patent, it means establishing whether the app's filtering mechanism executes the specific two-step "superset-then-set" identification process or a functionally different, one-step cumulative query.