PTAB
IPR2024-00659
Databricks Inc v. R2 Solutions LLC
Key Events
Petition
Table of Contents
petition
1. Case Identification
- Case #: IPR2024-00659
- Patent #: 8,190,610
- Filed: March 6, 2024
- Petitioner(s): Databricks, Inc.
- Patent Owner(s): R2 Solutions LLC
- Challenged Claims: 1-46
2. Patent Overview
- Title: MapReduce for distributed database processing
- Brief Description: The ’610 patent discloses an "enhanced" MapReduce programming methodology for processing large, distributed data sets. The purported enhancement involves treating an input data set as a plurality of data groups, where data sets within the same group share a common schema, but data sets in different groups are characterized by different schemas, allegedly allowing for more efficient processing of heterogeneous data sources.
3. Grounds for Unpatentability
Ground 1: Claims 1-46 are obvious over Pike.
- Prior Art Relied Upon: Pike (Patent 7,590,620).
- Core Argument for this Ground:
- Prior Art Mapping: Petitioner asserted that Pike discloses a framework for large-scale, parallel data processing that is functionally identical to conventional MapReduce. Pike’s system processes "input files" that are partitioned into data blocks, provided to worker processes that apply map operators to generate intermediate key-value pairs, and subsequently processed by reduce operators that merge values based on common keys. Petitioner argued that Pike’s disclosure of processing a variety of data types, including "text files" and "tables," inherently teaches processing data groups with different schemas. Furthermore, Pike's "application-specific" map and reduce operators, which are created by programmers for specific use cases, meet the claim limitations for user-configurable functions.
- Motivation to Combine (for §103 grounds): This ground is based on a single reference. The argument is that a person of ordinary skill in the art (POSITA) would have found it obvious to apply Pike's disclosed system to process a combination of its explicitly mentioned input file types (e.g., text files and tables) simultaneously. This direct application would predictably result in the processing of heterogeneous data as claimed, representing a known use-case for such a flexible data processing system.
- Expectation of Success: A POSITA would have a high expectation of success, as Pike’s framework was designed to be general-purpose and handle varied data types. Applying it to different, known data formats would be a routine implementation.
Ground 2: Claims 1-46 are obvious over Pike in view of Chowdhuri.
- Prior Art Relied Upon: Pike (Patent 7,590,620) and Chowdhuri (Application # 2006/0218123).
- Core Argument for this Ground:
- Prior Art Mapping: Petitioner argued that Chowdhuri reinforces and extends Pike’s teachings by disclosing a parallel database query system that processes data from multiple source tables with different schemas, such as "order" and "customer" tables. Chowdhuri’s use of various iterators (scan, hashjoin, GroupBy, sort) was presented as analogous to the map and reduce functions in Pike’s framework. The combination, therefore, explicitly discloses processing data groups ("order" and "customer" tables) with different schemas and a common key ("customer_id") to produce a merged result.
- Motivation to Combine: A POSITA would combine Pike's robust, fault-tolerant MapReduce architecture with Chowdhuri's specific teachings on processing structured database tables. The motivation would be to leverage the scalability and fault tolerance of Pike's system to perform the type of parallel database joins across heterogeneous tables described in Chowdhuri. This represents the application of a general-purpose tool (Pike) to a specific, well-understood problem domain (Chowdhuri), a predictable combination to improve performance.
- Expectation of Success: Success would be expected because both references describe parallel data processing systems, and their underlying operations (mapping/scanning, reducing/joining) are compatible. Integrating Chowdhuri's table-based processing into Pike's more general framework would be a straightforward task for a POSITA.
Ground 3: Claims 3, 4, 9-11, 19, 26, 27, 37-39, and 44-46 are obvious over Pike and Chowdhuri in view of MacLeod.
- Prior Art Relied Upon: Pike (Patent 7,590,620), Chowdhuri (Application # 2006/0218123), and MacLeod (Patent 6,343,295).
- Core Argument for this Ground:
- Prior Art Mapping: This ground addresses claims requiring metadata that is identifiable to a specific data group. Petitioner argued MacLeod discloses a system for "tracking the lineage of data in a database" using a "lineage identifier." This identifier is attached to data as it moves through the system, allowing its origin to be traced. Petitioner asserted this directly teaches the use of metadata to track data back to its source data group, as required by the challenged claims.
- Motivation to Combine: A POSITA would be motivated to incorporate MacLeod's data lineage tracking into the combined Pike/Chowdhuri system to enhance data management and processing efficiency. For example, tracking the origin of intermediate data would allow for post-processing operations, such as selectively excluding data from a particular source table before the final reduce step without having to re-run the entire map phase.
- Expectation of Success: A POSITA would have a high expectation of success in adding a known data tracking technique (MacLeod) to a data processing system (Pike/Chowdhuri) to gain better control and efficiency, which is a predictable and well-understood engineering improvement.
4. Key Claim Construction Positions
- Petitioner noted that in a related district court case, two key terms were construed, and it applied these constructions for its arguments.
- "data group": Construed as "a group of data and a mechanism for identifying data from that group." Petitioner argued that filenames in Pike and table names in Chowdhuri meet this construction.
- "a plurality of mapping functions that are each user-configurable": Construed as "two or more mapping functions that are each configurable by a user." Petitioner argued Pike’s disclosure of "application-specific" operators that programmers create meets this construction.
5. Arguments Regarding Discretionary Denial
- Petitioner’s filing is substantively identical to a petition filed in a related inter partes review, IPR2024-00303, and was filed concurrently with a motion for joinder to that proceeding. This action is intended to promote administrative efficiency and avoid duplicative efforts by the Patent Trial and Appeal Board and the parties, weighing against discretionary denial.
6. Relief Requested
- Petitioner requests institution of an inter partes review and cancellation of claims 1-46 of the ’610 patent as unpatentable.
Analysis metadata