DCT
1:18-cv-00549
Rensselaer Polytechnic Institute v. Amazon.com Inc
I. Executive Summary and Procedural Information
- Parties & Counsel:- Plaintiff: Rensselaer Polytechnic Institute (New York) and CF Dynamic Advances LLC (Delaware)
- Defendant: Amazon.com, Inc. (Delaware)
- Plaintiff’s Counsel: Robins Kaplan LLP
 
- Case Identification: 1:18-cv-00549, N.D.N.Y., 05/08/2018
- Venue Allegations: Plaintiffs allege venue is proper in the Northern District of New York based on Defendant’s transaction of business, commission of infringing acts, and maintenance of a regular and established place of business in the district, including physical "Amazon Pickup Locations" and the employment of local personnel.
- Core Dispute: Plaintiffs allege that Defendant’s Alexa Voice Service and associated hardware products infringe a patent related to methods for processing natural language user queries.
- Technical Context: The lawsuit concerns the foundational technology of natural language processing (NLP), which enable voice-activated digital assistants to understand and respond to conversational user commands.
- Key Procedural History: The complaint notes that the patent-in-suit was previously asserted against Apple Inc. in the same court, a case that was resolved through settlement in 2016. This history may be relevant to claim construction, as the patent’s terms may have been previously analyzed by the parties or the court.
Case Timeline
| Date | Event | 
|---|---|
| 2000-05-19 | '798 Patent Priority Date | 
| 2007-02-13 | U.S. Patent No. 7,177,798 Issues | 
| 2013-06-03 | Prior litigation against Apple Inc. filed | 
| 2016-05-02 | Prior litigation against Apple Inc. settled | 
| 2017-02-01 | Alleged date of Amazon's earliest knowledge of the patent | 
| 2017-03-10 | Amazon allegedly receives a claim chart for the '798 Patent | 
| 2018-05-08 | Complaint Filed | 
II. Technology and Patent(s)-in-Suit Analysis
U.S. Patent No. 7,177,798 - “Natural Language Interface Using Constrained Intermediate Dictionary of Results”
- Patent Identification: U.S. Patent No. 7177798, “Natural Language Interface Using Constrained Intermediate Dictionary of Results,” issued February 13, 2007.
The Invention Explained
- Problem Addressed: The patent describes prior art natural language interfaces as being impractical for broad applications. They either placed severe restrictions on user syntax or required "exceedingly large collections of linguistic terms" and complex keyword databases, yet still could not guarantee successful interpretation of user queries (’798 Patent, col. 1:41-45, col. 2:21-30).
- The Patented Solution: Instead of relying on traditional linguistics, the invention proposes a system that interprets queries by using a structured "reference dictionary" built upon the database's own metadata. This dictionary contains four integrated layers of information: "case information" (past queries), "keywords", "information models" (the database schema), and "database values" (’798 Patent, Abstract; col. 7:51-54). The system identifies possible interpretations by finding combinations of these objects that match a user's query and uses a search algorithm to determine the optimal result (’798 Patent, col. 2:31-39).
- Technical Importance: The technology aimed to make natural language interfaces more robust and efficient by grounding query interpretation in the known, finite structure of a database's metadata rather than attempting to model the near-infinite complexity of human language (’798 Patent, col. 5:19-35).
Key Claims at a Glance
- The complaint explicitly alleges infringement of at least Claim 1 and its narrative infringement theory closely tracks the elements of independent Claim 9 (Compl. ¶55, ¶64).
- Independent Claim 9:- A computer-implemented method for processing a natural language input comprising:
- receiving a natural language input;
- providing from said natural language input a plurality of language-based database objects;
- identifying a finite number of permutations of the plurality of database objects, the database objects being stored in a metadata database comprising at least one of a group of information comprising case information, keywords, information models, and database values; and
- interpreting at least one of the permutations to provide determination of a result of the natural language input.
 
- The complaint does not explicitly reserve the right to assert dependent claims.
III. The Accused Instrumentality
Product Identification
- The complaint names Defendant’s "Alexa Voice Software and Alexa enabled devices," which include the Echo product line, Fire TV accessories, and Fire Tablets, among others (Compl. ¶2, Appendix A). The core accused functionality resides in the "Alexa Voice Service," the cloud-based system that processes the queries (Compl. ¶53).
Functionality and Market Context
- The accused products provide a voice-based command system that allows users to ask questions or issue commands in natural speech (Compl. ¶55). The complaint alleges that the Alexa service processes these commands by interpreting the input's meaning, accessing information, and performing tasks in response (Compl. ¶55).
- The system is advertised as "always getting smarter" and adapting to user speech patterns and preferences, which Plaintiffs allege involves searching databases based on historical information and prior processing (Compl. ¶55, ¶63). The complaint includes a diagram from an Amazon video illustrating that the Alexa Voice Service comprises Automatic Speech Recognition (ASR), Natural Language Understanding (NLU), and Text-To-Speech (TTS) components. (Compl. p. 20, ¶62).
IV. Analysis of Infringement Allegations
'798 Patent Infringement Allegations
| Claim Element (from Independent Claim 9) | Alleged Infringing Functionality | Complaint Citation | Patent Citation | 
|---|---|---|---|
| receiving a natural language input; | The Alexa service receives voice commands from users via Alexa-enabled devices. | ¶55, ¶64 | col. 37:18 | 
| providing from said natural language input a plurality of language-based database objects; | After recognizing speech, Alexa "determines what the user wants" by identifying relevant "database objects." The complaint cites Amazon developer documentation stating that skill categories contain "a set of relevant entity types." | ¶16, ¶64 | col. 37:19-20 | 
| identifying a finite number of permutations of the plurality of database objects, the database objects being stored in a metadata database comprising at least one of a group of information comprising case information, keywords, information models, and database values; | The Alexa service allegedly searches databases using a combination of historical/prior processing information (case information), keywords, organized concepts/information (information models), and user/context data (database values), and determines a "plurality of combinations" to generate a response. | ¶16, ¶55, ¶64 | col. 37:21-27 | 
| and interpreting at least one of the permutations to provide determination of a result of the natural language input. | The Alexa service provides a verbal and/or visual response to the user based on the determined combination of database objects. | ¶16, ¶64 | col. 37:28-29 | 
Identified Points of Contention
- Scope Questions: A central issue may be whether the architecture of the Alexa service, which likely relies heavily on modern machine-learning models, includes a "metadata database" containing the four distinct information types ("case information", "keywords", "information models", "database values") as required by the claim. The defense may argue that its system is a fundamentally different, holistic model rather than the structured, multi-part database described in the patent.
- Technical Questions: The complaint alleges Alexa determines a "plurality of combinations," mapping this to the claim's requirement of "identifying a finite number of permutations." A technical dispute may arise over whether the probabilistic interpretation methods used by modern NLP systems are equivalent to the more deterministic, combinatorial "branch and bound search" of "permutations" described in the patent's specification (’798 Patent, col. 22:17-32).
V. Key Claim Terms for Construction
The Term: "metadata database"
- Context and Importance: This term is the central component of the claimed invention. The entire inventive concept rests on using this specific type of database to interpret queries. Infringement will hinge on whether Plaintiffs can prove that Amazon's cloud architecture contains a structure that meets this definition.
- Intrinsic Evidence for Interpretation:- Evidence for a Broader Interpretation: The specification describes the metadatabase at a high level as containing "information about enterprise data combined with knowledge of how the data is used," which a plaintiff might argue is a functional description that could read on various modern data architectures (’798 Patent, col. 15:30-33).
- Evidence for a Narrower Interpretation: The patent provides a highly detailed graphical representation of the reference dictionary (FIG. 3) and describes its implementation using specific entity-relationship constructs from the "TSER" methodology (’798 Patent, col. 19:50-57). A defendant could argue this detailed embodiment limits the term to a more rigid, formally structured database.
 
The Term: "permutations"
- Context and Importance: This term defines the mechanism by which the system resolves ambiguity. The dispute will likely focus on whether the term is limited to the explicit, combinatorial process detailed in the patent or if it can encompass the probabilistic outputs of a modern AI.
- Intrinsic Evidence for Interpretation:- Evidence for a Broader Interpretation: The claim language itself is general, and the patent summary also uses the broader term "plurality of combinations," which could support an argument that the claim is not tied to one specific mathematical method (’798 Patent, col. 2:50-51).
- Evidence for a Narrower Interpretation: The detailed description teaches a specific "branch and bound search algorithm" for evaluating interpretations, which implies a structured, systematic exploration of discrete combinations, not a statistical inference (’798 Patent, col. 22:17-50).
 
VI. Other Allegations
- Indirect Infringement: The complaint alleges active inducement based on Defendant’s creation and distribution of advertisements, user manuals, and technical support that instruct and encourage customers to use the Alexa service in an infringing manner (Compl. ¶59). It also alleges contributory infringement, stating that the Alexa Voice Service is a material component specifically made for this infringing use and is not a staple article of commerce suitable for substantial non-infringing use (Compl. ¶61).
- Willful Infringement: The complaint alleges that Defendant had pre-suit knowledge of the '798 Patent since "at least February 2017" and received a specific claim chart on March 10, 2017, but continued its allegedly infringing activities despite an objectively high likelihood of infringement (Compl. ¶56-58).
VII. Analyst’s Conclusion: Key Questions for the Case
- A core issue will be one of "architectural equivalence": Can the specific, four-part "metadata database" structure, which forms the foundation of the '798 patent, be found within Amazon's modern, likely AI-driven, cloud-based Alexa architecture, or is there a fundamental mismatch between the patent's prescribed structure and the accused system?
- A key evidentiary question will be one of "functional operation": Does the Alexa service’s method for resolving a user's query—a process likely based on probabilistic machine learning—perform the claimed step of "identifying a finite number of permutations" of discrete database objects, or does the patent’s description of a deterministic "branch and bound" search algorithm create a functional distinction that precludes a finding of infringement?