Hoping someone here can help with this... COL R of the attached spreadsheet, includes multiple patent claims in each cell. The text of each claim in the cell is preceded by the text “ | “ (1-3 digit(s) for the claim number) “. “ (except in a few instances where the cell begins with "1. " so that it isn't preceded by a " | ".
I need to parse the text of each claim to categorize the claims in a given cell into 3 categories (System, Method, or Composition). I’d like to create the following additional columns based on this data:
a. (COL S) with a count of Method claims, (These claims would include one of the following strings within the first 10 words: "method", "process", and/or “computer readable medium”).
b. (COL T) with a count of System claims, (These claims would include one of the following strings within the first 10 words: “apparatus”, “system”, "structure", "product", "assembly", and/or "construct").
c. (COL U) with a count of Composition of matter claims (These claims would include the following string within the first 10 words: “composition” and/or "material").
d. (COL V) with a list of claims that do not have the syntax for any of the three categories above.
So, for example:
Sample - results of parsing claims.xlsx
Thanks in advance. This is just beyond my excel skills
Bookmarks