Open@VT

Open Access, Open Data, and Open Educational Resources

Category Archives: Intellectual Property

Introducing the Virginia Tech Patents collection in VTechWorks and the patent harvesting software repository, Patent-Harvest

Authors: Philippe Gray and Anne Lawrence

Inspired by the Association of Southeastern Regional Libraries webinar, “Adding Patent Records to Clemson’s IR — Highlighting the University’s Output,” VTechWorks, Virginia Tech’s institutional repository, now offers a similar collection, Virginia Tech Patents. The collection contains 645 U.S. Patents assigned to Virginia Tech at the time of patent application. The dates of issuance span 1919-2016. The collection’s display is customized with fields, search filters, and facets particular to patents, such as patent type, inventor, assignee, patent and application numbers, and patent classifications. Our motivation for creating the collection was that a sizeable collection of useful public domain content could be harvested programmatically and that it provides an opportunity to spotlight how Virginia Tech “invents the future.”

To enable other repositories to develop a similar collection, we offer our software, Patent-Harvest, in a GitHub repository. Patent-Harvest contains a Java program written to harvest all patents with Virginia Tech as the assignee. It can be adapted to harvest patents and associated files for other organizations or search parameters.

The harvesting program uses the PatentsView API to retrieve relevant metadata for all Virginia Tech patents and outputs a CSV spreadsheet. If desired, all the corresponding files for each patent are also downloaded and logically renamed. Since most United States patent documents are image-only PDFs, a script is included that uses optical character recognition to read text content and embed it in the patent documents. This makes the text of the patent documents searchable, but doesn’t change how they appear to the reader.

Intellectual Property Strategy by John Palfrey

Intellectual Property Strategy (Update May 14, 2018: This book is now available in an open access edition with additional material.)

John Palfrey’s Intellectual Property Strategy (MIT Press, 2012) is the first book I’ve read on the subject. Having read one of his previous books, Born Digital, and because it is in the same book series as Peter Suber’s Open Access, I suspected openness would be a theme, and I wasn’t disappointed. This review is mostly about that theme, rather than all aspects of the book, so keep that in mind. Palfrey is a well qualified writer on this subject, having taught law at Harvard, practiced intellectual property (IP) law, cofounded several tech startups, and is a venture executive. The book is aimed at CEOs and senior managers, and is short enough that it might be finished on a cross-country flight.

The four areas of IP are patent, copyright, trademark, and trade secret. Palfrey first addresses the prevailing “sword and shield” IP strategy by pointing out that it benefits lawyers more than organizations. He urges readers to “give special consideration to strategies of openness rather than exclusion, especially in the information context” (p. 3).

IP is a nonrival good- more than one party can use it simultaneously. The author points out that IP often gains in value the more that it is used, which is a flaw in the “full exclusion” approach. Palfrey is quite familiar with universities and libraries, and interestingly uses MIT’s OpenCourseWare as an example of using openness to increase assets. However, it’s important to establish ownership rights in order to give IP away (p. 56):

It may seem counterintuitive, but even the strategies of openness that I urge you to consider need to be grounded first in the system of rights in order to work smoothly.

Palfrey spends some time talking about open innovation, that is, using openly available or customer-generated information. For example, Zillow as well as legal publishers Lexis and Westlaw thrive in this environment. He cites a study (PDF) showing that the fair use economy in the U.S. supports hundreds of billions in exports, employs millions, and is growing by 5% annually. Palfrey warns that zealous protectionism can backfire, such as demanding royalties for using the song “Happy Birthday” (a demand that now appears fraudulent rather than protectionist).

Nonprofits as a special case are examined in Chapter 7. The differing missions of for-profits and nonprofits “opens up new possibilities” and can make IP strategy more important. Using libraries as an example, Palfrey suggests digitization in collaboration with for-profit partners, with a limited term of exclusivity during which the library receives royalties. Summarizing, he says (p. 120):

If the default in the for-profit world is to generate maximum revenues from the licensing of intellectual property, the default in the non-profit setting is probably to make intellectual property as broadly available as possible.

There are a few stumbles along the way- Palfrey occasionally uses the term “open access” in a confusingly loose way (p. 89, 105) despite discussing it accurately elsewhere (p. 118), and offers Google Wave (p. 68) as an example of open innovation (oops!). And he suggests that universities license IP in a nonexclusive way (p. 119), lowering fees for greater societal benefit (perhaps I’m too cynical, but I don’t see this happening).

I recommend this book as an introduction to IP in general- it’s a quick and informative read. Intellectual Property Strategy is available in Newman Library, and Palfrey’s book talk is below (beginning at 7:00).