60 bytes added , 15:43, 2 September 2020

|Has paper status=Working paper

}}

==Latest Progress==

The PDF-to-text converter, the key terms finder, and the phrase extraction scripts have all been tested and work well. Christy Warden is working on the Google Scholar web crawler; its current status is unknown.

192 text versions of papers have been assembled in "Candidate Papers by LB." These were used as a test set for developing an analysis protocol for the entire paper. The protocol takes the counts of each term by topic in each paper and uses them to assign each paper 1-4 topics. It also records, for each paper, the number of key terms (as a measure of relevance), the number of modern terms, the year of the paper, the number of cost terms, the area of the paper, and the mentions of particular authors. All of this analysis is then dumped into artifacts. The analysis protocol is currently hardcoded. Once a set list of terms has been established, it should be possible to develop a soft-coded version (in progress).
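The topic-assignment step described above can be illustrated with a minimal Python sketch. It is not the project's actual script: the topic names, the term lists in `TOPIC_TERMS`, the `min_count` threshold, and the function names are all hypothetical placeholders, since the real term lists have not yet been fixed. The sketch counts whole-word, case-insensitive matches of each term, sums them by topic, and keeps up to four topics with the highest counts. Note that, as written, a paper containing none of the terms gets no topic at all.

```python
import re

# Hypothetical topic term lists (assumption): the project's real lists
# are not given on this page and would replace these placeholders.
TOPIC_TERMS = {
    "hold-up": ["hold-up", "holdup", "royalty stacking"],
    "cross-licensing": ["cross-licensing", "patent pool"],
    "litigation": ["litigation", "injunction"],
}


def count_term(text, term):
    """Count case-insensitive, whole-word occurrences of a term in the text."""
    pattern = r"\b" + re.escape(term) + r"\b"
    return len(re.findall(pattern, text, flags=re.IGNORECASE))


def topic_counts(text):
    """Sum the term counts for each topic in one paper's text."""
    return {
        topic: sum(count_term(text, term) for term in terms)
        for topic, terms in TOPIC_TERMS.items()
    }


def assign_topics(text, max_topics=4, min_count=1):
    """Return up to max_topics topics, ordered by descending term count.

    Ties are broken alphabetically so the result is deterministic.
    Topics with fewer than min_count matches are dropped (assumed rule).
    """
    ranked = sorted(topic_counts(text).items(), key=lambda kv: (-kv[1], kv[0]))
    return [topic for topic, n in ranked if n >= min_count][:max_topics]


sample = "Royalty stacking and hold-up problems lead to litigation. Hold-up again."
print(topic_counts(sample))
print(assign_topics(sample))
```

Keeping the term lists in a single dictionary, rather than inside the counting logic, is one way the hardcoded protocol could later be soft-coded: the lists could be loaded from a file without changing the functions.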

A larger test set is desired, both for analysis and for developing the soft-coded version.

==The Paper==

The latest version is:

\coauthoredprojects\Egan and Teece\McN-PatentThicket-Egan-092215.pdf

This file is posted at http://www.bakerinstitute.org/research/untangling-patent-thicket-literature

<pdf>File:McN-PatentThicket-Egan-092215.pdf</pdf>

==Codification==


Untangling the Economics of Patent Thickets

Revision as of 15:43, 2 September 2020
