Listing Page Plugin Spec
| Listing Page Plugin Spec | |
|---|---|
| Project Information | |
| Has title | Listing Page Plugin Spec |
| Has owner | Rex Bone |
| Has start date | |
| Has deadline date | |
| Has project status | Active |
| Copyright © 2019 edegan.com. All Rights Reserved. | |
Plugin Overview
Faced with the problem of no standardization across incubator and accelerator websites, there is a design feasibility question concerning automating the extraction of information. A browser plugin with user guidance could serve as a fundamental first step towards total mechanization of the process. See LP_Extractor_Protocol for a comprehensive introduction to potential methods.
The focus of this design is to create a tool which allows for the quick identification of HTML markings on a webpage and subsequent reduction to a DSL for useful data extraction. Multiple options will be considered, including allowing the user to visually 'draw' a grid, either via dragging or marking vertices, and mouse-over. Attention will be given to potentially viable technical resources as well as usability.
Current List of sites to examine: Media:File:Accelerator List.xlsx
Sample Webpage:
Technical Specifications
User Input Styles
- Drag + Drop
- Marking Vertices
- Mouse-Over
Browser Considerations
- Firefox
- Chrome
- Version Control
Language Considerations
Current Problems
- "Infinite Scroll" webpages: Potentially impossible to account for incubator websites which display company lists in an infinite scroll style. Would require multiple instances of user input.