Changes

Jump to navigation Jump to search
1,919 bytes added ,  20:03, 12 November 2020
no edit summary
{{Project
|Has project output=Tool
|Has image=Whois Parser
|Has title=
|Has owner=Kunal Shah
|Has start date=
|Has deadline=
|Has keywords=Tool
|Has sponsor=McNair Center
|Has notes=
|Has project status=Complete
|Is dependent on=
|Depends upon it=
|Does subsume=
}}
 
==Current Notes==
 
Note: WHOIS is not an acronym but should be capitalized. It isn't here for legacy reasons.
{{Colored box|title=NOTICE|content=This page has a naming issue. Alternative versions are available at [[WhoIsParser.pl]], [[WhoIs Parser]], [[WhoisParser]], and [[Whois Parser]]}}
 
The latest version of the script (based on v2 from Kunal) is in:
E:\tools\WhoisParser\WhoisParser.pl
 
Packages were updated on Father using PPM (as admin):
*Net::WhoisNG
*Net::Whois::Parser
 
Packages were installed on Mother:
*cpanm Date::Manip
*cpanm Net::WhoisNG --force
*cpanm Net::Whois --force
*cpanm Net::Whois::Parser
 
It was run on Father as:
perl WhoisParser.pl -file="DistinctIncubatorDomains.txt" -outfile="IncubatorWhois.txt"
 
Note that the Date::Manip functions were commented out in the version on Father, and that line 174 had a map to <nowiki>''</nowiki> added in the join as most records have nulls for most fields.
 
==2016 Version==
 
[[Internal Classification::Internal Resources| ]]
This wiki page is under Additional Links/WhoisParser
The whoisParser was written by Kunal Shah on March 20, 2016 and is located in the Bulk Server under repository: Web_Crawler branch: shoeb_patch/whoisParser directory: /WhoIsParser file: whoisParser.pl Location: E:\McNair\Projects/\Houston/\WhoIsParser
To use this parser, move input file copy above perl program into the WhoIsParser a directory, make it current working directory (that is, use 'cd' command if needed) and run the following command on . The directory should also have the terminal:input file(see below).
perl WhoIsParser.pl -file=listofurls.txt -outfile=listofurls_processed.txt
=== NAME ===
WhoIs Parser - Retrieves and parses Whois information
corresponding columns with information from the WhoIs API.
=== SYNOPSIS ===
perl whoisParser -file=<file> [-outfile=<file>]
=== OPTIONS ===
-file=<file>: Name of file of domain names.
http://www.alpheus.net 2003-03-27T23:14:33Z 27-mar-2018 2016-03-28T11:22:05Z Alpheus Firstcall 1301 Fannin St.20th Floor Houston 77002 US 1301 Fannin St.20th Floor Houston 77002 US
 
==Summer 2018 Work==
 
I used this parser after running my Google URL finder as detailed on http://mcnair.bakerinstitute.org/wiki/U.S._Seed_Accelerators#Finding_Company_URLs.
 
Type this in the command line:
perl whoisParser_v2.pl -file="inputfile" -outfile="outputfile"
 
Associated files can be found in:
E:\McNair\Projects\Accelerators\Summer 2018\url finder
 
Input file is allURLS.txt and output file is whoisresults.txt

Navigation menu