1,859 bytes added ,  20:03, 12 November 2020
{{Project|Has project output=Tool|Has image=Whois Parser|Has title=|Has owner=Kunal Shah|Has start date=|Has deadline=|Has keywords=Tool|Has sponsor=McNair Center|Has notes=|Has project status=Complete|Is dependent on=|Depends upon it=|Does subsume=}}

==Current Notes==

Note: WHOIS is not an acronym but should be capitalized. It isn't here for legacy reasons.

{{Colored box|title=NOTICE|content=This page has a naming issue. Alternative versions are available at [[WhoIsParser.pl]], [[WhoIs Parser]], [[WhoisParser]], and [[Category:InternalWhois Parser]]}}

The latest version of the script (based on v2 from Kunal) is in:
E:\tools\WhoisParser\WhoisParser.pl

Packages were updated on Father using PPM (as admin):
*Net::WhoisNG
*Net::Whois::Parser

Packages were installed on Mother:
*cpanm Date::Manip
*cpanm Net::WhoisNG --force
*cpanm Net::Whois --force
*cpanm Net::Whois::Parser

It was run on Father as:
perl WhoisParser.pl -file="DistinctIncubatorDomains.txt" -outfile="IncubatorWhois.txt"

Note that the Date::Manip functions were commented out in the version on Father, and that line 174 had a map to <nowiki>''</nowiki> added in the join as most records have nulls for most fields.

==2016 Version==
[[Internal Classification::LegacyInternal Resources| ]]
This wiki page is under Additional Links/WhoisParser
The whoisParser was written by Kunal Shah on March 20, 2016 and is located in the Bulk Server under:
*repository: Web_Crawler
*branch: shoeb_patch/whoisParser
*directory: /WhoIsParser
*file: whoisParser.pl
Location: E:\McNair\Projects\Houston\WhoIsParser
To use this parser, copy the Perl program above into a directory that also contains the input file (see below), make that directory the current working directory (use the 'cd' command if needed), and run the following command in the terminal:
perl WhoIsParser.pl -file=listofurls.txt -outfile=listofurls_processed.txt
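The run described above can also be scripted. The following is a minimal Python sketch, not part of the original tool: the helper names build_command and run_parser are hypothetical, and only the -file and -outfile flags are taken from this page. It assumes perl is on the PATH and that the script and the input file sit in the same working directory.

```python
import subprocess
from pathlib import Path


def build_command(script, infile, outfile=None):
    """Return the argument list for one parser run (hypothetical helper)."""
    cmd = ["perl", str(script), f"-file={infile}"]
    if outfile is not None:
        # -outfile is optional according to the SYNOPSIS on this page
        cmd.append(f"-outfile={outfile}")
    return cmd


def run_parser(workdir, script, infile, outfile=None):
    """Run the parser with workdir as the current working directory."""
    workdir = Path(workdir)
    # The page says the input file must be in the same directory as the script.
    if not (workdir / infile).is_file():
        raise FileNotFoundError(f"input file not found: {workdir / infile}")
    return subprocess.run(
        build_command(script, infile, outfile),
        cwd=workdir,   # equivalent to using 'cd' before running the command
        check=True,    # raise CalledProcessError if perl exits with an error
    )
```

For example, run_parser(".", "WhoIsParser.pl", "listofurls.txt", "listofurls_processed.txt") would issue the same command as shown above from the current directory.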
=== NAME ===
WhoIs Parser - Retrieves and parses Whois information
corresponding columns with information from the WhoIs API.
=== SYNOPSIS ===
perl whoisParser.pl -file=<file> [-outfile=<file>]
=== OPTIONS ===
-file=<file>: Name of file of domain names.
http://www.alpheus.net 2003-03-27T23:14:33Z 27-mar-2018 2016-03-28T11:22:05Z Alpheus Firstcall 1301 Fannin St.20th Floor Houston 77002 US 1301 Fannin St.20th Floor Houston 77002 US
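Many WHOIS records leave most fields empty, so a row like the one above has to be assembled with missing values replaced by an empty string before the fields are joined. The following Python sketch illustrates that idea only; it is not the Perl code from the script, the function name format_row is hypothetical, and the tab separator is an assumption, since this page does not state the output delimiter.

```python
def format_row(fields, sep="\t"):
    """Join one record's fields, writing missing (None) values as ''.

    The tab separator is an assumption; the original output format
    is not documented on this page.
    """
    return sep.join("" if value is None else str(value) for value in fields)


# A record where the creation date and the postal code are missing.
row = format_row(["http://www.alpheus.net", None, "Houston", None, "US"])
```

Keeping the empty strings in place preserves the column positions, so each value still lines up with its heading even when neighbouring fields are missing.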
 
==Summer 2018 Work==
 
I used this parser after running my Google URL finder as detailed on http://mcnair.bakerinstitute.org/wiki/U.S._Seed_Accelerators#Finding_Company_URLs.
 
Type this in the command line:
perl whoisParser_v2.pl -file="inputfile" -outfile="outputfile"
 
Associated files can be found in:
E:\McNair\Projects\Accelerators\Summer 2018\url finder
 
The input file is allURLS.txt and the output file is whoisresults.txt.
