Edito
This Dataset contain file available before 30 septembter 2021
We collect and process data every 5 days and release the data when we have some time.
All this data is currently available online with public access at the date of the scan.
Link to access this Dataset : https://www.youpwn.ca/research/x_public/indexof/c2NyYXBtZV8yMDIxLTA5LTI5/index.html
Directory structure :
├── export_canada
│ ├── csv
│ └── json
├── export_usa
│ ├── csv
│ └── json
└── html_canada
File format and example
JSON and CSV is available, also the raw html file from the hosting web server
Format of the JSON :
{
"scantime": "2021-09-29T03:06:14.828104",
"ip": "144.217.12.224",
"port": 80,
"host": "144.217.12.224",
"link": "server.properties",
"org": "OVH Hosting, Inc.",
"asn": "AS16276",
"city": "Montréal",
"isp": "OVH SAS",
"shodanmodule": "http",
"sslsubject": null,
"province": "10",
"hash": -1655093557,
"html_hash": 1800728864,
"product": "nginx",
"ssl_jarm": null,
"sslissuer": null,
"country": "Canada"
}
NB : province "10" is Quebec province in Canada, for texas state in USA you should see "TX"
In this example, the file server.properties was online at 03:06 AM on 2021-09-29 with link http://144.217.12.224/server.properties
Format of the CSV :
Field name :
Scheme,HOST,PORT,FILE,Organization,ISP,ASN,TLS-CERT-AUTHORITY,TLS-CERT-SUBJECT,Country,Region-CODE,Scan-date
infile :
https,167.114.115.37,443,composer.json,OVH SAS,"OVH Hosting, Inc.",AS16276,ogha.ca,ogha.ca,Canada,10,2021-09-25T03:42:35.999146
NB : Region-code "10" is Quebec province in Canada, for texas state in USA you should see "TX"
In this example, the file composer.json was online at 03:42 on 2021-09-25 with link https://167.114.115.37/composer.json