This will be our sixth attempt at building this tool, and we are trying to perfect the process.
The tool will check each website's homepage (main page) only, look for a phone number, and extract it. The first thing it should look for is a "tel:" link on the homepage.
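As a rough illustration of the "tel:" check, here is a minimal sketch that pulls numbers out of tel: links in homepage HTML that has already been downloaded. It uses only the Python standard library. The names TelLinkParser and extract_tel_numbers are made up for this example and are not taken from our existing code, and fetching the page is left out.

```python
from html.parser import HTMLParser
from urllib.parse import unquote


class TelLinkParser(HTMLParser):
    """Collects the number part of every <a href="tel:..."> link (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.numbers = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "a":
            return
        for name, value in attrs:
            if name != "href" or not value:
                continue
            href = value.strip()
            if href.lower().startswith("tel:"):
                # Drop the "tel:" prefix and decode escapes such as %2B for "+".
                self.numbers.append(unquote(href[4:]).strip())


def extract_tel_numbers(html):
    """Return all tel: numbers found in the given HTML, in page order."""
    parser = TelLinkParser()
    parser.feed(html)
    return parser.numbers
```

For example, extract_tel_numbers('<a href="tel:+1-202-555-0143">Call</a>') returns a list holding the single string '+1-202-555-0143'. The number is returned as written in the link, without any normalisation.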
We are open to any suggestion that beats our current accuracy.
We run about 100k websites at a time, so the tool must get past errors and bugs and keep going. We also want it to resume where it left off: if a run is interrupted, the output file should already be saved so that we know where to pick up again.
Speed is also important.
It should also be error-proof: if the program encounters an error on a URL, it should automatically skip it and continue with the next URL (or line).
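To make the resume and skip-on-error behaviour concrete, here is a minimal sketch under a few assumptions: the input is a text file with one URL per line, the output is a CSV, and check_url is a placeholder for whatever function does the actual lookup and returns (phone, logic). None of these names come from our existing code. One design choice to note: a failed URL is written to the output as a NO row with the error type, so it is not retried on resume. That is an assumption of the sketch, not a requirement.

```python
import csv
import os

HEADER = ["Url", "Suitability", "Phone", "Logic"]


def load_done(output_path):
    """Return the set of URLs already present in the output file."""
    if not os.path.exists(output_path):
        return set()
    with open(output_path, newline="", encoding="utf-8") as f:
        # The header value "Url" also lands in this set, which is harmless.
        return {row[0] for row in csv.reader(f) if row}


def run(input_path, output_path, check_url):
    """Process every URL not yet in the output; never stop on a single failure."""
    done = load_done(output_path)
    is_new = not os.path.exists(output_path) or os.path.getsize(output_path) == 0
    with open(input_path, encoding="utf-8") as src, \
            open(output_path, "a", newline="", encoding="utf-8") as out:
        writer = csv.writer(out)
        if is_new:
            writer.writerow(HEADER)
        for line in src:
            url = line.strip()
            if not url or url in done:
                continue  # blank line, or finished in an earlier run
            try:
                phone, logic = check_url(url)  # placeholder for the real lookup
            except Exception as exc:
                # Assumption: log the failure as a NO row and move on.
                phone, logic = "", "error: " + type(exc).__name__
            writer.writerow([url, "YES" if phone else "NO", phone, logic])
            out.flush()  # push each row to disk so an interruption loses little
            done.add(url)
```

Running run("urls.txt", "results.csv", check_url) a second time after an interruption processes only the URLs that are not yet in results.csv. The file names here are examples only.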
The ideal output would have four columns: Url, Suitability, Phone, Logic.
Url - the input website
Suitability - YES/NO, depending on whether a phone number was found
Phone - the extracted phone number
Logic - where the number came from: tel, class number, or regex phone format (US and UK only)
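To show how one output row could be put together, here is a minimal sketch of the regex fallback and the row layout. The two patterns are rough approximations of common US and UK formats written for this example, not a validated rule set, and they will miss or mis-match some numbers. The class-based check is not covered. The names find_phone_by_regex and build_row, and the labels "regex (US)" and "regex (UK)", are made up for illustration.

```python
import re

# Rough US pattern: optional +1, then 3-3-4 digits with common separators.
US_PATTERN = re.compile(
    r"(?<!\d)(?:\+?1[\s.-]?)?\(?[2-9]\d{2}\)?[\s.-]?[2-9]\d{2}[\s.-]?\d{4}(?!\d)"
)
# Rough UK pattern: +44 or a leading 0, then 9 or 10 more digits.
UK_PATTERN = re.compile(
    r"(?<!\d)(?:\+44[\s-]?\(?0?\)?[\s-]?|0)\d(?:[\s-]?\d){8,9}(?!\d)"
)


def find_phone_by_regex(text):
    """Return (phone, logic) for the first US or UK style match, else ("", "")."""
    for label, pattern in (("regex (US)", US_PATTERN), ("regex (UK)", UK_PATTERN)):
        match = pattern.search(text)
        if match:
            return match.group(0), label
    return "", ""


def build_row(url, text, tel_number=None):
    """Build [Url, Suitability, Phone, Logic]; a tel: number wins over regex."""
    if tel_number:
        phone, logic = tel_number, "tel"
    else:
        phone, logic = find_phone_by_regex(text)
    return [url, "YES" if phone else "NO", phone, logic]
```

For example, build_row("https://example.com", "Call us on (202) 555-0143 today") gives ["https://example.com", "YES", "(202) 555-0143", "regex (US)"], and a page with no match gives a NO row with empty Phone and Logic fields.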
We also have existing Python code for this. If you can find a way to make it resume automatically where it left off (skipping the line that caused the error), that is also fine.