Posted in the Data Extractor Forum.




Search Engine Harvesting Not Working

The software doesn't extract emails from search engine result pages that have emails on them.  The emails on the page look like these, for example: sales@abc.com or jack@dfecorp.com

I suspect the emails are not being extracted because part of each address is bolded.  On these pages, every character after the @ sign is bolded wherever an email address appears, so the software needs to account for that.

I believe you answered this question for me before, but I can't find it in this forum's history.  I had to re-install the software, so this rule got deleted on my end.  The answer was adding a small piece of code to the Extraction Rules section.  Could you re-post this answer if you can find it?  Thanks!


by Randy Macdonald on Jan 4 2010 5:23pm

Search Engine Harvesting Not Working


In the "Extract Emails from Webpages" rule, change this:

objResults = document.body.outerHTML.match(objRE)

to this:

objResults = document.body.innerText.match(objRE)
by Nico Westerdale on Jan 4 2010 7:48pm
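
A minimal sketch of why this change likely helps, assuming the search engine wraps everything after the @ sign in <b> tags as described in the question. The pattern below is only a hypothetical stand-in for objRE (the rule's actual pattern is not shown in this thread), and stripTags is an illustrative helper that roughly approximates innerText so the example can run outside a browser.

```javascript
// Hypothetical stand-in for objRE; the pattern in the real rule may differ.
var objRE = /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g;

// Assumed markup: the part after the @ sign is wrapped in <b> tags.
var sampleHTML = 'Contact sales@<b>abc.com</b> or jack@<b>dfecorp.com</b>';

// Illustrative helper: approximates innerText by removing tags.
// A real browser's innerText also handles whitespace and hidden elements.
function stripTags(html) {
  return html.replace(/<[^>]*>/g, '');
}

// match() with a global pattern returns null when nothing matches,
// so fall back to an empty array.
function extractEmails(source) {
  return source.match(objRE) || [];
}

// Like outerHTML: the <b> tag right after the @ interrupts each address.
var fromMarkup = extractEmails(sampleHTML);

// Like innerText: the tags are gone, so each address is contiguous.
var fromText = extractEmails(stripTags(sampleHTML));

console.log(fromMarkup.length);   // 0
console.log(fromText.join(', ')); // sales@abc.com, jack@dfecorp.com
```

Matching against the text rather than the markup means any inline formatting inside an address no longer matters, at the cost of missing addresses that appear only in attributes such as mailto: links.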


© copyright 2004-2024 Iconico, Inc. Code & Design. All Rights Reserved.