Reply
 
LinkBack Thread Tools Search this Thread Display Modes
  #1   Report Post  
Old May 22nd 05, 01:42 AM
Caveat Lector
 
Posts: n/a
Default Question on web spiders

Maybe not the right place but seems there are several web experts here.

Can web spiders read and harvest e-mail addresses from a pdf file ?

Many users and folks like QRZ.com are using jpegs not ascii for listing
e-mails -- this seems to work.

So for pdf files without going to a jpeg --- are ascii text addresses
harvestable ?

Thanks

--
CL -- I doubt, therefore I might be !







  #2   Report Post  
Old May 22nd 05, 02:15 AM
Brian Hill
 
Posts: n/a
Default


"Caveat Lector" wrote in message
news:8AQje.1453$Xh.738@fed1read07...
Maybe not the right place but seems there are several web experts here.

Can web spiders read and harvest e-mail addresses from a pdf file ?

Many users and folks like QRZ.com are using jpegs not ascii for listing
e-mails -- this seems to work.

So for pdf files without going to a jpeg --- are ascii text addresses
harvestable ?

Thanks

--
CL -- I doubt, therefore I might be !



Programs like Adobe have search capability so I would think it's possible a
havester could use the same technique but the time to open such docs and go
through the search probably wouldn't be worth the effort to develop? This is
ascii and any address posted to usenet can be harvested.

B.H.



  #3   Report Post  
Old May 27th 05, 08:39 AM
 
Posts: n/a
Default

In: 8AQje.1453$Xh.738@fed1read07, "Caveat Lector" wrote:
Maybe not the right place but seems there are several web experts here.

Can web spiders read and harvest e-mail addresses from a pdf file ?

Many users and folks like QRZ.com are using jpegs not ascii for listing
e-mails -- this seems to work.

So for pdf files without going to a jpeg --- are ascii text addresses
harvestable ?


Yes, (well, most likely) If it can be exported to text, it can be
harvested. Take a look at googles "view as HTML" option for instance.

Not sure if spammers have resorted this far or not yet though...

Have a look at spamassassin if you want a good (free) spam detection
system.

Jamie
--
http://www.geniegate.com Custom web programming
(rot13) User Management Solutions
Reply
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules

Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Question on web spiders Caveat Lector Antenna 9 May 25th 05 05:16 AM
Question on web spiders Caveat Lector Policy 3 May 23rd 05 08:42 AM
Good morning or good evening depending upon your location. I want to ask you the most important question of your life. Your joy or sorrow for all eternity depends upon your answer. The question is: Are you saved? It is not a question of how good [email protected] Antenna 0 April 25th 05 03:43 AM
Good morning or good evening depending upon your location. I want to ask you the most important question of your life. Your joy or sorrow for all eternity depends upon your answer. The question is: Are you saved? It is not a question of how good H. Adam Stevens, NQ5H Antenna 2 April 24th 05 09:42 PM
Question Pool vs Book Larnin' Mike Coslo Policy 24 July 22nd 04 05:50 AM


All times are GMT +1. The time now is 02:06 AM.

Powered by vBulletin® Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright ©2004-2024 RadioBanter.
The comments are property of their posters.
 

About Us

"It's about Radio"

 

Copyright © 2017