Archive

Posts Tagged ‘info macro’

Scrape Website Info – Macro

November 29th, 2009 Comments off

Search for several keywords I will supply on deviantart.com.
Enter information into columns on a spreadsheet. I estimate there would be 5000-10000 pages to get the info from.

For every search result (artwork):

Click into every piece of artwork.
Click into each person’s profile.
Click through advertisement or wait through it.
Scrape the following on each profile page:

profile page url
username
subusername
# of deviations
# comments
# pageviews

any info in devious section including:

email address
website url

age
gender
location
favorite artist
listening to
watching
etc.

Bear