Scraping pages for Images and Alt text

NJDevil

Banned
May 21, 2011
I have a list of URLs in Excel. Each of these pages has an image within the <div class="post-bodycopy"> (the site I want to scrape is on WordPress, if that matters).

I'm looking to scrape the images and, if possible, the alt text as well from all of these pages. I tried doing this with Google Docs, but all I get is the author of the image, not the link to the image or the alt text.

Does anyone know a way of doing what I'm trying to accomplish? I would prefer to be able to export the image files and the alt text, but I will settle for being able to export the URL of the image itself (on this website it is /files/[year]/[month]/....) along with the alt text in a separate column.


Any help is greatly appreciated!
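
For anyone landing here with the same question, here is a rough sketch of the extraction step in Python, using only the standard library. It assumes the markup is reasonably well formed (every non-void tag inside the div is explicitly closed), since it tracks nesting by counting open and close tags. The names PostImageParser and extract_images, and the example.com URLs, are made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Void elements never get a closing tag, so they must not change the depth count.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class PostImageParser(HTMLParser):
    """Collect (image_url, alt_text) for each <img> inside <div class="post-bodycopy">."""

    def __init__(self, base_url=""):
        super().__init__()
        self.base_url = base_url
        self.depth = 0      # 0 means we are outside the target div
        self.images = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if self.depth == 0:
            classes = (attrs.get("class") or "").split()
            if tag == "div" and "post-bodycopy" in classes:
                self.depth = 1
            return
        if tag == "img":
            src = attrs.get("src")
            if src:
                # Resolve relative paths such as /files/... against the page URL.
                self.images.append((urljoin(self.base_url, src), attrs.get("alt") or ""))
        elif tag not in VOID_TAGS:
            self.depth += 1

    def handle_endtag(self, tag):
        if self.depth > 0 and tag not in VOID_TAGS:
            self.depth -= 1


def extract_images(html, base_url=""):
    """Return a list of (absolute_image_url, alt_text) tuples for one page."""
    parser = PostImageParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.images
```

You would call extract_images(page_html, page_url) once per URL from the spreadsheet. If the real pages have sloppy HTML (unclosed <p> tags and so on), a more forgiving parser is probably a better choice than this depth counter.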
 


To update: I can use Screaming Frog to get the links to the images and the alt text. I'm still unable to bulk download all the images, though.
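
If the image links are already exported, the bulk download itself can be fairly small. This is only a sketch in Python with the standard library: it assumes a plain list of absolute image URLs, and the function names (local_name, fetch_bytes, download_images) are invented here. Stripping a leading "files" folder from the name is a guess based on the /files/[year]/[month]/ layout mentioned above.

```python
import os
import re
import urllib.request
from urllib.parse import urlsplit, unquote


def local_name(image_url):
    """Build a flat file name from the URL path, e.g. /files/2011/05/a.jpg -> 2011_05_a.jpg."""
    path = unquote(urlsplit(image_url).path)
    parts = [p for p in path.split("/") if p and p not in (".", "..")]
    if parts and parts[0] == "files":
        parts = parts[1:]
    # Replace anything that is not safe in a file name.
    name = re.sub(r"[^A-Za-z0-9._-]", "_", "_".join(parts))
    return name or "image"


def fetch_bytes(image_url):
    """Download one URL and return its raw bytes."""
    with urllib.request.urlopen(image_url, timeout=30) as response:
        return response.read()


def download_images(image_urls, dest_dir, fetch=fetch_bytes):
    """Save every image into dest_dir; return (saved, failed) dicts keyed by URL."""
    os.makedirs(dest_dir, exist_ok=True)
    saved, failed = {}, {}
    for url in image_urls:
        target = os.path.join(dest_dir, local_name(url))
        try:
            data = fetch(url)
        except OSError as exc:  # urllib's URLError and HTTPError are OSError subclasses
            failed[url] = str(exc)
            continue
        with open(target, "wb") as handle:
            handle.write(data)
        saved[url] = target
    return saved, failed
```

The fetch parameter is there so the loop can be tried with a fake downloader before pointing it at the real site. Two URLs that map to the same file name would overwrite each other, so check for that if the site reuses names.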
 
Maybe I'm missing something, but a simple custom Perl/PHP script should be able to handle all of this. The script would read the URLs, download the images, copy the alt text you need, save each image in a corresponding location, and record the results in the Excel file.

Unless you don't have a coder, in which case nothing I said would matter.
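
To make the last step of that idea concrete, here is a small sketch of recording the results in a file Excel can open. It is in Python rather than Perl/PHP, and it writes CSV instead of a native Excel workbook, which is an assumption about what is acceptable. The column names and the write_rows/save_csv helpers are hypothetical.

```python
import csv

# One row per image: the page it came from, the image URL, its alt text,
# and where the downloaded copy was saved.
FIELDS = ["page_url", "image_url", "alt_text", "local_file"]


def write_rows(rows, out):
    """Write dict rows (keyed by FIELDS) to an open text file object."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    for row in rows:
        writer.writerow(row)


def save_csv(rows, path):
    """Write the rows to path in a form Excel opens cleanly."""
    # utf-8-sig adds a BOM so Excel detects the encoding;
    # newline="" is what the csv module expects for files it writes.
    with open(path, "w", newline="", encoding="utf-8-sig") as handle:
        write_rows(rows, handle)
```

The csv module takes care of quoting, so alt text containing commas or quotation marks still ends up in a single column.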