您现在的位置是：课程教程文章

爬取动漫图片：以后就有好看的桌面背景啦

2023-12-18 18:25课程教程文章人已围观

课程介绍: 1、 爬取动漫图片：以后就有好看的桌面背景啦课程推荐; 2、 爬取动漫图片：以后就有好看的桌面背景啦课程专题

正文

话不多说，直接上完整代码

import requests as r
import re
import os
import time
file_name = "动漫截图"
if not os.path.exists(file_name):
	os.mkdir(file_name)
                
for p in range(1,34):
        print("--------------------正在爬取第{}页内容------------------".format(p))
        url = 'https://www.acgimage.com/shot/recommend?page={}'.format(p)
        headers = {"user-agent"
           : "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.162 Safari/537.36"}
           
        resp = r.get(url, headers=headers)  
        html = resp.text

        images = re.findall('data-original="(.*?)" ', html)
        names =re.findall('title="(.*?)"', html)
        #print(images)
        #print(names)
        dic = dict(zip(images, names))
        for image in images:
                time.sleep(1)
                print(image, dic[image])
                name = dic[image]
                #name = image.split('/')[-1]
                i = r.get(image, headers=headers).content
                try:
                     with open(file_name + '/' + name  + '.jpg' , 'wb') as f:
                         f.write(i)
                except FileNotFoundError:
                    continue

先导入要使用的库

import requests as r
import re
import os
import time

然后去分析要去爬的网址：动漫截图网

下图是网址的内容：

好了 url已经确定

下面去寻找headers

找到下面是代码展示

url = 'https://www.acgimage.com/shot/recommend?page={}'.format(p)
headers = {"user-agent"
           : "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.162 Safari/537.36"
           }

然后检索要爬的图片内容

从上图就可以找到图片的位置：data-origina=后面的内容
以及图片的名字：title=后面的内容

然后用正则表达式re来检索就行了

images = re.findall('data-original="(.*?)" ', html)
names =re.findall('title="(.*?)"', html)

最后将其保存就好了

i = r.get(image, headers=headers).content
with open(file_name + '/' + name  + '.jpg' , 'wb') as f:
         f.write(i)

然后将page后面的数字改动就可以跳到相应的页面
换页的问题也就解决了

or p in range(1,34):
  url = 'https://www.acgimage.com/shot/recommend?page={}'.format(p)

以及将爬到的图片放到自己建立的文件zh
使用了os库

file_name = "动漫截图"
if not os.path.exists(file_name):
  os.mkdir(file_name)

更多python相关文章，请关注python自学网。 课程教程：爬取动漫图片：以后就有好看的桌面背景啦

课程专题：

上一篇：python如何显示出一张图片？

下一篇：没有了

其他课程

随机推荐

站点信息

文章统计：篇文章

您现在的位置是：课程教程文章

爬取动漫图片：以后就有好看的桌面背景啦

课程介绍

1、爬取动漫图片：以后就有好看的桌面背景啦课程推荐

2、爬取动漫图片：以后就有好看的桌面背景啦课程专题

正文

其他课程

猜你喜欢

随机推荐

站点信息

您现在的位置是：课程教程文章

爬取动漫图片：以后就有好看的桌面背景啦

课程介绍

1、 爬取动漫图片：以后就有好看的桌面背景啦课程推荐

2、 爬取动漫图片：以后就有好看的桌面背景啦课程专题

正文

其他课程

猜你喜欢

随机推荐

站点信息

1、爬取动漫图片：以后就有好看的桌面背景啦课程推荐

2、爬取动漫图片：以后就有好看的桌面背景啦课程专题