Python爬虫之抓取网络图片

安全经验
21年10月11日
编辑

aqzt

释放双眼，带上耳机，听听看~！

1.目的

以百度图片首页为例，首页如下图所示，网页上有一些图片，我们的目的就是将这些图片保存到本地。

Python爬虫之抓取网络图片

2.源码


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
1#coding=utf-8

2#version: python 2.7

3#author: Hao Chen

4

5import urllib

6import re

7

8#step1.获取整个页面的数据

9url=&quot;http://image.baidu.com/&quot;

10page = urllib.urlopen(url)     #打开一个url地址

11html = page.read()             #读取url上的数据

12

13#step2.删选页面中想要的数据

14reg = r&#x27;src=&quot;(.+?\.jpg)&quot; &#x27;        #构建正则表达式

15imgre = re.compile(reg)           #把正则表达式变异成一个对象

16imgList = re.findall(imgre,html)  #读取html中包含正则表达式的数据

17

18#直接用以下方法也行，更简便

19#imgList = re.findall(&#x27;src=&quot;(.+?\.jpg)&quot; &#x27;,html)

20 

21#step3.将页面筛选的数据保存到本地

22x=0

23for imgurl in imgList:

24    urllib.urlretrieve(imgurl,&#x27;%s.jpg&#x27;%x)  #远程将数据下载到本地

25    x+=1

26

27

{{userData.name}}已认证

Python爬虫之抓取网络图片

1.目的

2.源码

职场中的那些话那些事

Linux日志分析

{{userData.name}}已认证

1.目的

2.源码

Related posts:

职场中的那些话那些事

Linux日志分析

如何构建一个分布式爬虫：理论篇

Http与RPC通信协议的比较

Redis+Keepalived高可用方案详细分析

BP神经网络算法