
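Modules

You already know how to pull functions in from an external module with import, for example import math.

So how do you write a module of your own?

Modules are programs

Any Python program can be imported as a module.

Suppose you write hello.py in the /root directory with the following content: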


    # hello.py
    print 'Hello'

Start the Python interpreter:

    >>> import sys
    >>> sys.path.append('/root')
    >>> import hello
    Hello

After the import, a new file appears next to the module:

    -rw-r--r-- 1 root root   24 Jul 16 17:29 hello.py
    -rw-r--r-- 1 root root  110 Jul 16 17:30 hello.pyc

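The file with the .pyc extension is a compiled (bytecode) version of the module. The next time you import the same module, Python loads the .pyc file instead of the .py file, as long as the .py file has not changed since it was compiled.

Modules are imported mainly for their definitions, such as variables, functions, and classes. These only need to be defined once, so importing a module several times has the same effect as importing it once.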
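The "import once" behaviour can be checked directly. The following is a minimal sketch in Python 3 syntax (the listings in this post use Python 2.7). The module name hello_once and the temporary directory are made up for the demonstration; they are not part of the original example.

```python
import importlib
import os
import sys
import tempfile

# Hypothetical module written to a temporary directory for this demo.
module_dir = tempfile.mkdtemp()
with open(os.path.join(module_dir, "hello_once.py"), "w") as f:
    f.write("counter = []\ncounter.append('ran')\nprint('module body executed')\n")

sys.path.append(module_dir)
importlib.invalidate_caches()   # make sure the new file is noticed

import hello_once   # first import: the module body runs and prints
import hello_once   # second import: nothing is printed, the cached module is reused

first = sys.modules["hello_once"]                # the cache that import consults
second = importlib.import_module("hello_once")   # a third "import", also cached
print(first is second)            # same module object every time
print(len(hello_once.counter))    # the body ran exactly once
```

If you really need the module body to run again, for example after editing the file, importlib.reload (the built-in reload in Python 2) does that explicitly.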

Modules are for definitions

1. Defining a function in a module

    # hello2.py
    def hello():
        print "Hello2"

Run it:

    >>> import hello2
    >>> hello2.hello()
    Hello2

2. Adding test code to a module

To make sure the hello function works, you might rewrite hello2 as a new module (still named hello2):

    # hello2.py
    def hello():
        print "Hello2"

    # test
    hello()

Run it:

    >>> import hello2
    Hello2
    >>> hello2.hello()
    Hello2

This is probably not what you want: the test call runs on every import. We need to tell whether the module is being run as a program or imported into another program. The __name__ variable makes that possible:

    >>> __name__
    '__main__'
    >>> hello2.__name__
    'hello2'

In the main program (including the interpreter's interactive prompt), __name__ has the value '__main__', while in an imported module it is the module's name. So I change hello2 like this:

    # hello2.py
    def hello():
        print "Hello2"

    # test
    def test():
        hello()

    if __name__ == '__main__':
        test()

Run it:

    [root@instance-54lh4cfv ~]# python hello2.py
    Hello2
    [root@instance-54lh4cfv ~]# python
    Python 2.7.5 (default, Apr  9 2019, 14:30:50)
    [GCC 4.8.5 20150623 (Red Hat 4.8.5-36)] on linux2
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import hello2
    >>> hello2.hello()
    Hello2

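When the file is run as a program, test() calls hello and the greeting is printed. When it is imported as a module, nothing runs automatically; hello is only defined and can be called as hello2.hello().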
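Both situations can be reproduced from a single script. This is a rough sketch in Python 3 syntax, assuming a throwaway module named guard_demo (a hypothetical name) whose hello records its call in a list instead of printing, so the difference is easy to inspect. runpy.run_path stands in for running the file from the command line.

```python
import importlib
import os
import runpy
import sys
import tempfile

# Variant of hello2.py: hello() records the call in a list rather than printing.
source = (
    "calls = []\n"
    "def hello():\n"
    "    calls.append('Hello2')\n"
    "def test():\n"
    "    hello()\n"
    "if __name__ == '__main__':\n"
    "    test()\n"
)
module_dir = tempfile.mkdtemp()
path = os.path.join(module_dir, "guard_demo.py")   # hypothetical file name
with open(path, "w") as f:
    f.write(source)

sys.path.append(module_dir)
importlib.invalidate_caches()

# Case 1: imported as a module. __name__ is the module name, so test() is skipped.
imported = importlib.import_module("guard_demo")
print(imported.__name__, imported.calls)

# Case 2: executed as a program. __name__ is '__main__', so test() runs.
as_script = runpy.run_path(path, run_name="__main__")
print(as_script["__name__"], as_script["calls"])
```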

Making your modules available

In the earlier examples we modified sys.path to tell the interpreter where to look for the module. You normally don't want to do that every time. There are two ways around it: put the module in a suitable location, or tell the interpreter where to find it.

1. Putting the module in the right place:

    >>> import sys, pprint
    >>> pprint.pprint(sys.path)  # pprint gives more readable output, one entry per line
    ['',
     '/usr/lib64/python27.zip',
     '/usr/lib64/python2.7',
     '/usr/lib64/python2.7/plat-linux2',
     '/usr/lib64/python2.7/lib-tk',
     '/usr/lib64/python2.7/lib-old',
     '/usr/lib64/python2.7/lib-dynload',
     '/usr/lib64/python2.7/site-packages',
     '/usr/lib64/python2.7/site-packages/gtk-2.0',
     '/usr/lib/python2.7/site-packages']

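The interpreter searches these paths for modules; the site-packages directory is the best choice. Put hello2.py there and it can be imported without touching sys.path.

2. Telling the interpreter where to look

Use this approach when you want to keep your modules somewhere else. The usual way is to include the module's directory in the PYTHONPATH environment variable.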
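To see PYTHONPATH at work without changing your shell configuration, you can start a child interpreter with the variable set. This is only a sketch in Python 3 syntax; the module name pathdemo and its GREETING constant are invented for the example.

```python
import os
import subprocess
import sys
import tempfile

# Hypothetical module placed in a directory that is NOT on the default search path.
module_dir = tempfile.mkdtemp()
with open(os.path.join(module_dir, "pathdemo.py"), "w") as f:
    f.write("GREETING = 'found via PYTHONPATH'\n")

# Copy the current environment and prepend our directory to PYTHONPATH.
env = dict(os.environ)
existing = env.get("PYTHONPATH")
env["PYTHONPATH"] = module_dir if not existing else module_dir + os.pathsep + existing

# The child interpreter can import pathdemo only because of PYTHONPATH.
output = subprocess.check_output(
    [sys.executable, "-c", "import pathdemo; print(pathdemo.GREETING)"],
    env=env,
)
result = output.decode().strip()
print(result)
```

In day-to-day use you would set the variable once in your shell startup file instead, using the same os.pathsep-separated format (a colon on Linux).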

3. Naming the module

The file that contains a module's code must have the same name as the module, plus the .py extension.

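Packages

To organize modules better, you can group them into packages. A package is just another kind of module, one that can contain other modules. In practice it is a directory of modules.

For Python to treat a directory as a package, the directory must contain a file named __init__.py.

For example, take a package named drawing that contains modules named shapes and colors.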


Importing it:

    import drawing              # the contents of drawing's __init__ module are now available
    import drawing.colors       # the colors module is available, but only under its full name drawing.colors
    from drawing import shapes  # the shapes module is available under the short name shapes
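The package layout can be tried out end to end. The sketch below, in Python 3 syntax, builds a drawing package in a temporary directory and then runs the three import forms. The file contents (DESCRIPTION, square, RED) are placeholders invented for the demonstration, since the original does not show what shapes and colors contain.

```python
import importlib
import os
import sys
import tempfile

# Build this layout in a temporary directory:
#   drawing/__init__.py
#   drawing/shapes.py
#   drawing/colors.py
root = tempfile.mkdtemp()
package_dir = os.path.join(root, "drawing")
os.mkdir(package_dir)

files = {
    "__init__.py": "DESCRIPTION = 'drawing package'\n",           # placeholder content
    "shapes.py": "def square(side):\n    return side * side\n",   # placeholder content
    "colors.py": "RED = (255, 0, 0)\n",                           # placeholder content
}
for name, body in files.items():
    with open(os.path.join(package_dir, name), "w") as f:
        f.write(body)

sys.path.insert(0, root)        # the directory that CONTAINS the package
importlib.invalidate_caches()

import drawing                  # runs drawing/__init__.py
import drawing.colors           # usable only under the full name
from drawing import shapes      # usable under the short name

print(drawing.DESCRIPTION)
print(drawing.colors.RED)
print(shapes.square(3))
```

Note that the search path must contain the parent directory of drawing, not the drawing directory itself.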

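The standard library

The standard library is installed on your machine together with the Python interpreter; it is part of Python. Its modules are ready-made tools that can make your programming far more productive.

Some commonly used standard library modules are introduced below.

sys

This module gives you access to variables and functions that are closely tied to the Python interpreter: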

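argv          Command-line arguments, including the script name
exit([arg])   Exits the current program
modules       Dictionary mapping module names to loaded modules
path          List of directory names in which modules are looked up
platform      Platform identifier
stdin         Standard input stream
stdout        Standard output stream
stderr        Standard error stream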

Let's write a script that prints its command-line arguments in reverse order:

    import sys
    args = sys.argv[1:]  # argv[0] is the file name
    args.reverse()
    print ' '.join(args)

Run it with a few arguments:

    [root@root ~]# python reverseargs.py this is a test
    test a is this
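The other entries in the sys table can be explored in the same way. A small sketch in Python 3 syntax; the printed values depend on your machine, so no expected output is shown.

```python
import sys

# platform: a short identifier such as 'linux' or 'win32'
print(sys.platform)

# modules: every module loaded so far, keyed by name
print("sys" in sys.modules)

# path: the module search path as a plain list of directory names
print(len(sys.path))

# stdout and stderr are file-like objects; write() does not add a newline
sys.stdout.write("normal output\n")
sys.stderr.write("diagnostic output\n")

# argv always exists, even when no arguments were given
argument_count = len(sys.argv)
print(argument_count)
```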

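os

This module gives you access to a number of operating system services.

environ           Mapping of environment variables
system(command)   Runs an operating system command in a subshell
sep               Separator used between the components of a path
pathsep           Separator used between several paths
linesep           Line separator
urandom(n)        Returns n bytes of cryptographically strong random data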
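A short tour of these names, as a sketch in Python 3 syntax. The directory names and the echo command are arbitrary examples, and the printed values differ between operating systems.

```python
import os

# environ behaves like a dictionary of environment variables
home = os.environ.get("HOME", "(not set)")
print(home)

# sep joins the components of ONE path ('/' on Linux)
joined = os.sep.join(["usr", "lib", "python"])
print(joined)

# pathsep separates SEVERAL paths, as in the PATH variable (':' on Linux)
search_path = os.pathsep.join(["/usr/bin", "/usr/local/bin"])
print(search_path.split(os.pathsep))

# linesep is the platform's line terminator
print(repr(os.linesep))

# urandom returns the requested number of random bytes
token = os.urandom(8)
print(len(token))

# system runs a command in a subshell and returns its exit status
status = os.system("echo hello from a subshell")
print(status)
```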

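fileinput

This module lets you iterate easily over all the lines of one or more files.

input([files[, inplace[, backup]]])   Iterates over the lines of several input streams
filename()                            Returns the name of the current file
lineno()                              Returns the current (cumulative) line number
close()                               Closes the sequence of files

A script that numbers the lines of a source file shows the module in use: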


    import fileinput
    for line in fileinput.input(inplace=True):  # inplace=True rewrites the original file directly
        line = line.rstrip()                    # strip trailing whitespace
        num = fileinput.lineno()                # get the line number
        print '%-40s # %2i' % (line,num)        # new format: content + "# line number"

If you run python numberlines.py numberlines.py, the script rewrites itself as:

    import fileinput                         #  1
    for line in fileinput.input(inplace=True): #  2
        line = line.rstrip()                 #  3
        num = fileinput.lineno()             #  4
        print '%-40s # %2i' % (line,num)     #  5
                                             #  6

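The explanatory comments in the script above were added afterwards, which is why they do not appear in the output.

Be careful when setting inplace to True: it can easily ruin a file.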
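One way to reduce that risk is the backup parameter listed in the table above, which keeps a copy of the original file under the given extension. A minimal sketch in Python 3 syntax; the file sample.txt and the '.bak' extension are choices made for this example.

```python
import fileinput
import os
import tempfile

# A throwaway file to rewrite (hypothetical name).
work_dir = tempfile.mkdtemp()
target = os.path.join(work_dir, "sample.txt")
with open(target, "w") as f:
    f.write("first\nsecond\n")

# While inplace is active, print() writes into the file, not to the terminal.
# backup=".bak" keeps the untouched original as sample.txt.bak.
for line in fileinput.input(files=[target], inplace=True, backup=".bak"):
    print("%-10s # %2i" % (line.rstrip(), fileinput.lineno()))
fileinput.close()

with open(target) as f:
    numbered = f.read()
with open(target + ".bak") as f:
    original = f.read()

print(numbered)   # the rewritten, numbered lines
print(original)   # the original content, preserved in the backup
```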

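Heaps and deques

Heaps

A heap is a kind of priority queue. Python has no separate heap type, only a module of heap-manipulating functions, heapq. Six of its functions are listed here:

heappush(heap, x)      Pushes x onto the heap
heappop(heap)          Pops the smallest element off the heap (so this is a min-heap)
heapify(heap)          Turns a list into a heap, in place
heapreplace(heap, x)   Pops the smallest element off the heap and pushes x
nlargest(n, iter)      Returns the n largest elements of iter
nsmallest(n, iter)     Returns the n smallest elements of iter

The first four operate directly on a heap, and you must use a list as the heap object itself (a heap can be implemented on top of an array; here the list plays that role).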


    >>> from heapq import *
    >>> from random import shuffle
    >>> data = range(10)
    >>> shuffle(data)
    >>> data
    [9, 8, 1, 0, 3, 4, 7, 6, 5, 2]
    >>> heapify(data)  # a heap can be built directly from a list
    >>> data
    [0, 2, 1, 5, 3, 4, 7, 6, 8, 9]
    >>> heappush(data, 0.5)  # push 0.5 onto the heap
    >>> data
    [0, 0.5, 1, 5, 2, 4, 7, 6, 8, 9, 3]
    >>> heappop(data)  # pop the three smallest elements
    0
    >>> heappop(data)
    0.5
    >>> heappop(data)
    1

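For the properties of heaps, see the post on binary heaps in Java; it is written in terms of Java, but that should not get in the way of understanding.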
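The remaining three functions from the table, heapreplace, nlargest, and nsmallest, can be sketched as follows (Python 3 syntax; the numbers are arbitrary sample data). Note that nlargest and nsmallest return lists of n elements, not a single element.

```python
import heapq

heap = [5, 1, 8, 3, 9, 2]
heapq.heapify(heap)                     # rearrange the list into a min-heap

smallest = heapq.heapreplace(heap, 4)   # pop the smallest (1), then push 4
print(smallest)
print(heap[0])                          # heap[0] is always the current minimum

# Popping repeatedly yields the elements in ascending order.
drained = [heapq.heappop(heap) for _ in range(len(heap))]
print(drained)

# nlargest / nsmallest accept any iterable, not just a heap.
scores = [5, 1, 8, 3, 9, 2]
top_three = heapq.nlargest(3, scores)
bottom_two = heapq.nsmallest(2, scores)
print(top_three)
print(bottom_two)
```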

Deques

A double-ended queue (deque) can also be created from an iterable:


    >>> from collections import deque
    >>> q = deque(range(5))
    >>> q
    deque([0, 1, 2, 3, 4])
    >>> q.append(5)
    >>> q.appendleft(6)
    >>> q
    deque([6, 0, 1, 2, 3, 4, 5])
    >>> q.pop()
    5
    >>> q.popleft()
    6
    >>> q.rotate(3)
    >>> q
    deque([2, 3, 4, 0, 1])
    >>> q.rotate(-1)
    >>> q
    deque([3, 4, 0, 1, 2])

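"Double-ended" means that elements can be added and removed at both ends.

rotate(n=1) shifts the elements left or right as if the two ends were joined in a ring. A positive n rotates n steps to the right; a negative n rotates n steps to the left. For example, deq.rotate(1) is equivalent to deq.appendleft(deq.pop()).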
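The equivalence stated above is easy to verify. A small sketch in Python 3 syntax, with variable names chosen for the example:

```python
from collections import deque

# rotate(1): the last element moves to the front
rotated = deque([0, 1, 2, 3, 4])
rotated.rotate(1)

manual = deque([0, 1, 2, 3, 4])
manual.appendleft(manual.pop())

print(rotated)
print(rotated == manual)

# rotate(-1): the first element moves to the back
left = deque([0, 1, 2, 3, 4])
left.rotate(-1)

manual_left = deque([0, 1, 2, 3, 4])
manual_left.append(manual_left.popleft())

print(left)
print(left == manual_left)
```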

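time

This module can get the current time, manipulate times and dates, read a time from a string, and format a time as a string.

Many Python functions handle time as a tuple of nine numbers: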


    >>> import time
    >>> localtime = time.localtime(time.time())
    >>> print "Local time:", localtime
    Local time: time.struct_time(tm_year=2019, tm_mon=7, tm_mday=17, tm_hour=17, tm_min=43, tm_sec=34, tm_wday=2, tm_yday=198, tm_isdst=0)
    >>> localtime = time.asctime(time.localtime(time.time()))
    >>> localtime
    'Wed Jul 17 17:44:13 2019'
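Reading a time from a string and formatting it back are handled by strptime and strftime. The sketch below uses Python 3 syntax and reuses the timestamp from the session above; the format string is one common choice, not the only one.

```python
import time

stamp = "2019-07-17 17:43:34"
layout = "%Y-%m-%d %H:%M:%S"

# strptime parses a string into the nine-field struct_time
parsed = time.strptime(stamp, layout)
print(parsed.tm_year, parsed.tm_mon, parsed.tm_mday)
print(parsed.tm_wday, parsed.tm_yday)   # weekday (Monday is 0) and day of the year

# strftime formats a struct_time back into a string
formatted = time.strftime(layout, parsed)
print(formatted == stamp)

# mktime converts local struct_time to seconds since the epoch; localtime reverses it
seconds = time.mktime(parsed)
round_trip = time.localtime(seconds)
print(tuple(round_trip)[:6] == tuple(parsed)[:6])
```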

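random

This module contains functions that return random numbers.

random()                           Returns a random real number n with 0 <= n < 1
uniform(a, b)                      Returns a random real number n with a <= n <= b
randrange([start], stop, [step])   Returns a random number from range(start, stop, step)
choice(seq)                        Returns a random element from the sequence seq
shuffle(seq[, random])             Shuffles the sequence seq in place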
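Each function in the table can be tried in a few lines. A sketch in Python 3 syntax; the seed is fixed only so that repeated runs give the same values, and the sample data is arbitrary.

```python
import random

random.seed(42)   # fixed seed, purely to make the run repeatable

r = random.random()                 # float in [0, 1)
u = random.uniform(1, 10)           # float between 1 and 10
k = random.randrange(0, 10, 2)      # one of 0, 2, 4, 6, 8
suit = random.choice(["clubs", "diamonds", "hearts", "spades"])

deck = list(range(10))
random.shuffle(deck)                # reorders the list in place, returns None

print(r)
print(u)
print(k)
print(suit)
print(deck)
```

The optional second argument of shuffle shown in the table belongs to older Python versions; it was removed in Python 3.11, so the sketch calls shuffle with the sequence only.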
