91欧美视频,国产欧美日韩精品一区二区三区,青青综合网

本文將介紹使用Python編寫多線程 HTTP 下載器，并生成.exe可執(zhí)行文件。

環(huán)境：windows/Linux + Python2.7.x

單線程

在介紹多線程之前首先介紹單線程。編寫單線程的思路為：

1.解析url；

2.連接web服務(wù)器；

3.構(gòu)造http請(qǐng)求包；

4.下載文件。

接下來(lái)通過(guò)代碼進(jìn)行說(shuō)明。

解析url

通過(guò)用戶輸入url進(jìn)行解析。如果解析的路徑為空，則賦值為'/'；如果端口號(hào)為空，則賦值為"80”；下載文件的文件名可根據(jù)用戶的意愿進(jìn)行更改（輸入'y'表示更改，輸入其它表示不需要更改）。

下面列出幾個(gè)解析函數(shù)：

									#解析host和path

									def analyHostAndPath(totalUrl):

									  protocol,s1 = urllib.splittype(totalUrl)

									  host, path = urllib.splithost(s1)

									  if path == '':

									    path = '/'

									  return host, path

									#解析port

									def analysisPort(host):

									  host, port = urllib.splitport(host)

									  if port is None:

									    return 80

									  return port

									#解析filename

									def analysisFilename(path):

									  filename = path.split('/')[-1]

									  if '.' not in filename:

									    return None

									  return filename

連接web服務(wù)器

使用socket模塊，根據(jù)解析url得到的host和port連接web服務(wù)器，代碼如下：

									import socket

									from analysisUrl import port,host

									ip = socket.gethostbyname(host)

									s = socket.socket(socket.AF_INET,socket.SOCK_STREAM)

									s.connect((ip, port))

									print "success connected webServer！！"

構(gòu)造http請(qǐng)求包

根據(jù)解析url得到的path, host, port構(gòu)造一個(gè)HTTP請(qǐng)求包。

									from analysisUrl import path, host, port

									packet = 'GET ' + path + ' HTTP/1.1\r\nHost: ' + host + '\r\n\r\n'

下載文件

根據(jù)構(gòu)造的http請(qǐng)求包，向服務(wù)器發(fā)送文件，抓取響應(yīng)報(bào)文頭部的"Content-Length"。

									def getLength(self):

									    s.send(packet)

									    print "send success!"

									    buf = s.recv(1024)

									    print buf

									    p = re.compile(r'Content-Length: (\d*)')

									    length = int(p.findall(buf)[0])

									    return length, buf

下載文件并計(jì)算下載所用的時(shí)間。

									def download(self):

									    file = open(self.filename,'wb')

									    length,buf = self.getLength()

									    packetIndex = buf.index('\r\n\r\n')

									    buf = buf[packetIndex+4:]

									    file.write(buf)

									    sum = len(buf)

									    while 1:

									      buf = s.recv(1024)

									      file.write(buf)

									      sum = sum + len(buf)

									      if sum >= length:

									        break

									    print "Success!!"

									if __name__ == "__main__":

									  start = time.time()

									  down = downloader()

									  down.download()

									  end = time.time()

									  print "The time spent on this program is %f s"%(end - start)

多線程

抓取響應(yīng)報(bào)文頭部的"Content-Length"字段，結(jié)合線程個(gè)數(shù)，加鎖分段下載。與單線程的不同，這里將所有代碼整合為一個(gè)文件，代碼中使用更多的Python自帶模塊。

得到"Content-Length"：

									def getLength(self):

									    opener = urllib2.build_opener()

									    req = opener.open(self.url)

									    meta = req.info()

									    length = int(meta.getheaders("Content-Length")[0])

									    return length

根據(jù)得到的Length，結(jié)合線程個(gè)數(shù)劃分范圍：

									def get_range(self):

									    ranges = []

									    length = self.getLength()

									    offset = int(int(length) / self.threadNum)

									    for i in range(self.threadNum):

									      if i == (self.threadNum - 1):

									        ranges.append((i*offset,''))

									      else:

									        ranges.append((i*offset,(i+1)*offset))

									    return ranges

實(shí)現(xiàn)多線程下載，在向文件寫入內(nèi)容時(shí)，向線程加鎖，并使用with lock代替lock.acquire( )...lock.release( );使用file.seek( )設(shè)置文件偏移地址，保證寫入文件的準(zhǔn)確性。

									def downloadThread(self,start,end):

									    req = urllib2.Request(self.url)

									    req.headers['Range'] = 'bytes=%s-%s' % (start, end)

									    f = urllib2.urlopen(req)

									    offset = start

									    buffer = 1024

									    while 1:

									      block = f.read(buffer)

									      if not block:

									        break

									      with lock:

									        self.file.seek(offset)

									        self.file.write(block)

									        offset = offset + len(block)

									  def download(self):

									    filename = self.getFilename()

									    self.file = open(filename, 'wb')

									    thread_list = []

									    n = 1

									    for ran in self.get_range():

									      start, end = ran

									      print 'starting:%d thread '% n

									      n += 1

									      thread = threading.Thread(target=self.downloadThread,args=(start,end))

									      thread.start()

									      thread_list.append(thread)

									    for i in thread_list:

									      i.join()

									    print 'Download %s Success!'%(self.file)

									    self.file.close()

運(yùn)行結(jié)果：

Python實(shí)現(xiàn)多線程HTTP下載器示例

將(*.py)文件轉(zhuǎn)化為(*.exe)可執(zhí)行文件

當(dāng)寫好了一個(gè)工具，如何讓那些沒(méi)有安裝Python的人使用這個(gè)工具呢？這就需要將.py文件轉(zhuǎn)化為.exe文件。

這里用到Python的py2exe模塊，初次使用，所以對(duì)其進(jìn)行介紹：

py2exe是一個(gè)將Python腳本轉(zhuǎn)換成windows上可獨(dú)立執(zhí)行的可執(zhí)行文件（*.exe）的工具，這樣，就可以不用裝Python在windows上運(yùn)行這個(gè)可執(zhí)行程序。

接下來(lái)，在multiThreadDownload.py的同目錄下，創(chuàng)建mysetup.py文件，編寫：

									from distutils.core import setup

									import py2exe

									setup(console=["multiThreadDownload.py"])

接著執(zhí)行命令：Python mysetup.py py2exe

生成dist文件夾，multiTjhreadDownload.exe文件位于其中，點(diǎn)擊運(yùn)行即可：

Python實(shí)現(xiàn)多線程HTTP下載器示例

demo下載地址：HttpFileDownload.rar

以上就是本文的全部?jī)?nèi)容，希望對(duì)大家的學(xué)習(xí)有所幫助，也希望大家多多支持服務(wù)器之家。

原文鏈接：http://www.cnblogs.com/ybjourney/p/6387965.html

一区二区三区在线-一区二区三区亚洲视频-一区二区三区亚洲-一区二区三区午夜-一区二区三区四区在线视频-一区二区三区四区在线免费观看

Python實(shí)現(xiàn)多線程HTTP下載器示例

延伸 · 閱讀

python 列表轉(zhuǎn)為字典的兩個(gè)小方法(小結(jié))

python 插入Null值數(shù)據(jù)到Postgresql的操作

Python3以GitHub為例來(lái)實(shí)現(xiàn)模擬登錄和爬取的實(shí)例講解

在Windows系統(tǒng)上搭建Nginx+Python+MySQL環(huán)境的教程

Python實(shí)現(xiàn)ping指定IP的示例

使用NumPy和pandas對(duì)CSV文件進(jìn)行寫操作的實(shí)例

python直接訪問(wèn)私有屬性的簡(jiǎn)單方法

Python的dict字典結(jié)構(gòu)操作方法學(xué)習(xí)筆記

PyCharm設(shè)置SSH遠(yuǎn)程調(diào)試的方法

Python安裝圖文教程 Pycharm安裝教程

python是什么意思？python有什么用？

使用Python抓取模板之家的CSS模板

Python 列表(List)操作方法詳解