python 多数据输出到txt_python-BeautifulSoup输出到.txt文件
我正在尝试将数据导出为.txt文件
from bs4 import BeautifulSoup
import requests
import os
import os
os.getcwd()
'/home/folder'
os.mkdir("Probeersel6")
os.chdir("Probeersel6")
os.getcwd()
'/home/Desktop/folder'
os.mkdir("img") #now `folder`
url = "http://nos.nl/artikel/2093082-steeds-meer-nekklachten-bij-kinderen-door-gebruik-tablets.html"
r = requests.get(url)
soup = BeautifulSoup(r.content)
data = soup.find_all("article", {"class": "article"})
with open(""%s".txt", "wb" %(url)) as file:
for item in data:
print item.contents[0].find_all("time", {"datetime": "2016-03-16T09:50:30+0100"})[0].text
print item.contents[0].find_all("a", {"class": "link-grey"})[0].text
print "
"
print item.contents[0].find_all("img", {"class": "media-full"})[0]
print "
"
print item.contents[1].find_all("div", {"class": "article_textwrap"})[0].text
file.write()
应该放在:
file.write()
上班?
我还试图将.txt文件的名称与url相同,应该使用字符串吗?
with open(""%s".txt", "wb" %(url)) as file:
url = "http://nos.nl/artikel/2093082-steeds-meer-nekklachten-bij-kinderen-door-gebruik-tablets.html"
总结
以上是生活随笔为你收集整理的python 多数据输出到txt_python-BeautifulSoup输出到.txt文件的全部内容,希望文章能够帮你解决所遇到的问题。
- 上一篇: qt爬取网页信息_豆瓣TOP250数据爬
- 下一篇: python爬虫深入爬取_Python爬