Scrapy 302 error
In settings.py, set COOKIES_ENABLED to True.
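A minimal sketch of the relevant line, assuming a standard Scrapy project layout. Scrapy enables cookies by default, so this only changes behavior when the project's settings.py has turned them off.

```python
# settings.py (only the relevant line is shown)

# Let Scrapy's cookie middleware keep and resend cookies across redirects.
COOKIES_ENABLED = True
```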
I write about solutions to problems I have run into in programming and data analytics. They may help you in your work. Thank you.
Tuesday, June 26, 2018
stderr, stdout, stdin: how to redirect output to a file on Linux
http://tldp.org/HOWTO/Bash-Prog-Intro-HOWTO-3.html
There are 3 file descriptors, stdin, stdout and stderr (std=standard).
Basically you can:
- redirect stdout to a file
- redirect stderr to a file
- redirect stdout to stderr
- redirect stderr to stdout
- redirect stderr and stdout to a file
- redirect stderr and stdout to stdout
- redirect stderr and stdout to stderr
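Two of the redirections listed above can also be sketched from Python with the standard subprocess module. This is an illustration under stated assumptions, not part of the HOWTO: the child command and the errors.txt file name are made up for the example.

```python
import os
import subprocess
import sys
import tempfile

# Child process that writes one line to stdout and one line to stderr.
child = [
    sys.executable, "-c",
    "import sys; print('to stdout'); print('to stderr', file=sys.stderr)",
]

# Hypothetical log location in a fresh temporary directory.
log_path = os.path.join(tempfile.mkdtemp(), "errors.txt")

# Redirect stderr to a file. Mode "a" appends on every run (like 2>> in Bash);
# mode "w" would overwrite the file each time (like 2>).
for _ in range(2):
    with open(log_path, "a") as log:
        subprocess.run(child, stdout=subprocess.DEVNULL, stderr=log, check=True)

with open(log_path) as log:
    logged = log.read().splitlines()

# Redirect stderr to stdout (like 2>&1) and capture the merged stream.
merged = subprocess.run(
    child, stdout=subprocess.PIPE, stderr=subprocess.STDOUT, text=True, check=True
).stdout

print(logged)
print(merged)
```

Because the file is opened in append mode, the log keeps the stderr line from both runs.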
I redirected stderr to a file:
scrapy crawl XXX 2> nohup.txt
or
scrapy crawl XXX 2>> nohup.txt
">>" appends to the file instead of overwriting it.
Tuesday, June 12, 2018
Monday, June 11, 2018
Scrapy spider: one URL, multiple requests (sample code)
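from scrapy.http import FormRequest
from scrapy.spiders import CrawlSpider

# XXXItem is a placeholder for the project's own Item class;
# import it from the project's items module.


class PabhSpider(CrawlSpider):
    name = 'pabh'
    allowed_domains = ['xxx']

    def start_requests(self):
        # First request: the URL with the first "depart" value.
        url = 'http://xxx'
        num = '01'
        formdata = {
            "depart": num,
            "years": '2014'
        }
        return [FormRequest(url=url, formdata=formdata, method='GET', callback=self.parse)]

    def parse(self, response):
        item = XXXItem()
        item['bh'] = response.xpath('/html/body/form/p/font/select[3]/option/@value').extract()
        yield item

        # Request the same URL again, once per remaining "depart" value.
        # Scrapy's duplicate filter drops the repeats that later responses generate.
        num = ['02', '03', '04', '05', '06', '07', '08', '09', '10', '11', '12', '13', '14', '21', '31', '40', '51', '61']
        for x in num:
            url = 'http://xxx'
            formdata = {
                "depart": x,
                "years": '2014'
            }
            yield FormRequest(url=url, formdata=formdata, method='GET', callback=self.parse)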
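With method GET, FormRequest puts the form fields into the query string, so every "depart" value turns the one base URL into a distinct request URL. A minimal standard-library sketch of that idea follows; the base URL is a hypothetical stand-in for the 'http://xxx' placeholder above, and only three of the values are used.

```python
from urllib.parse import urlencode

# Hypothetical base URL standing in for the post's 'http://xxx'.
base_url = "http://example.com/search"

# A few of the "depart" values from the spider above.
departs = ['01', '02', '03']

# Build one GET URL per value by appending the encoded form fields.
urls = [
    base_url + "?" + urlencode({"depart": depart, "years": "2014"})
    for depart in departs
]

for url in urls:
    print(url)
```

Since the resulting URLs differ, Scrapy treats them as separate requests even though the page address is the same.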
Wednesday, June 6, 2018
How to get rid of garbage characters when opening a txt file with East Asian characters in Excel
Sometimes when we open a txt file that contains East Asian characters in Excel, we see garbage characters. How do we get rid of them?
To open a txt file in Excel, first open an empty workbook, then click File => Open, go to the file you want to open, and click Open.
Some people say that choosing the Windows (ANSI) option gets rid of the garbage characters. I tried Windows (ANSI) with my file, and the garbage characters were still there.
So I tried some other options. Unicode (UTF-8) got rid of the garbage characters.
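One likely explanation is that the file was saved as UTF-8, so reading its bytes with an ANSI code page produces the wrong characters. The sketch below illustrates this under two assumptions: the file is UTF-8 encoded, and "Windows (ANSI)" corresponds to code page 1252, as on Western-locale Windows. The sample string is made up.

```python
# Sample East Asian text (made up for this example).
text = "日本語のテキスト"

# The bytes as they would be stored in a UTF-8 encoded txt file.
raw = text.encode("utf-8")

# Reading the bytes as Windows (ANSI), assumed here to be cp1252, garbles them.
as_ansi = raw.decode("cp1252", errors="replace")

# Reading the same bytes as UTF-8 recovers the original text.
as_utf8 = raw.decode("utf-8")

print(as_ansi == text)
print(as_utf8 == text)
```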