|
以下就是报错信息,我又试了几次,博主主页采集还是没有成功,单搜或群搜都有通过打数机浏览博主主页的过程,虽然时间很短
我同时还在试着用关键字搜索的工具,那个就会很正常的浏览翻页然后存入DataScraperWorks的文档下,
然而,博主主页就只有这个报错的log了,请帮我看看到底是什么问题,太感谢了。
2017-06-05 22:08:32 FileHandler RemoveCloseWindowMark WARN: Fail to find .metaseeker
2017-06-05 22:08:32 DataScraperEngine CloseEngineExternal WARN: Closing the engine, which is initiated from the external
2017-06-06 16:48:44 ValidateDelayedPage:Run 新浪微博_博主主页46221 ERROR: Timeout to load the page
2017-06-06 16:48:44 ExtractWebNodeData_Simp:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:48:44 SaveFile_Simp:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:48:44 ExtractSpiderClue_Simp:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:48:45 PushStack:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:48:45 CleanWorksBucket:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:48:46 FetchSpiderClue flushLastModified WARN: lastmodified is expected
2017-06-06 16:51:12 DataScraperEngine CrawlForTheme WARN: Transfer state from 18 to STATE_CRAWL_COUNTED.
2017-06-06 16:52:27 ValidateDelayedPage:Run 新浪微博_博主主页46221 ERROR: Timeout to load the page
2017-06-06 16:52:27 ExtractWebNodeData_Simp:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:52:27 SaveFile_Simp:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:52:27 ExtractSpiderClue_Simp:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:52:28 PushStack:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:52:28 CleanWorksBucket:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:52:29 FetchSpiderClue flushLastModified WARN: lastmodified is expected
2017-06-06 16:54:36 DataScraperEngine CrawlForTheme WARN: Transfer state from 18 to STATE_CRAWL_COUNTED.
2017-06-06 16:55:56 ValidateDelayedPage:Run 新浪微博_博主主页46221 ERROR: Timeout to load the page
2017-06-06 16:55:56 ExtractWebNodeData_Simp:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:55:56 SaveFile_Simp:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:55:56 ExtractSpiderClue_Simp:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:55:57 PushStack:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:56:03 CleanWorksBucket:Run 新浪微博_博主主页46221 WARN: Encounter processor error. The processor is skipped. PipeLineState : 54
2017-06-06 16:56:04 FetchSpiderClue flushLastModified WARN: lastmodified is expected
2017-06-06 16:58:12 DataScraperEngine CrawlForTheme WARN: Transfer state from 18 to STATE_CRAWL_COUNTED.
2017-06-06 17:04:36 DataScraperEngine CloseEngineExternal WARN: Closing the engine, which is initiated from the external
2017-06-06 17:10:25 DataScraperEngine CloseEngineExternal WARN: Closing the engine, which is initiated from the external
|
|
共 30 个关于本帖的回复 最后回复于 2020-5-31 18:41