stats - ScrapydWeb

PROJECT (event_scrapers), SPIDER (norwalk_historical_society)

project	event_scrapers
spider	norwalk_historical_society
job	16325bf1696d11f194b20050565fa5d9
first_log_time	2026-06-16 12:21:10
latest_log_time	2026-06-16 12:21:12
runtime	0:00:02
crawled_pages	1
scraped_items	0
shutdown_reason	N/A
finish_reason	finished
log_critical_count	0
log_error_count	0
log_warning_count	2
log_redirect_count	0
log_retry_count	0
log_ignore_count	0
latest_item	N/A

WARNING+

warning_logs
2 in total

2026-06-16 12:21:11 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/pipelines/__init__.py:47: ScrapyDeprecationWarning: EventScrapersPipeline.process_item() requires a spider argument, this is deprecated and the argument will not be passed in future Scrapy versions. If you need to access the spider instance you can save the crawler instance passed to from_crawler() and use its spider attribute.
  self._check_mw_method_spider_arg(pipe.process_item)
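
The warning above concerns the `process_item()` signature. A minimal sketch of the change it describes, assuming Scrapy 2.14 or later as in this log (older versions still pass `spider` to `process_item()`, so this form would not work there): the pipeline keeps the crawler handed to `from_crawler()` and reads its `spider` attribute instead of accepting a `spider` argument. The method body and the `source_spider` field are hypothetical, since the real `EventScrapersPipeline` code is not part of this page.

```python
class EventScrapersPipeline:
    """Sketch of a pipeline whose process_item() takes no spider argument."""

    def __init__(self, crawler):
        # Keep the crawler so the running spider stays reachable later.
        self.crawler = crawler

    @classmethod
    def from_crawler(cls, crawler):
        # Scrapy calls this hook when it builds the pipeline.
        return cls(crawler)

    def process_item(self, item):
        # The spider is read from the saved crawler, not from an argument.
        spider = self.crawler.spider
        # Hypothetical processing step: tag each item with the spider name.
        item["source_spider"] = spider.name
        return item
```

With this signature Scrapy should no longer need to pass the spider in, which is the condition the deprecation warning checks for.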

2026-06-16 12:21:11 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/core/spidermw.py:490: ScrapyDeprecationWarning: event_scrapers.spiders.norwalk_historical_society.ListingSpider defines the deprecated start_requests() method. start_requests() has been deprecated in favor of a new method, start(), to support asynchronous code execution. start_requests() will stop being called in a future version of Scrapy. If you use Scrapy 2.13 or higher only, replace start_requests() with start(); note that start() is a coroutine (async def). If you need to maintain compatibility with lower Scrapy versions, when overriding start_requests() in a spider class, override start() as well; you can use super() to reuse the inherited start() implementation without copy-pasting. See the release notes of Scrapy 2.13 for details: https://docs.scrapy.org/en/2.13/news.html
  warn(

INFO

DEBUG

scrapy_version
```
2.14.1
```
telnet_console
```
127.0.0.1:6023
```
telnet_password
```
1426083f6704b7ab
```

latest_crawl

2026-06-16 12:21:12 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://norwalkhistoricalsociety.org/events/list/?hide_subsequent_recurrences=1> (referer: None)

latest_stat

2026-06-16 12:21:11 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)

Head

2026-06-16 12:21:10 [scrapy.utils.log] INFO: Scrapy 2.14.1 started (bot: event_scrapers)
2026-06-16 12:21:10 [scrapy.utils.log] INFO: Versions:
{'lxml': '6.0.2',
 'libxml2': '2.14.6',
 'cssselect': '1.3.0',
 'parsel': '1.10.0',
 'w3lib': '2.0.0',
 'Twisted': '25.5.0',
 'Python': '3.12.3 (main, Mar 23 2026, 19:04:32) [GCC 13.3.0]',
 'pyOpenSSL': '25.3.0 (OpenSSL 3.5.4 30 Sep 2025)',
 'cryptography': '46.0.3',
 'Platform': 'Linux-6.8.0-90-generic-x86_64-with-glibc2.39'}
2026-06-16 12:21:10 [scrapy.crawler] DEBUG: Using AsyncCrawlerProcess
2026-06-16 12:21:10 [asyncio] DEBUG: Using selector: EpollSelector
2026-06-16 12:21:10 [scrapy.addons] INFO: Enabled addons:
[]
2026-06-16 12:21:11 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.asyncioreactor.AsyncioSelectorReactor
2026-06-16 12:21:11 [scrapy.utils.log] DEBUG: Using asyncio event loop: asyncio.unix_events._UnixSelectorEventLoop
2026-06-16 12:21:11 [scrapy.extensions.telnet] INFO: Telnet Password: 1426083f6704b7ab
2026-06-16 12:21:11 [scrapy.middleware] INFO: Enabled extensions:
['scrapy.extensions.corestats.CoreStats',
 'scrapy.extensions.logcount.LogCount',
 'scrapy.extensions.telnet.TelnetConsole',
 'scrapy.extensions.memusage.MemoryUsage',
 'scrapy.extensions.feedexport.FeedExporter',
 'scrapy.extensions.logstats.LogStats']
2026-06-16 12:21:11 [scrapy.crawler] INFO: Overridden settings:
{'BOT_NAME': 'event_scrapers',
 'FEED_EXPORT_ENCODING': 'utf-8',
 'FEED_URI_PARAMS': <function _feed_uri_params at 0x72958a32c540>,
 'LOG_FILE': '/root/event-list-scraping/logs/event_scrapers/norwalk_historical_society/16325bf1696d11f194b20050565fa5d9.log',
 'NEWSPIDER_MODULE': 'event_scrapers.spiders',
 'REQUEST_FINGERPRINTER_CLASS': 'scrapy_zyte_api.ScrapyZyteAPIRequestFingerprinter',
 'SPIDER_MODULES': ['event_scrapers.spiders']}
2026-06-16 12:21:11 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with 'ff9baec'
2026-06-16 12:21:11 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with 'ff9baec'
2026-06-16 12:21:11 [scrapy.middleware] INFO: Enabled downloader middlewares:
['scrapy.downloadermiddlewares.offsite.OffsiteMiddleware',
 'scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware',
 'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware',
 'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware',
 'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware',
 'scrapy.downloadermiddlewares.retry.RetryMiddleware',
 'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware',
 'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware',
 'scrapy.downloadermiddlewares.redirect.RedirectMiddleware',
 'scrapy_zyte_api.ScrapyZyteAPIDownloaderMiddleware',
 'scrapy.downloadermiddlewares.cookies.CookiesMiddleware',
 'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware',
 'scrapy.downloadermiddlewares.stats.DownloaderStats']
2026-06-16 12:21:11 [scrapy.middleware] INFO: Enabled spider middlewares:
['scrapy.spidermiddlewares.start.StartSpiderMiddleware',
 'scrapy.spidermiddlewares.httperror.HttpErrorMiddleware',
 'scrapy_zyte_api.ScrapyZyteAPISpiderMiddleware',
 'scrapy.spidermiddlewares.referer.RefererMiddleware',
 'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware',
 'scrapy.spidermiddlewares.depth.DepthMiddleware',
 'scrapy_zyte_api.ScrapyZyteAPIRefererSpiderMiddleware']
2026-06-16 12:21:11 [scrapy.middleware] INFO: Enabled item pipelines:
['event_scrapers.pipelines.EventScrapersPipeline']
2026-06-16 12:21:11 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/pipelines/__init__.py:47: ScrapyDeprecationWarning: EventScrapersPipeline.process_item() requires a spider argument, this is deprecated and the argument will not be passed in future Scrapy versions. If you need to access the spider instance you can save the crawler instance passed to from_crawler() and use its spider attribute.
  self._check_mw_method_spider_arg(pipe.process_item)

2026-06-16 12:21:11 [scrapy.core.engine] INFO: Spider opened
2026-06-16 12:21:11 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/core/spidermw.py:490: ScrapyDeprecationWarning: event_scrapers.spiders.norwalk_historical_society.ListingSpider defines the deprecated start_requests() method. start_requests() has been deprecated in favor of a new method, start(), to support asynchronous code execution. start_requests() will stop being called in a future version of Scrapy. If you use Scrapy 2.13 or higher only, replace start_requests() with start(); note that start() is a coroutine (async def). If you need to maintain compatibility with lower Scrapy versions, when overriding start_requests() in a spider class, override start() as well; you can use super() to reuse the inherited start() implementation without copy-pasting. See the release notes of Scrapy 2.13 for details: https://docs.scrapy.org/en/2.13/news.html
  warn(

2026-06-16 12:21:11 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
2026-06-16 12:21:11 [scrapy.extensions.telnet] INFO: Telnet console listening on 127.0.0.1:6023
2026-06-16 12:21:12 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://norwalkhistoricalsociety.org/events/list/?hide_subsequent_recurrences=1> (referer: None)
2026-06-16 12:21:12 [scrapy.core.engine] INFO: Closing spider (finished)
2026-06-16 12:21:12 [scrapy.extensions.feedexport] INFO: Stored csv feed (0 items) in: output/2026/06/16/norwalk_historical_society.csv
2026-06-16 12:21:12 [scrapy.statscollectors] INFO: Dumping Scrapy stats:
{'downloader/request_bytes': 275,
 'downloader/request_count': 1,
 'downloader/request_method_count/GET': 1,
 'downloader/response_bytes': 23627,
 'downloader/response_count': 1,
 'downloader/response_status_count/200': 1,
 'elapsed_time_seconds': 1.297281,
 'feedexport/success_count/FileFeedStorage': 1,
 'finish_reason': 'finished',
 'finish_time': datetime.datetime(2026, 6, 16, 10, 21, 12, 579690, tzinfo=datetime.timezone.utc),
 'httpcompression/response_bytes': 103266,
 'httpcompression/response_count': 1,
 'items_per_minute': 0.0,
 'log_count/DEBUG': 1,
 'log_count/INFO': 3,
 'memusage/max': 93044736,
 'memusage/startup': 93044736,
 'response_received_count': 1,
 'responses_per_minute': 60.0,
 'scheduler/dequeued': 1,
 'scheduler/dequeued/memory': 1,
 'scheduler/enqueued': 1,
 'scheduler/enqueued/memory': 1,
 'start_time': datetime.datetime(2026, 6, 16, 10, 21, 11, 282409, tzinfo=datetime.timezone.utc)}
2026-06-16 12:21:12 [scrapy.core.engine] INFO: Spider closed (finished)
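
Each log entry above starts with `timestamp [logger] LEVEL: message`, and continuation lines (such as `  warn(`) carry no prefix. As a rough illustration of how per-level counts like `log_warning_count` could be tallied from such a file, here is a minimal sketch; it is not LogParser's actual implementation, and `count_levels` and `LINE_RE` are hypothetical names.

```python
import re
from collections import Counter

# Matches the prefix of a Scrapy log entry: date, time, [logger], LEVEL.
LINE_RE = re.compile(
    r"^(?P<time>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) "
    r"\[(?P<logger>[^\]]+)\] "
    r"(?P<level>DEBUG|INFO|WARNING|ERROR|CRITICAL): "
)


def count_levels(log_text: str) -> Counter:
    """Count log entries per level, skipping continuation lines."""
    counts = Counter()
    for line in log_text.splitlines():
        match = LINE_RE.match(line)
        if match:
            counts[match["level"]] += 1
    return counts


# A few entries copied from the log above.
sample = """\
2026-06-16 12:21:11 [scrapy.core.engine] INFO: Spider opened
2026-06-16 12:21:11 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/pipelines/__init__.py:47: ScrapyDeprecationWarning: EventScrapersPipeline.process_item() requires a spider argument, this is deprecated and the argument will not be passed in future Scrapy versions. If you need to access the spider instance you can save the crawler instance passed to from_crawler() and use its spider attribute.
  self._check_mw_method_spider_arg(pipe.process_item)
2026-06-16 12:21:12 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://norwalkhistoricalsociety.org/events/list/?hide_subsequent_recurrences=1> (referer: None)
2026-06-16 12:21:12 [scrapy.core.engine] INFO: Closing spider (finished)
"""

print(count_levels(sample))
```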

Log

/1/log/utf8/event_scrapers/norwalk_historical_society/16325bf1696d11f194b20050565fa5d9/?job_finished=True

Source

http://127.0.0.1:6800/logs/event_scrapers/norwalk_historical_society/16325bf1696d11f194b20050565fa5d9.log

source	log
last_update_time	2026-06-16 12:21:12
last_update_timestamp	1781605272
downloader/request_bytes	275
downloader/request_count	1
downloader/request_method_count/GET	1
downloader/response_bytes	23627
downloader/response_count	1
downloader/response_status_count/200	1
elapsed_time_seconds	1.297281
feedexport/success_count/FileFeedStorage	1
finish_reason	finished
finish_time	datetime.datetime(2026, 6, 16, 10, 21, 12, 579690, tzinfo=datetime.timezone.utc)
httpcompression/response_bytes	103266
httpcompression/response_count	1
items_per_minute	0.0
log_count/DEBUG	1
log_count/INFO	3
memusage/max	93044736
memusage/startup	93044736
response_received_count	1
responses_per_minute	60.0
scheduler/dequeued	1
scheduler/dequeued/memory	1
scheduler/enqueued	1
scheduler/enqueued/memory	1
start_time	datetime.datetime(2026, 6, 16, 10, 21, 11, 282409, tzinfo=datetime.timezone.utc)
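
The timing fields above can be cross-checked against each other. The short sketch below uses only values copied from this page; it recomputes `elapsed_time_seconds` from `start_time` and `finish_time`, and compares `last_update_timestamp` with `last_update_time` to show that the log's wall-clock timestamps are two hours ahead of the UTC datetimes in the stats. The variable names are illustrative only.

```python
from datetime import datetime, timezone

# Values copied from the stats table above.
start_time = datetime(2026, 6, 16, 10, 21, 11, 282409, tzinfo=timezone.utc)
finish_time = datetime(2026, 6, 16, 10, 21, 12, 579690, tzinfo=timezone.utc)

# elapsed_time_seconds is the difference between the two datetimes.
elapsed = (finish_time - start_time).total_seconds()
print(round(elapsed, 6))  # 1.297281

# last_update_timestamp is a Unix epoch value; convert it to UTC.
last_update_utc = datetime.fromtimestamp(1781605272, tz=timezone.utc)

# last_update_time as displayed on the page (no timezone attached).
last_update_shown = datetime(2026, 6, 16, 12, 21, 12)

# Offset between the displayed log time and UTC, in hours.
offset_hours = (
    last_update_shown - last_update_utc.replace(tzinfo=None)
).total_seconds() / 3600
print(offset_hours)  # 2.0
```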
