Head
2026-06-16 10:41:04 [scrapy.utils.log] INFO: Scrapy 2.14.1 started (bot: event_scrapers)
2026-06-16 10:41:04 [scrapy.utils.log] INFO: Versions:
{'lxml': '6.0.2',
'libxml2': '2.14.6',
'cssselect': '1.3.0',
'parsel': '1.10.0',
'w3lib': '2.0.0',
'Twisted': '25.5.0',
'Python': '3.12.3 (main, Mar 23 2026, 19:04:32) [GCC 13.3.0]',
'pyOpenSSL': '25.3.0 (OpenSSL 3.5.4 30 Sep 2025)',
'cryptography': '46.0.3',
'Platform': 'Linux-6.8.0-90-generic-x86_64-with-glibc2.39'}
2026-06-16 10:41:04 [scrapy.crawler] DEBUG: Using AsyncCrawlerProcess
2026-06-16 10:41:04 [asyncio] DEBUG: Using selector: EpollSelector
2026-06-16 10:41:04 [scrapy.addons] INFO: Enabled addons:
[]
2026-06-16 10:41:04 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.asyncioreactor.AsyncioSelectorReactor
2026-06-16 10:41:04 [scrapy.utils.log] DEBUG: Using asyncio event loop: asyncio.unix_events._UnixSelectorEventLoop
2026-06-16 10:41:04 [scrapy.extensions.telnet] INFO: Telnet Password: c2a9eb45d94699b7
2026-06-16 10:41:04 [scrapy.middleware] INFO: Enabled extensions:
['scrapy.extensions.corestats.CoreStats',
'scrapy.extensions.logcount.LogCount',
'scrapy.extensions.telnet.TelnetConsole',
'scrapy.extensions.memusage.MemoryUsage',
'scrapy.extensions.feedexport.FeedExporter',
'scrapy.extensions.logstats.LogStats']
2026-06-16 10:41:04 [scrapy.crawler] INFO: Overridden settings:
{'BOT_NAME': 'event_scrapers',
'FEED_EXPORT_ENCODING': 'utf-8',
'FEED_URI_PARAMS': <function _feed_uri_params at 0x74ef9004c400>,
'LOG_FILE': '/root/event-list-scraping/logs/event_scrapers/moca_west/1878aa82695f11f184390050565fa5d9.log',
'NEWSPIDER_MODULE': 'event_scrapers.spiders',
'REQUEST_FINGERPRINTER_CLASS': 'scrapy_zyte_api.ScrapyZyteAPIRequestFingerprinter',
'SPIDER_MODULES': ['event_scrapers.spiders']}
2026-06-16 10:41:04 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with 'ff9baec'
2026-06-16 10:41:04 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with 'ff9baec'
2026-06-16 10:41:04 [scrapy.middleware] INFO: Enabled downloader middlewares:
['scrapy.downloadermiddlewares.offsite.OffsiteMiddleware',
'scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware',
'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware',
'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware',
'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware',
'scrapy.downloadermiddlewares.retry.RetryMiddleware',
'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware',
'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware',
'scrapy.downloadermiddlewares.redirect.RedirectMiddleware',
'scrapy_zyte_api.ScrapyZyteAPIDownloaderMiddleware',
'scrapy.downloadermiddlewares.cookies.CookiesMiddleware',
'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware',
'scrapy.downloadermiddlewares.stats.DownloaderStats']
2026-06-16 10:41:04 [scrapy.middleware] INFO: Enabled spider middlewares:
['scrapy.spidermiddlewares.start.StartSpiderMiddleware',
'scrapy.spidermiddlewares.httperror.HttpErrorMiddleware',
'scrapy_zyte_api.ScrapyZyteAPISpiderMiddleware',
'scrapy.spidermiddlewares.referer.RefererMiddleware',
'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware',
'scrapy.spidermiddlewares.depth.DepthMiddleware',
'scrapy_zyte_api.ScrapyZyteAPIRefererSpiderMiddleware']
2026-06-16 10:41:04 [scrapy.middleware] INFO: Enabled item pipelines:
['event_scrapers.pipelines.EventScrapersPipeline']
2026-06-16 10:41:04 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/pipelines/__init__.py:47: ScrapyDeprecationWarning: EventScrapersPipeline.process_item() requires a spider argument, this is deprecated and the argument will not be passed in future Scrapy versions. If you need to access the spider instance you can save the crawler instance passed to from_crawler() and use its spider attribute.
self._check_mw_method_spider_arg(pipe.process_item)
2026-06-16 10:41:04 [scrapy.core.engine] INFO: Spider opened
2026-06-16 10:41:04 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/core/spidermw.py:490: ScrapyDeprecationWarning: event_scrapers.spiders.moca_west_port.MocaWestSpider defines the deprecated start_requests() method. start_requests() has been deprecated in favor of a new method, start(), to support asynchronous code execution. start_requests() will stop being called in a future version of Scrapy. If you use Scrapy 2.13 or higher only, replace start_requests() with start(); note that start() is a coroutine (async def). If you need to maintain compatibility with lower Scrapy versions, when overriding start_requests() in a spider class, override start() as well; you can use super() to reuse the inherited start() implementation without copy-pasting. See the release notes of Scrapy 2.13 for details: https://docs.scrapy.org/en/2.13/news.html
warn(
2026-06-16 10:41:04 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
2026-06-16 10:41:04 [scrapy.extensions.telnet] INFO: Telnet console listening on 127.0.0.1:6023
2026-06-16 10:41:04 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): mocact.org:443
2026-06-16 10:41:05 [urllib3.connectionpool] DEBUG: https://mocact.org:443 "GET /events/ HTTP/1.1" 200 None
2026-06-16 10:41:05 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:05 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:05 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:05 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:05 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:05 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:05 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://mocact.org/wp-json/tribe/views/v2/html?pu=%2Fnew-events%2Fmonth%2F2026-06%2F%3Fshortcode%3Df0b1cb7d&u=%2Fnew-events%2Flist%2F%3Fshortcode%3Df0b1cb7d%26tribe-bar-date%3D2026-10-15&smu=false&tvn1=149ea4efbc&tvn2=> (referer: https://mocact.org/) ['zyte-api']
2026-06-16 10:41:07 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 10:41:07 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://mocact.org/wp-json/tribe/views/v2/html?pu=%2Fnew-events%2Fmonth%2F2026-06%2F%3Fshortcode%3Df0b1cb7d&u=%2Fnew-events%2Flist%2F%3Fshortcode%3Df0b1cb7d%26tribe-bar-date%3D2026-11-15&smu=false&tvn1=149ea4efbc&tvn2=> (referer: https://mocact.org/) ['zyte-api']
2026-06-16 10:41:07 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://mocact.org/wp-json/tribe/views/v2/html?pu=%2Fnew-events%2Fmonth%2F2026-06%2F%3Fshortcode%3Df0b1cb7d&u=%2Fnew-events%2Flist%2F%3Fshortcode%3Df0b1cb7d%26tribe-bar-date%3D2026-8-15&smu=false&tvn1=149ea4efbc&tvn2=> (referer: https://mocact.org/) ['zyte-api']
2026-06-16 10:41:07 [scrapy.dupefilters] DEBUG: Filtered duplicate request: <GET https://mocact.org/events-calendar/art-adventures-4-2-6-2-2-2-2-2/2026-11-21/> - no more duplicates will be shown (see DUPEFILTER_DEBUG to show all duplicates)
2026-06-16 10:41:08 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://mocact.org/wp-json/tribe/views/v2/html?pu=%2Fnew-events%2Fmonth%2F2026-06%2F%3Fshortcode%3Df0b1cb7d&u=%2Fnew-events%2Flist%2F%3Fshortcode%3Df0b1cb7d%26tribe-bar-date%3D2026-12-15&smu=false&tvn1=149ea4efbc&tvn2=> (referer: https://mocact.org/) ['zyte-api']
2026-06-16 10:41:09 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://mocact.org/events-calendar/art-adventures-4-2-6-2-2-2-2-2/2026-11-28/> (referer: None) ['zyte-api']
2026-06-16 10:41:09 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://mocact.org/events-calendar/art-adventures-4-2-6-2-2-2-2-2/2026-12-05/> (referer: None) ['zyte-api']
2026-06-16 10:41:09 [py.warnings] WARNING: /root/event-list-scraping/event_scrapers/spiders/moca_west_port.py:76: GuessedAtParserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ("lxml"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.
The code that caused this warning is on line 76 of the file /root/event-list-scraping/event_scrapers/spiders/moca_west_port.py. To get rid of this warning, pass the additional argument 'features="lxml"' to the BeautifulSoup constructor.
item['raw_body'] = str(bs(response.css('section#tribe-events-pg-template').get('')))
Tail
'<li class="tribe-events-meta-item tribe-venue-location">\n'
'<address class="tribe-events-address">\n'
'<span class="tribe-address">\n'
'<span class="tribe-street-address">19 Newtown Turnpike</span>\n'
'<br/>\n'
'<span class="tribe-locality">Westport</span><span '
'class="tribe-delimiter">,</span>\n'
'<span class="tribe-postal-code">06880</span>\n'
'<span class="tribe-country-name">United States</span>\n'
'</span>\n'
'<a class="tribe-events-gmap" '
'href="https://maps.google.com/maps?f=q&source=s_q&hl=en&geocode=&q=19+Newtown+Turnpike+Westport+06880+United+States" '
'rel="noreferrer noopener" target="_blank" title="Click to view a '
'Google Map">+ Google Map</a> </address>\n'
'</li>\n'
'<li class="tribe-events-meta-item">\n'
'<span class="tribe-venue-tel-label '
'tribe-events-meta-label">Phone</span>\n'
'<span class="tribe-venue-tel tribe-events-meta-value"> '
'203-222-7070 </span>\n'
'</li>\n'
'<li class="tribe-events-meta-item">\n'
'<span class="tribe-venue-url tribe-events-meta-value"> <a '
'href="https://mocact.org/" rel="external" target="_self">View '
'Venue Website</a> </span>\n'
'</li>\n'
'</ul>\n'
'</div>\n'
'<div class="tribe-events-meta-group '
'tribe-events-meta-group-gmap">\n'
'<div class="tribe-events-venue-map">\n'
'<iframe allowfullscreen="" aria-label="Venue location map" '
'frameborder="0" height="350px" '
'src="https://www.google.com/maps/embed/v1/place?key=AIzaSyDNsicAsP6-VuGtAb1O9riI3oc_NOb7IOU&q=19+Newtown+Turnpike+Westport+06880+United+States+&zoom=10" '
'style="border:0" title="Google maps iframe displaying the '
'address to MoCA\\CT" width="100%">\n'
'</iframe>\n'
'</div>\n'
'</div>\n'
'</div>\n'
'<h2 class="tribe-events-related-events-title">\n'
'\tRelated Events</h2>\n'
'<ul class="tribe-related-events tribe-clearfix">\n'
'<li>\n'
'<div class="tribe-related-events-thumbnail">\n'
'<a aria-hidden="true" class="url" '
'href="https://mocact.org/events-calendar/art-adventures-4-2-6-2-2-2-2-2/2026-06-20/" '
'rel="bookmark" role="presentation" tabindex="-1"><img alt="Art '
'Adventures" '
'src="https://mocact.org/wp-content/plugins/events-calendar-pro/src/resources/images/tribe-related-events-placeholder.png"/></a>\n'
'</div>\n'
'<div class="tribe-related-event-info">\n'
'<h3 class="tribe-related-events-title"><a '
'class="tribe-event-url" '
'href="https://mocact.org/events-calendar/art-adventures-4-2-6-2-2-2-2-2/2026-06-20/" '
'rel="bookmark">Art Adventures</a></h3>\n'
'<span class="tribe-event-date-start">June 20 @ 12:00 pm</span> - '
'<span class="tribe-event-time">1:30 pm</span>\n'
'<span class="tribe-events-calendar-series-archive__container '
'tribe-events-calendar-series-archive__container--pill">\n'
'<a aria-label="Event series: Art Adventures" '
'class="tribe-events-calendar-series-archive__link" '
'href="https://mocact.org/series/art-adventures-2/">\n'
'<svg aria-hidden="true" class="tribe-common-c-svgicon '
'tribe-common-c-svgicon--series '
'tribe-events-series-archive__icon" fill="none" height="12" '
'viewbox="0 0 14 12" width="14" '
'xmlns="http://www.w3.org/2000/svg">\n'
'<rect height="7" width="9" x="0.5" y="4.5"></rect>\n'
'<path d="M2 2.5H11.5V10"></path>\n'
'<path d="M4 0.5H13.5V8"></path>\n'
'</svg>\n'
'</a>\n'
'</span>\n'
'</div>\n'
'</li>\n'
'<li>\n'
'<div class="tribe-related-events-thumbnail">\n'
'<a aria-hidden="true" class="url" '
'href="https://mocact.org/events-calendar/looking-for-history-member-preview/" '
'rel="bookmark" role="presentation" tabindex="-1"><img alt="" '
'class="attachment-large size-large wp-post-image" height="576" '
'src="https://mocact.org/wp-content/uploads/2026/06/Looking-for-History-Web-Banner-1024x576.png" '
'width="1024"/></a>\n'
'</div>\n'
'<div class="tribe-related-event-info">\n'
'<h3 class="tribe-related-events-title"><a '
'class="tribe-event-url" '
'href="https://mocact.org/events-calendar/looking-for-history-member-preview/" '
'rel="bookmark">Looking for History- Opening Reception for '
'General Public</a></h3>\n'
'<span class="tribe-event-date-start">June 25 @ 6:00 pm</span> - '
'<span class="tribe-event-time">8:00 pm</span> </div>\n'
'</li>\n'
'<li>\n'
'<div class="tribe-related-events-thumbnail">\n'
'<a aria-hidden="true" class="url" '
'href="https://mocact.org/events-calendar/art-adventures-4-2-6-2-2-2-2-2/2026-06-27/" '
'rel="bookmark" role="presentation" tabindex="-1"><img alt="Art '
'Adventures" '
'src="https://mocact.org/wp-content/plugins/events-calendar-pro/src/resources/images/tribe-related-events-placeholder.png"/></a>\n'
'</div>\n'
'<div class="tribe-related-event-info">\n'
'<h3 class="tribe-related-events-title"><a '
'class="tribe-event-url" '
'href="https://mocact.org/events-calendar/art-adventures-4-2-6-2-2-2-2-2/2026-06-27/" '
'rel="bookmark">Art Adventures</a></h3>\n'
'<span class="tribe-event-date-start">June 27 @ 12:00 pm</span> - '
'<span class="tribe-event-time">1:30 pm</span>\n'
'<span class="tribe-events-calendar-series-archive__container '
'tribe-events-calendar-series-archive__container--pill">\n'
'<a aria-label="Event series: Art Adventures" '
'class="tribe-events-calendar-series-archive__link" '
'href="https://mocact.org/series/art-adventures-2/">\n'
'<svg aria-hidden="true" class="tribe-common-c-svgicon '
'tribe-common-c-svgicon--series '
'tribe-events-series-archive__icon" fill="none" height="12" '
'viewbox="0 0 14 12" width="14" '
'xmlns="http://www.w3.org/2000/svg">\n'
'<rect height="7" width="9" x="0.5" y="4.5"></rect>\n'
'<path d="M2 2.5H11.5V10"></path>\n'
'<path d="M4 0.5H13.5V8"></path>\n'
'</svg>\n'
'</a>\n'
'</span>\n'
'</div>\n'
'</li>\n'
'</ul>\n'
'</div> <!-- #post-x -->\n'
'<!-- Event footer -->\n'
'<div id="tribe-events-footer">\n'
'<!-- Navigation -->\n'
'<nav aria-label="Event Navigation" '
'class="tribe-events-nav-pagination">\n'
'<ul class="tribe-events-sub-nav">\n'
'<li class="tribe-events-nav-previous"><a '
'href="https://mocact.org/events-calendar/art-adventures-4-2-6-2-2-2-2-2/2026-10-03/"><span>«</span> '
'Art Adventures</a></li>\n'
'<li class="tribe-events-nav-next"><a '
'href="https://mocact.org/events-calendar/art-adventures-4-2-6-2-2-2-2-2/2026-10-17/">Art '
'Adventures <span>»</span></a></li>\n'
'</ul>\n'
'<!-- .tribe-events-sub-nav -->\n'
'</nav>\n'
'</div>\n'
'<!-- #tribe-events-footer -->\n'
'</div><!-- #tribe-events-content -->\n'
'<div class="tribe-events-after-html"></div>\n'
'<!--\n'
'This calendar is powered by The Events Calendar.\n'
'http://evnt.is/18wn\n'
'-->\n'
'</section></body></html>'}
2026-06-16 10:41:53 [scrapy.core.engine] INFO: Closing spider (finished)
2026-06-16 10:41:53 [scrapy.extensions.feedexport] INFO: Stored csv feed (49 items) in: output/2026/06/16/moca_west.csv
2026-06-16 10:41:53 [scrapy.statscollectors] INFO: Dumping Scrapy stats:
{'downloader/request_bytes': 19913,
'downloader/request_count': 57,
'downloader/request_method_count/GET': 57,
'downloader/response_bytes': 10858228,
'downloader/response_count': 57,
'downloader/response_status_count/200': 57,
'dupefilter/filtered': 34,
'elapsed_time_seconds': 49.47558,
'feedexport/success_count/FileFeedStorage': 1,
'finish_reason': 'finished',
'finish_time': datetime.datetime(2026, 6, 16, 8, 41, 53, 950072, tzinfo=datetime.timezone.utc),
'item_scraped_count': 49,
'items_per_minute': 60.0,
'log_count/DEBUG': 264,
'log_count/ERROR': 7,
'log_count/INFO': 3,
'log_count/WARNING': 1,
'memusage/max': 93151232,
'memusage/startup': 93151232,
'request_depth_max': 1,
'response_received_count': 57,
'responses_per_minute': 69.79591836734694,
'scheduler/dequeued': 57,
'scheduler/dequeued/memory': 57,
'scheduler/enqueued': 57,
'scheduler/enqueued/memory': 57,
'scrapy-zyte-api/429': 0,
'scrapy-zyte-api/attempts': 57,
'scrapy-zyte-api/error_ratio': 0.0,
'scrapy-zyte-api/errors': 0,
'scrapy-zyte-api/fatal_errors': 0,
'scrapy-zyte-api/mean_connection_seconds': 4.090034937518731,
'scrapy-zyte-api/mean_response_seconds': 4.332182013511397,
'scrapy-zyte-api/processed': 57,
'scrapy-zyte-api/request_args/customHttpRequestHeaders': 7,
'scrapy-zyte-api/request_args/httpResponseBody': 57,
'scrapy-zyte-api/request_args/httpResponseHeaders': 57,
'scrapy-zyte-api/request_args/url': 57,
'scrapy-zyte-api/status_codes/200': 57,
'scrapy-zyte-api/success': 57,
'scrapy-zyte-api/success_ratio': 1.0,
'scrapy-zyte-api/throttle_ratio': 0.0,
'start_time': datetime.datetime(2026, 6, 16, 8, 41, 4, 474492, tzinfo=datetime.timezone.utc)}
2026-06-16 10:41:53 [scrapy.core.engine] INFO: Spider closed (finished)