Head
2026-06-16 05:40:38 [scrapy.utils.log] INFO: Scrapy 2.14.1 started (bot: event_scrapers)
2026-06-16 05:40:38 [scrapy.utils.log] INFO: Versions:
{'lxml': '6.0.2',
'libxml2': '2.14.6',
'cssselect': '1.3.0',
'parsel': '1.10.0',
'w3lib': '2.0.0',
'Twisted': '25.5.0',
'Python': '3.12.3 (main, Mar 23 2026, 19:04:32) [GCC 13.3.0]',
'pyOpenSSL': '25.3.0 (OpenSSL 3.5.4 30 Sep 2025)',
'cryptography': '46.0.3',
'Platform': 'Linux-6.8.0-90-generic-x86_64-with-glibc2.39'}
2026-06-16 05:40:38 [scrapy.crawler] DEBUG: Using AsyncCrawlerProcess
2026-06-16 05:40:38 [asyncio] DEBUG: Using selector: EpollSelector
2026-06-16 05:40:38 [scrapy.addons] INFO: Enabled addons:
[]
2026-06-16 05:40:39 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.asyncioreactor.AsyncioSelectorReactor
2026-06-16 05:40:39 [scrapy.utils.log] DEBUG: Using asyncio event loop: asyncio.unix_events._UnixSelectorEventLoop
2026-06-16 05:40:39 [scrapy.extensions.telnet] INFO: Telnet Password: cf0ebd1f6e5e18df
2026-06-16 05:40:39 [scrapy.middleware] INFO: Enabled extensions:
['scrapy.extensions.corestats.CoreStats',
'scrapy.extensions.logcount.LogCount',
'scrapy.extensions.telnet.TelnetConsole',
'scrapy.extensions.memusage.MemoryUsage',
'scrapy.extensions.feedexport.FeedExporter',
'scrapy.extensions.logstats.LogStats']
2026-06-16 05:40:39 [scrapy.crawler] INFO: Overridden settings:
{'BOT_NAME': 'event_scrapers',
'FEED_EXPORT_ENCODING': 'utf-8',
'FEED_URI_PARAMS': <function _feed_uri_params at 0x7b6019138540>,
'LOG_FILE': '/root/event-list-scraping/logs/event_scrapers/curtain_call/218a7139693511f196bc0050565fa5d9.log',
'NEWSPIDER_MODULE': 'event_scrapers.spiders',
'REQUEST_FINGERPRINTER_CLASS': 'scrapy_zyte_api.ScrapyZyteAPIRequestFingerprinter',
'SPIDER_MODULES': ['event_scrapers.spiders']}
2026-06-16 05:40:39 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with 'ff9baec'
2026-06-16 05:40:39 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with 'ff9baec'
2026-06-16 05:40:39 [scrapy.middleware] INFO: Enabled downloader middlewares:
['scrapy.downloadermiddlewares.offsite.OffsiteMiddleware',
'scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware',
'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware',
'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware',
'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware',
'scrapy.downloadermiddlewares.retry.RetryMiddleware',
'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware',
'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware',
'scrapy.downloadermiddlewares.redirect.RedirectMiddleware',
'scrapy_zyte_api.ScrapyZyteAPIDownloaderMiddleware',
'scrapy.downloadermiddlewares.cookies.CookiesMiddleware',
'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware',
'scrapy.downloadermiddlewares.stats.DownloaderStats']
2026-06-16 05:40:39 [scrapy.middleware] INFO: Enabled spider middlewares:
['scrapy.spidermiddlewares.start.StartSpiderMiddleware',
'scrapy.spidermiddlewares.httperror.HttpErrorMiddleware',
'scrapy_zyte_api.ScrapyZyteAPISpiderMiddleware',
'scrapy.spidermiddlewares.referer.RefererMiddleware',
'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware',
'scrapy.spidermiddlewares.depth.DepthMiddleware',
'scrapy_zyte_api.ScrapyZyteAPIRefererSpiderMiddleware']
2026-06-16 05:40:39 [scrapy.middleware] INFO: Enabled item pipelines:
['event_scrapers.pipelines.EventScrapersPipeline']
2026-06-16 05:40:39 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/pipelines/__init__.py:47: ScrapyDeprecationWarning: EventScrapersPipeline.process_item() requires a spider argument, this is deprecated and the argument will not be passed in future Scrapy versions. If you need to access the spider instance you can save the crawler instance passed to from_crawler() and use its spider attribute.
self._check_mw_method_spider_arg(pipe.process_item)
2026-06-16 05:40:39 [scrapy.core.engine] INFO: Spider opened
2026-06-16 05:40:39 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/core/spidermw.py:490: ScrapyDeprecationWarning: event_scrapers.spiders.curtain_call.CurtainCallSpider defines the deprecated start_requests() method. start_requests() has been deprecated in favor of a new method, start(), to support asynchronous code execution. start_requests() will stop being called in a future version of Scrapy. If you use Scrapy 2.13 or higher only, replace start_requests() with start(); note that start() is a coroutine (async def). If you need to maintain compatibility with lower Scrapy versions, when overriding start_requests() in a spider class, override start() as well; you can use super() to reuse the inherited start() implementation without copy-pasting. See the release notes of Scrapy 2.13 for details: https://docs.scrapy.org/en/2.13/news.html
warn(
2026-06-16 05:40:39 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
2026-06-16 05:40:39 [scrapy.extensions.telnet] INFO: Telnet console listening on 127.0.0.1:6023
2026-06-16 05:40:39 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 05:40:41 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://curtaincallinc.com/> (referer: None) ['zyte-api']
2026-06-16 05:40:41 [zyte_api._retry] DEBUG: Starting call to 'zyte_api._async.AsyncZyteAPI.get.<locals>.request', this is the 1st time calling it.
2026-06-16 05:40:50 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://curtaincallstamford.ludus.com/index.php> (referer: None) ['zyte-api']
2026-06-16 05:40:50 [urllib3.connectionpool] DEBUG: Starting new HTTP connection (1): 144.91.120.141:80
2026-06-16 05:40:50 [urllib3.connectionpool] DEBUG: http://144.91.120.141:80 "POST /api/v1/raw-events/ HTTP/1.1" 201 24528
2026-06-16 05:40:50 [scrapy.core.scraper] DEBUG: Scraped from <200 https://curtaincallstamford.ludus.com/index.php>
{'event_url': 'https://curtaincallstamford.ludus.com/index.php#comedy_show_25_26_motts_pelazza',
'platform': 'Curtain Call',
'platform_hash': 'a7f21d4af8d3530735eb1fbd49151d9c',
'raw_body': '<div class="show_item" data-show-id="200485602" '
'data-event-categories="" data-cy="reserved_event">\n'
' '
'<div class="show_item_category_pills" style="display:none;">\n'
' '
'</div>\n'
' '
'<div class="show_item_cover_photo" '
'style="background-image:url(\'https://ludus.nyc3.digitaloceanspaces.com/1750013532.png\');"></div>\n'
' '
'<h2 class="show_item_title" style="">\n'
' COMEDY SHOW 25-26 MOTTS '
'PELAZZA '
'</h2>\n'
'\n'
' <div class="clear"></div>\n'
'\n'
'\n'
'\n'
' <div '
'class="show_about_buttons">\n'
Tail
' '
'</div>\n'
'\n'
' <div class="clear"></div>\n'
' </div>'}
2026-06-16 05:40:52 [urllib3.connectionpool] DEBUG: Starting new HTTP connection (1): 144.91.120.141:80
2026-06-16 05:40:52 [urllib3.connectionpool] DEBUG: http://144.91.120.141:80 "POST /api/v1/raw-events/ HTTP/1.1" 201 9021
2026-06-16 05:40:52 [scrapy.core.scraper] DEBUG: Scraped from <200 https://curtaincallstamford.ludus.com/index.php>
{'event_url': 'https://curtaincallstamford.ludus.com/index.php#anywhere_any_time_free_concert_reading_of_a_new_musical',
'platform': 'Curtain Call',
'platform_hash': 'a7f21d4af8d3530735eb1fbd49151d9c',
'raw_body': '<div class="show_item" data-show-id="200530351" '
'data-event-categories="" data-cy="reserved_event">\n'
' '
'<div class="show_item_category_pills" style="display:none;">\n'
' '
'</div>\n'
' '
'<div class="show_item_cover_photo" '
'style="background-image:url(\'https://ludus.nyc3.digitaloceanspaces.com/1779338533.jpg\');"></div>\n'
' '
'<h2 class="show_item_title" style="">\n'
' Anywhere. Any Time (Free '
'concert reading of a new '
'musical) '
'</h2>\n'
'\n'
' <div class="clear"></div>\n'
'\n'
'\n'
'\n'
' <div '
'class="show_about_buttons">\n'
' '
'<a href="show_location.php?show_id=200530351">\n'
' <div '
'class="show_about_button directions_button" '
'style="width:100%;">\n'
' <i class="fa '
'fa-map-marker"></i> Get '
'Directions '
'</div>\n'
' </a>\n'
' \n'
' </div>\n'
'\n'
' \n'
' \n'
' <div class="clear"></div>\n'
'\n'
' '
'<div class="collapsed-showtimes-container">\n'
' \n'
' \n'
' <div '
'class="showtimes_item " id="showtimes_item320063" '
'data-showtime-id="320063" data-past-date="1" data-on-off="0" '
'data-waitlist-active="0" data-manual-soldout="0" '
'style="display:none;">\n'
' '
'<div class="admin_showtimes_item_title" '
'style="line-height:30px;">\n'
'\n'
' <div '
'class="desktop_copy">\n'
' '
'<span class="span_link" style="color:#757778 !important;">\n'
' '
'Monday, June 8, '
'2026 '
'<span style="color:#777777;">\n'
' \xa0 '
'7:30 '
'PM '
'</span>\n'
' '
'</span>\n'
' '
'</div>\n'
' <div '
'class="mobile_copy" style="display:none; margin-bottom:8px;">\n'
' '
'<span class="span_link" style="color:#757778 !important;">\n'
' '
'Monday, June 8, '
'2026 '
'<span style="color:#777777;">\n'
' \xa0 '
'7:30 '
'PM '
'</span>\n'
' '
'</span>\n'
' '
'</div>\n'
' '
'</div>\n'
' <div '
'class="skeleton-loader"></div>\n'
' <div '
'class="admin_showtimes_item_actions" '
'style="border:0px;background-color:transparent;box-shadow:none;padding-right:0px;padding-left:0px;align-items:stretch;display:none;flex-direction:row-reverse;">\n'
' <label '
'for="showtime_radio320063" '
'style="position:absolute;opacity:0;width:10px;height:10px;">Monday, '
'June 8, 2026 7:30 PM</label>\n'
' <input '
'type="radio" name="time_id" value="320063" '
'id="showtime_radio320063" class="time_radio_button" '
'style="display:none;">\n'
' '
'<div class="showtimes_item_get_tickets_button" '
'style="width:100%;border-top-left-radius: '
'5px;border-bottom-left-radius: 5px;">\n'
' '
'Get '
'Tickets '
'</div>\n'
' \n'
' </div>\n'
'\n'
' <div '
'class="red_span sold_out_span" style="float:right;">\n'
' <i '
'class="fa-solid fa-triangle-exclamation"></i> Sold '
'Out </div>\n'
' \n'
' <div '
'class="clear"></div>\n'
' </div>\n'
' \n'
' \n'
' '
'<div '
'style="font-size:14px;padding:20px;text-align:center;color:#757778;">\n'
' <i '
'class="fa-solid fa-circle-exclamation"></i> This event is over\n'
' </div>\n'
' \n'
' '
'</div>\n'
' \n'
' '
'<div '
'style="font-size:14px;padding:20px;text-align:center;color:#757778;">\n'
' <i '
'class="fa-solid fa-circle-exclamation"></i> This event is over\n'
' </div>\n'
' \n'
' <div '
'id="access-list-modal">\n'
' '
'</div>\n'
'\n'
' <div class="clear"></div>\n'
' </div>'}
2026-06-16 05:40:52 [scrapy.core.engine] INFO: Closing spider (finished)
2026-06-16 05:40:52 [scrapy.extensions.feedexport] INFO: Stored csv feed (18 items) in: output/2026/06/16/curtain_call.csv
2026-06-16 05:40:52 [scrapy.statscollectors] INFO: Dumping Scrapy stats:
{'downloader/request_bytes': 466,
'downloader/request_count': 2,
'downloader/request_method_count/GET': 2,
'downloader/response_bytes': 1774099,
'downloader/response_count': 2,
'downloader/response_status_count/200': 2,
'elapsed_time_seconds': 12.996916,
'feedexport/success_count/FileFeedStorage': 1,
'finish_reason': 'finished',
'finish_time': datetime.datetime(2026, 6, 16, 3, 40, 52, 299280, tzinfo=datetime.timezone.utc),
'item_scraped_count': 18,
'items_per_minute': 90.0,
'log_count/DEBUG': 58,
'log_count/ERROR': 5,
'log_count/INFO': 3,
'memusage/max': 93396992,
'memusage/startup': 93396992,
'request_depth_max': 1,
'response_received_count': 2,
'responses_per_minute': 10.0,
'scheduler/dequeued': 2,
'scheduler/dequeued/memory': 2,
'scheduler/enqueued': 2,
'scheduler/enqueued/memory': 2,
'scrapy-zyte-api/429': 0,
'scrapy-zyte-api/attempts': 2,
'scrapy-zyte-api/error_ratio': 0.0,
'scrapy-zyte-api/errors': 0,
'scrapy-zyte-api/fatal_errors': 0,
'scrapy-zyte-api/mean_connection_seconds': 5.036509300582111,
'scrapy-zyte-api/mean_response_seconds': 5.410627946257591,
'scrapy-zyte-api/processed': 2,
'scrapy-zyte-api/request_args/httpResponseBody': 2,
'scrapy-zyte-api/request_args/httpResponseHeaders': 2,
'scrapy-zyte-api/request_args/url': 2,
'scrapy-zyte-api/status_codes/200': 2,
'scrapy-zyte-api/success': 2,
'scrapy-zyte-api/success_ratio': 1.0,
'scrapy-zyte-api/throttle_ratio': 0.0,
'start_time': datetime.datetime(2026, 6, 16, 3, 40, 39, 302364, tzinfo=datetime.timezone.utc)}
2026-06-16 05:40:52 [scrapy.core.engine] INFO: Spider closed (finished)