- Analysis
- Categories
- Logs
- Crawler.stats
-
project event_scrapers spider district_music_hall job ba31e6d2693a11f18a2c0050565fa5d9 first_log_time 2026-06-16 06:20:45 latest_log_time 2026-06-16 06:21:11 runtime 0:00:26 crawled_pages 20 scraped_items 19 shutdown_reason N/A finish_reason finished log_critical_count 0 log_error_count 0 log_warning_count 2 log_redirect_count 19 log_retry_count 0 log_ignore_count 0 latest_crawl latest_scrape latest_log current_time latest_item {'event_url': 'https://districtmusichall.com/info-page-sg/e/the-wood-brothers-1982796051365/', 'platform': 'District music hall', 'platform_hash': '6b4f271fdfa7af47c0eb25b6ae68b653', 'raw_body': '<section class="wfea default collegestreetmusichall-v2 sg ' 'sg-details"><article class="sg-details sg__event status-live ' 'city-norwalk region-ct country-us event__public ' 'event__available">\n' ' <div class="sg__hero">\n' '\t\t <figure class="">\n' '\t\t<a id="wfea-popup-img-6a30cf3574eec-1982796051365" ' 'href="https://www.eventbrite.com/e/the-wood-brothers-tickets-1982796051365" ' 'rel="bookmark"><img decoding="async" class="wp-post-image" ' 'src="https://img.evbuc.com/https%3A%2F%2Fcdn.evbuc.com%2Fimages%2F1177490937%2F533131535899%2F1%2Foriginal.20260213-185847?crop=focalpoint&fit=crop&h=200&w=450&auto=format%2Ccompress&q=75&sharp=10&fp-x=0.5&fp-y=0.5&s=1c774c58288b8edf9de1fe4f68c96726" ' 'alt="The Wood Brothers"></a> </figure>\n' '\t </div>\n' ' <div class="sg__content-wrap">\n' ' <div class="sg__head-group">\n' '\t\t\t\t\t<div class="sg__presented-by presented-by">Premier ' 'Concerts and Manic Presents:</div>\n' '\t\t<h2 class="sg__title wfea-header__title entry-title ">\n' '\t<a id="wfea-popup-title-6a30cf3574eec-1982796051365" ' 'href="https://www.eventbrite.com/e/the-wood-brothers-tickets-1982796051365" ' 'title="Eventbrite link to The Wood Brothers" rel="bookmark">The ' 'Wood Brothers</a></h2>\n' '\n' '<div class="sg__summary">\n' '\twith Viv & Riley</div> </div>\n' '\n' ' \n' ' <div class="sg__content-group">\n' ' \t <time class="sg__head-date">\n' '\t\t<time class="eaw-time published" ' 'datetime="2026-06-16T20:00:00+00:00">Tue 6.16.26</time> ' '</time>\n' '\t\t\t<div class="sg__door-time door-time">Doors: 7:00 pm | ' 'Show: 8:00 pm</div>\n' '\t\t <div class="sg__age-resriction age-restriction">All ' 'Ages</div>\n' '\t\t <div class="sg__location location">\n' ' District Music Hall<div class="city-region">Norwalk, ' 'CT</div> </div>\n' ' </div>\n' '\n' ' <div class="sg__cta">\n' '\t <div class="sg__cta-wrap">\n' '\t\t\t\t\n' '\t\t\t\t<div class="sg__buttons">\n' '\t\t\t\t\t <div class="sg__booknow booknow ">\n' '\t\t<a id="wfea-popup-booknow-6a30cf3574eec-1982796051365" ' 'href="https://www.eventbrite.com/e/the-wood-brothers-tickets-1982796051365" ' 'aria-label="TICKETS » on Eventbrite for Event Detail Page" ' 'class="book-now__link"><button>TICKETS »</button></a> </div>\n' '\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t</div>\n' '\n' '\t\t\t\t\t </div>\n' '\t </div>\n' ' </div>\n' '</article>\n' '\n' '<summary class="sg-details-summary">\n' '\t<div class="sg__content entry-content">\n' ' <div class="sg__excerpt excerpt">\n' '\t\t<div>with Viv & Riley</div><div style="margin-top: ' '20px"><div style="margin: 20px 10px;font-size: 15px;line-height: ' '22px;font-weight: 400;text-align: left"><p>This event is General ' 'Admission Standing Room Only on the Floor, and Reserved Seated ' 'in the Balcony.</p><p>The Wood Brothers have partnered with ' 'American Friends of Canadian Conservation so that $1 per ticket ' 'will support The Nature Trust of British Columbia (NTBC) in ' 'their efforts to conserve ecologically-rich wetlands and protect ' 'irreplaceable land from development. Every $1 donated will be ' 'matched by the U.S. Fish and Wildlife Service with $2 so more ' 'endangered wetlands can be saved. If you’d like to learn more, ' 'please visit <a ' 'href="https://conservecanada.org/portfolio-item/the-nature-trust-of-british-columbia/" ' 'title="https://conservecanada.org/portfolio-item/the-nature-trust-of-british-columbia/" ' 'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">this link</a>.</p></div><div style="margin: 20px ' '10px;font-size: 15px;line-height: 22px;font-weight: ' '400;text-align: left"><h3>THE WOOD BROTHERS</h3><p>Dubbed ' '"masters of soulful folk" by Paste, The Wood Brothers formed ' 'after brothers Chris and Oliver Wood pursued separate musical ' 'careers for 15 years. Chris already had legions of devoted fans ' 'for his incomparable work as one-third of Medeski Martin & ' 'Wood, while Oliver’s band King Johnson built a loyal following ' 'in the South. With drummer Jano Rix added as a permanent third ' 'member, The Wood Brothers have evolved into one of roots music’s ' 'most revered acts, playing sold out shows across North America, ' 'garnering a Grammy Award nomination and releasing nine studio ' 'albums, including their forthcoming release, <em></em><em>Puff ' 'of Smoke</em>, out August ' '1.</p><p><strong></strong><strong>Links: </strong><a ' 'href="https://www.thewoodbros.com/" ' 'title="https://www.thewoodbros.com/" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Official Website</a> | <a ' 'href="https://www.facebook.com/thewoodbrothers" ' 'title="https://www.facebook.com/thewoodbrothers" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Facebook</a> | <a ' 'href="https://www.instagram.com/thewoodbros/" ' 'title="https://www.instagram.com/thewoodbros/" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Instagram</a> | <a ' 'href="https://twitter.com/thewoodbrothers" ' 'title="https://twitter.com/thewoodbrothers" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Twitter</a> | <a ' 'href="https://open.spotify.com/artist/6FxuPrpa8phaP3Xn73emhT?autoplay=true" ' 'title="https://open.spotify.com/artist/6FxuPrpa8phaP3Xn73emhT?autoplay=true" ' 'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Spotify</a></p></div></div> </div>\n' '</div></summary> </section>'} -
-
warning_logs2 in total
2026-06-16 06:20:46 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/pipelines/__init__.py:47: ScrapyDeprecationWarning: EventScrapersPipeline.process_item() requires a spider argument, this is deprecated and the argument will not be passed in future Scrapy versions. If you need to access the spider instance you can save the crawler instance passed to from_crawler() and use its spider attribute. self._check_mw_method_spider_arg(pipe.process_item)
2026-06-16 06:20:46 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/core/spidermw.py:490: ScrapyDeprecationWarning: event_scrapers.spiders.district_music_hall.ListingSpider defines the deprecated start_requests() method. start_requests() has been deprecated in favor of a new method, start(), to support asynchronous code execution. start_requests() will stop being called in a future version of Scrapy. If you use Scrapy 2.13 or higher only, replace start_requests() with start(); note that start() is a coroutine (async def). If you need to maintain compatibility with lower Scrapy versions, when overriding start_requests() in a spider class, override start() as well; you can use super() to reuse the inherited start() implementation without copy-pasting. See the release notes of Scrapy 2.13 for details: https://docs.scrapy.org/en/2.13/news.html warn(
WARNING+
-
redirect_logs19 in total
2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/cat-power-the-greatest-tour-1983549974369/> from <GET https://districtmusichall.com/info-page-sg/e/cat-power-the-greatest-tour-1983549974369>
2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/chat-pile-who-loves-the-sun-tour-2026-1991379538807/> from <GET https://districtmusichall.com/info-page-sg/e/chat-pile-who-loves-the-sun-tour-2026-1991379538807>
2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/the-dip-1984058251640/> from <GET https://districtmusichall.com/info-page-sg/e/the-dip-1984058251640>
2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/the-damned-final-damnation-50-1987541099933/> from <GET https://districtmusichall.com/info-page-sg/e/the-damned-final-damnation-50-1987541099933>
2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/evan-honer-it-s-an-even-longer-road-tour-1991488909939/> from <GET https://districtmusichall.com/info-page-sg/e/evan-honer-it-s-an-even-longer-road-tour-1991488909939>
2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/all-them-witches-21-shows-21-days-1988020814773/> from <GET https://districtmusichall.com/info-page-sg/e/all-them-witches-21-shows-21-days-1988020814773>
2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/deer-tick-1984571393461/> from <GET https://districtmusichall.com/info-page-sg/e/deer-tick-1984571393461>
2026-06-16 06:20:50 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/the-aquabats-1985445871047/> from <GET https://districtmusichall.com/info-page-sg/e/the-aquabats-1985445871047>
2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/nekrogoblikon-1991385987094/> from <GET https://districtmusichall.com/info-page-sg/e/nekrogoblikon-1991385987094>
2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/hatebreed-summer-slaughter-tour-2026-1990042491663/> from <GET https://districtmusichall.com/info-page-sg/e/hatebreed-summer-slaughter-tour-2026-1990042491663>
2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/bop-to-the-top-1988339580209/> from <GET https://districtmusichall.com/info-page-sg/e/bop-to-the-top-1988339580209>
2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/fruit-bats-1985461285151/> from <GET https://districtmusichall.com/info-page-sg/e/fruit-bats-1985461285151>
2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/the-church-the-singles-1980-2025-1982455247010/> from <GET https://districtmusichall.com/info-page-sg/e/the-church-the-singles-1980-2025-1982455247010>
2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/dakhabrakha-1990004793908/> from <GET https://districtmusichall.com/info-page-sg/e/dakhabrakha-1990004793908>
2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/jordan-jensen-1989611193639/> from <GET https://districtmusichall.com/info-page-sg/e/jordan-jensen-1989611193639>
2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/bertha-grateful-drag-1989098722825/> from <GET https://districtmusichall.com/info-page-sg/e/bertha-grateful-drag-1989098722825>
2026-06-16 06:21:06 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/the-wood-brothers-1982796051365/> from <GET https://districtmusichall.com/info-page-sg/e/the-wood-brothers-1982796051365>
2026-06-16 06:21:06 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/pup-june-2026-1983791130674/> from <GET https://districtmusichall.com/info-page-sg/e/pup-june-2026-1983791130674>
2026-06-16 06:21:06 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/hamilton-leithauser-summer-2026-tour-1988963792244/> from <GET https://districtmusichall.com/info-page-sg/e/hamilton-leithauser-summer-2026-tour-1988963792244>
INFO
-
scrapy_version
2.14.1
-
telnet_console
127.0.0.1:6023
-
telnet_password
b96bd3219e0ed7e1
-
latest_crawl
2026-06-16 06:21:11 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://districtmusichall.com/info-page-sg/e/the-wood-brothers-1982796051365/> (referer: https://districtmusichall.com/)
-
latest_stat
2026-06-16 06:20:46 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
-
latest_scrape
2026-06-16 06:21:11 [scrapy.core.scraper] DEBUG: Scraped from <200 https://districtmusichall.com/info-page-sg/e/the-wood-brothers-1982796051365/>
-
latest_item
{'event_url': 'https://districtmusichall.com/info-page-sg/e/the-wood-brothers-1982796051365/', 'platform': 'District music hall', 'platform_hash': '6b4f271fdfa7af47c0eb25b6ae68b653', 'raw_body': '<section class="wfea default collegestreetmusichall-v2 sg ' 'sg-details"><article class="sg-details sg__event status-live ' 'city-norwalk region-ct country-us event__public ' 'event__available">\n' ' <div class="sg__hero">\n' '\t\t <figure class="">\n' '\t\t<a id="wfea-popup-img-6a30cf3574eec-1982796051365" ' 'href="https://www.eventbrite.com/e/the-wood-brothers-tickets-1982796051365" ' 'rel="bookmark"><img decoding="async" class="wp-post-image" ' 'src="https://img.evbuc.com/https%3A%2F%2Fcdn.evbuc.com%2Fimages%2F1177490937%2F533131535899%2F1%2Foriginal.20260213-185847?crop=focalpoint&fit=crop&h=200&w=450&auto=format%2Ccompress&q=75&sharp=10&fp-x=0.5&fp-y=0.5&s=1c774c58288b8edf9de1fe4f68c96726" ' 'alt="The Wood Brothers"></a> </figure>\n' '\t </div>\n' ' <div class="sg__content-wrap">\n' ' <div class="sg__head-group">\n' '\t\t\t\t\t<div class="sg__presented-by presented-by">Premier ' 'Concerts and Manic Presents:</div>\n' '\t\t<h2 class="sg__title wfea-header__title entry-title ">\n' '\t<a id="wfea-popup-title-6a30cf3574eec-1982796051365" ' 'href="https://www.eventbrite.com/e/the-wood-brothers-tickets-1982796051365" ' 'title="Eventbrite link to The Wood Brothers" rel="bookmark">The ' 'Wood Brothers</a></h2>\n' '\n' '<div class="sg__summary">\n' '\twith Viv & Riley</div> </div>\n' '\n' ' \n' ' <div class="sg__content-group">\n' ' \t <time class="sg__head-date">\n' '\t\t<time class="eaw-time published" ' 'datetime="2026-06-16T20:00:00+00:00">Tue 6.16.26</time> ' '</time>\n' '\t\t\t<div class="sg__door-time door-time">Doors: 7:00 pm | ' 'Show: 8:00 pm</div>\n' '\t\t <div class="sg__age-resriction age-restriction">All ' 'Ages</div>\n' '\t\t <div class="sg__location location">\n' ' District Music Hall<div class="city-region">Norwalk, ' 'CT</div> </div>\n' ' </div>\n' '\n' ' <div class="sg__cta">\n' '\t <div class="sg__cta-wrap">\n' '\t\t\t\t\n' '\t\t\t\t<div class="sg__buttons">\n' '\t\t\t\t\t <div class="sg__booknow booknow ">\n' '\t\t<a id="wfea-popup-booknow-6a30cf3574eec-1982796051365" ' 'href="https://www.eventbrite.com/e/the-wood-brothers-tickets-1982796051365" ' 'aria-label="TICKETS » on Eventbrite for Event Detail Page" ' 'class="book-now__link"><button>TICKETS »</button></a> </div>\n' '\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t</div>\n' '\n' '\t\t\t\t\t </div>\n' '\t </div>\n' ' </div>\n' '</article>\n' '\n' '<summary class="sg-details-summary">\n' '\t<div class="sg__content entry-content">\n' ' <div class="sg__excerpt excerpt">\n' '\t\t<div>with Viv & Riley</div><div style="margin-top: ' '20px"><div style="margin: 20px 10px;font-size: 15px;line-height: ' '22px;font-weight: 400;text-align: left"><p>This event is General ' 'Admission Standing Room Only on the Floor, and Reserved Seated ' 'in the Balcony.</p><p>The Wood Brothers have partnered with ' 'American Friends of Canadian Conservation so that $1 per ticket ' 'will support The Nature Trust of British Columbia (NTBC) in ' 'their efforts to conserve ecologically-rich wetlands and protect ' 'irreplaceable land from development. Every $1 donated will be ' 'matched by the U.S. Fish and Wildlife Service with $2 so more ' 'endangered wetlands can be saved. If you’d like to learn more, ' 'please visit <a ' 'href="https://conservecanada.org/portfolio-item/the-nature-trust-of-british-columbia/" ' 'title="https://conservecanada.org/portfolio-item/the-nature-trust-of-british-columbia/" ' 'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">this link</a>.</p></div><div style="margin: 20px ' '10px;font-size: 15px;line-height: 22px;font-weight: ' '400;text-align: left"><h3>THE WOOD BROTHERS</h3><p>Dubbed ' '"masters of soulful folk" by Paste, The Wood Brothers formed ' 'after brothers Chris and Oliver Wood pursued separate musical ' 'careers for 15 years. Chris already had legions of devoted fans ' 'for his incomparable work as one-third of Medeski Martin & ' 'Wood, while Oliver’s band King Johnson built a loyal following ' 'in the South. With drummer Jano Rix added as a permanent third ' 'member, The Wood Brothers have evolved into one of roots music’s ' 'most revered acts, playing sold out shows across North America, ' 'garnering a Grammy Award nomination and releasing nine studio ' 'albums, including their forthcoming release, <em></em><em>Puff ' 'of Smoke</em>, out August ' '1.</p><p><strong></strong><strong>Links: </strong><a ' 'href="https://www.thewoodbros.com/" ' 'title="https://www.thewoodbros.com/" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Official Website</a> | <a ' 'href="https://www.facebook.com/thewoodbrothers" ' 'title="https://www.facebook.com/thewoodbrothers" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Facebook</a> | <a ' 'href="https://www.instagram.com/thewoodbros/" ' 'title="https://www.instagram.com/thewoodbros/" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Instagram</a> | <a ' 'href="https://twitter.com/thewoodbrothers" ' 'title="https://twitter.com/thewoodbrothers" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Twitter</a> | <a ' 'href="https://open.spotify.com/artist/6FxuPrpa8phaP3Xn73emhT?autoplay=true" ' 'title="https://open.spotify.com/artist/6FxuPrpa8phaP3Xn73emhT?autoplay=true" ' 'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Spotify</a></p></div></div> </div>\n' '</div></summary> </section>'}
DEBUG
-
-
-
Head
2026-06-16 06:20:45 [scrapy.utils.log] INFO: Scrapy 2.14.1 started (bot: event_scrapers) 2026-06-16 06:20:45 [scrapy.utils.log] INFO: Versions: {'lxml': '6.0.2', 'libxml2': '2.14.6', 'cssselect': '1.3.0', 'parsel': '1.10.0', 'w3lib': '2.0.0', 'Twisted': '25.5.0', 'Python': '3.12.3 (main, Mar 23 2026, 19:04:32) [GCC 13.3.0]', 'pyOpenSSL': '25.3.0 (OpenSSL 3.5.4 30 Sep 2025)', 'cryptography': '46.0.3', 'Platform': 'Linux-6.8.0-90-generic-x86_64-with-glibc2.39'} 2026-06-16 06:20:45 [scrapy.crawler] DEBUG: Using AsyncCrawlerProcess 2026-06-16 06:20:45 [asyncio] DEBUG: Using selector: EpollSelector 2026-06-16 06:20:45 [scrapy.addons] INFO: Enabled addons: [] 2026-06-16 06:20:46 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.asyncioreactor.AsyncioSelectorReactor 2026-06-16 06:20:46 [scrapy.utils.log] DEBUG: Using asyncio event loop: asyncio.unix_events._UnixSelectorEventLoop 2026-06-16 06:20:46 [scrapy.extensions.telnet] INFO: Telnet Password: b96bd3219e0ed7e1 2026-06-16 06:20:46 [scrapy.middleware] INFO: Enabled extensions: ['scrapy.extensions.corestats.CoreStats', 'scrapy.extensions.logcount.LogCount', 'scrapy.extensions.telnet.TelnetConsole', 'scrapy.extensions.memusage.MemoryUsage', 'scrapy.extensions.feedexport.FeedExporter', 'scrapy.extensions.logstats.LogStats'] 2026-06-16 06:20:46 [scrapy.crawler] INFO: Overridden settings: {'BOT_NAME': 'event_scrapers', 'FEED_EXPORT_ENCODING': 'utf-8', 'FEED_URI_PARAMS': <function _feed_uri_params at 0x7e65cbe2c540>, 'LOG_FILE': '/root/event-list-scraping/logs/event_scrapers/district_music_hall/ba31e6d2693a11f18a2c0050565fa5d9.log', 'NEWSPIDER_MODULE': 'event_scrapers.spiders', 'REQUEST_FINGERPRINTER_CLASS': 'scrapy_zyte_api.ScrapyZyteAPIRequestFingerprinter', 'SPIDER_MODULES': ['event_scrapers.spiders']} 2026-06-16 06:20:46 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with 'ff9baec' 2026-06-16 06:20:46 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with 'ff9baec' 2026-06-16 06:20:46 [scrapy.middleware] INFO: Enabled downloader middlewares: ['scrapy.downloadermiddlewares.offsite.OffsiteMiddleware', 'scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware', 'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware', 'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware', 'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware', 'scrapy.downloadermiddlewares.retry.RetryMiddleware', 'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware', 'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware', 'scrapy.downloadermiddlewares.redirect.RedirectMiddleware', 'scrapy_zyte_api.ScrapyZyteAPIDownloaderMiddleware', 'scrapy.downloadermiddlewares.cookies.CookiesMiddleware', 'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware', 'scrapy.downloadermiddlewares.stats.DownloaderStats'] 2026-06-16 06:20:46 [scrapy.middleware] INFO: Enabled spider middlewares: ['scrapy.spidermiddlewares.start.StartSpiderMiddleware', 'scrapy.spidermiddlewares.httperror.HttpErrorMiddleware', 'scrapy_zyte_api.ScrapyZyteAPISpiderMiddleware', 'scrapy.spidermiddlewares.referer.RefererMiddleware', 'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware', 'scrapy.spidermiddlewares.depth.DepthMiddleware', 'scrapy_zyte_api.ScrapyZyteAPIRefererSpiderMiddleware'] 2026-06-16 06:20:46 [scrapy.middleware] INFO: Enabled item pipelines: ['event_scrapers.pipelines.EventScrapersPipeline'] 2026-06-16 06:20:46 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/pipelines/__init__.py:47: ScrapyDeprecationWarning: EventScrapersPipeline.process_item() requires a spider argument, this is deprecated and the argument will not be passed in future Scrapy versions. If you need to access the spider instance you can save the crawler instance passed to from_crawler() and use its spider attribute. self._check_mw_method_spider_arg(pipe.process_item) 2026-06-16 06:20:46 [scrapy.core.engine] INFO: Spider opened 2026-06-16 06:20:46 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/core/spidermw.py:490: ScrapyDeprecationWarning: event_scrapers.spiders.district_music_hall.ListingSpider defines the deprecated start_requests() method. start_requests() has been deprecated in favor of a new method, start(), to support asynchronous code execution. start_requests() will stop being called in a future version of Scrapy. If you use Scrapy 2.13 or higher only, replace start_requests() with start(); note that start() is a coroutine (async def). If you need to maintain compatibility with lower Scrapy versions, when overriding start_requests() in a spider class, override start() as well; you can use super() to reuse the inherited start() implementation without copy-pasting. See the release notes of Scrapy 2.13 for details: https://docs.scrapy.org/en/2.13/news.html warn( 2026-06-16 06:20:46 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min) 2026-06-16 06:20:46 [scrapy.extensions.telnet] INFO: Telnet console listening on 127.0.0.1:6023 2026-06-16 06:20:46 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://districtmusichall.com/> (referer: None) 2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/cat-power-the-greatest-tour-1983549974369/> from <GET https://districtmusichall.com/info-page-sg/e/cat-power-the-greatest-tour-1983549974369> 2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/chat-pile-who-loves-the-sun-tour-2026-1991379538807/> from <GET https://districtmusichall.com/info-page-sg/e/chat-pile-who-loves-the-sun-tour-2026-1991379538807> 2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/the-dip-1984058251640/> from <GET https://districtmusichall.com/info-page-sg/e/the-dip-1984058251640> 2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/the-damned-final-damnation-50-1987541099933/> from <GET https://districtmusichall.com/info-page-sg/e/the-damned-final-damnation-50-1987541099933> 2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/evan-honer-it-s-an-even-longer-road-tour-1991488909939/> from <GET https://districtmusichall.com/info-page-sg/e/evan-honer-it-s-an-even-longer-road-tour-1991488909939> 2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/all-them-witches-21-shows-21-days-1988020814773/> from <GET https://districtmusichall.com/info-page-sg/e/all-them-witches-21-shows-21-days-1988020814773> 2026-06-16 06:20:49 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/deer-tick-1984571393461/> from <GET https://districtmusichall.com/info-page-sg/e/deer-tick-1984571393461> 2026-06-16 06:20:50 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/the-aquabats-1985445871047/> from <GET https://districtmusichall.com/info-page-sg/e/the-aquabats-1985445871047> 2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/nekrogoblikon-1991385987094/> from <GET https://districtmusichall.com/info-page-sg/e/nekrogoblikon-1991385987094> 2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/hatebreed-summer-slaughter-tour-2026-1990042491663/> from <GET https://districtmusichall.com/info-page-sg/e/hatebreed-summer-slaughter-tour-2026-1990042491663> 2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/bop-to-the-top-1988339580209/> from <GET https://districtmusichall.com/info-page-sg/e/bop-to-the-top-1988339580209> 2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/fruit-bats-1985461285151/> from <GET https://districtmusichall.com/info-page-sg/e/fruit-bats-1985461285151> 2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/the-church-the-singles-1980-2025-1982455247010/> from <GET https://districtmusichall.com/info-page-sg/e/the-church-the-singles-1980-2025-1982455247010> 2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/dakhabrakha-1990004793908/> from <GET https://districtmusichall.com/info-page-sg/e/dakhabrakha-1990004793908> 2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/jordan-jensen-1989611193639/> from <GET https://districtmusichall.com/info-page-sg/e/jordan-jensen-1989611193639> 2026-06-16 06:20:51 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to <GET https://districtmusichall.com/info-page-sg/e/bertha-grateful-drag-1989098722825/> from <GET https://districtmusichall.com/info-page-sg/e/bertha-grateful-drag-1989098722825> 2026-06-16 06:20:59 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://districtmusichall.com/info-page-sg/e/the-dip-1984058251640/> (referer: https://districtmusichall.com/) 2026-06-16 06:20:59 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://districtmusichall.com/info-page-sg/e/cat-power-the-greatest-tour-1983549974369/> (referer: https://districtmusichall.com/) 2026-06-16 06:20:59 [urllib3.connectionpool] DEBUG: Starting new HTTP connection (1): 144.91.120.141:80 2026-06-16 06:20:59 [urllib3.connectionpool] DEBUG: http://144.91.120.141:80 "POST /api/v1/raw-events/ HTTP/1.1" 201 5809 2026-06-16 06:20:59 [scrapy.core.scraper] DEBUG: Scraped from <200 https://districtmusichall.com/info-page-sg/e/the-dip-1984058251640/> {'event_url': 'https://districtmusichall.com/info-page-sg/e/the-dip-1984058251640/', 'platform': 'District music hall', 'platform_hash': '6b4f271fdfa7af47c0eb25b6ae68b653', 'raw_body': '<section class="wfea default collegestreetmusichall-v2 sg ' 'sg-details"><article class="sg-details sg__event status-live ' 'city-norwalk region-ct country-us event__public ' 'event__available">\n' ' <div class="sg__hero">\n' '\t\t <figure class="">\n' -
Tail
'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Instagram</a> | <a ' 'href="https://x.com/babehavenband" ' 'title="https://x.com/babehavenband" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Twitter</a> | <a ' 'href="https://open.spotify.com/artist/0b0NRq58okVkvcHSGOzM4x?autoplay=true" ' 'title="https://open.spotify.com/artist/0b0NRq58okVkvcHSGOzM4x?autoplay=true" ' 'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Spotify</a></p></div><div style="margin: 20px ' '10px;font-size: 15px;line-height: 22px;font-weight: ' '400;text-align: left"><h3>CHEAP PERFUME</h3><p>Cheap Perfume ' "isn't here to politely ask for your attention—they're taking it, " 'champagne in hand. Formed in Colorado in 2015, this feminist ' 'punk powerhouse delivers razor-sharp riffs and unapologetically ' 'political lyrics wrapped in a glittering explosion of riot grrrl ' 'energy. Fronted by Stephanie Byrne (vocals) and Jane No ' '(guitar/vocals), with Geoff Brent on bass and David "Hott Dave" ' 'Grimm on drums, Cheap Perfume is the sound of dismantling ' 'oppression while having a good time.</p><p>Their debut album, ' '“Nailed It” (2016), threw punches at sexism, street harassment, ' 'and the absurdity of the Trump era, while “Burn It Down” (2019) ' 'blasted white supremacy, championed intersectional feminism and ' 'gave us the anti-fascist anthem “It’s Okay to Punch Nazis.” Now, ' 'Cheap Perfume returns angrier than ever with their upcoming ' "third album, “Don't Care. Didn't Ask.,” set for release in fall " '2025. Its infectious hooks amplify themes of capitalist ' 'exploitation, class solidarity, and the need for direct action ' 'and mutual aid as fascism tightens its grip in ' 'America.</p><p>With their incendiary live shows known for their ' 'community spirit and zero tolerance for bullshit, Cheap Perfume ' 'has cemented their place as one of the most vital voices in ' "punk. “Don't Care. Didn't Ask.” isn't just a record—it's a " 'rallying cry. Join the resistance; bring a ' 'bottle.</p><p><strong></strong><strong>Links:</strong> <a ' 'href="https://cheapperfume.bandcamp.com/merch" ' 'title="https://cheapperfume.bandcamp.com/merch" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Official Website</a> | <a ' 'href="https://www.facebook.com/cheapperfume719/" ' 'title="https://www.facebook.com/cheapperfume719/" ' 'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Facebook</a> | <a ' 'href="https://www.instagram.com/cheapperfumeband/?hl=en" ' 'title="https://www.instagram.com/cheapperfumeband/?hl=en" ' 'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Instagram</a> | <a ' 'href="https://open.spotify.com/artist/2vfwEEEv5PVpGMMnC5jajB?autoplay=true" ' 'title="https://open.spotify.com/artist/2vfwEEEv5PVpGMMnC5jajB?autoplay=true" ' 'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Spotify</a></p></div></div> </div>\n' '</div></summary> </section>'} 2026-06-16 06:21:11 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://districtmusichall.com/info-page-sg/e/the-wood-brothers-1982796051365/> (referer: https://districtmusichall.com/) 2026-06-16 06:21:11 [urllib3.connectionpool] DEBUG: Starting new HTTP connection (1): 144.91.120.141:80 2026-06-16 06:21:11 [urllib3.connectionpool] DEBUG: http://144.91.120.141:80 "POST /api/v1/raw-events/ HTTP/1.1" 201 5836 2026-06-16 06:21:11 [scrapy.core.scraper] DEBUG: Scraped from <200 https://districtmusichall.com/info-page-sg/e/the-wood-brothers-1982796051365/> {'event_url': 'https://districtmusichall.com/info-page-sg/e/the-wood-brothers-1982796051365/', 'platform': 'District music hall', 'platform_hash': '6b4f271fdfa7af47c0eb25b6ae68b653', 'raw_body': '<section class="wfea default collegestreetmusichall-v2 sg ' 'sg-details"><article class="sg-details sg__event status-live ' 'city-norwalk region-ct country-us event__public ' 'event__available">\n' ' <div class="sg__hero">\n' '\t\t <figure class="">\n' '\t\t<a id="wfea-popup-img-6a30cf3574eec-1982796051365" ' 'href="https://www.eventbrite.com/e/the-wood-brothers-tickets-1982796051365" ' 'rel="bookmark"><img decoding="async" class="wp-post-image" ' 'src="https://img.evbuc.com/https%3A%2F%2Fcdn.evbuc.com%2Fimages%2F1177490937%2F533131535899%2F1%2Foriginal.20260213-185847?crop=focalpoint&fit=crop&h=200&w=450&auto=format%2Ccompress&q=75&sharp=10&fp-x=0.5&fp-y=0.5&s=1c774c58288b8edf9de1fe4f68c96726" ' 'alt="The Wood Brothers"></a> </figure>\n' '\t </div>\n' ' <div class="sg__content-wrap">\n' ' <div class="sg__head-group">\n' '\t\t\t\t\t<div class="sg__presented-by presented-by">Premier ' 'Concerts and Manic Presents:</div>\n' '\t\t<h2 class="sg__title wfea-header__title entry-title ">\n' '\t<a id="wfea-popup-title-6a30cf3574eec-1982796051365" ' 'href="https://www.eventbrite.com/e/the-wood-brothers-tickets-1982796051365" ' 'title="Eventbrite link to The Wood Brothers" rel="bookmark">The ' 'Wood Brothers</a></h2>\n' '\n' '<div class="sg__summary">\n' '\twith Viv & Riley</div> </div>\n' '\n' ' \n' ' <div class="sg__content-group">\n' ' \t <time class="sg__head-date">\n' '\t\t<time class="eaw-time published" ' 'datetime="2026-06-16T20:00:00+00:00">Tue 6.16.26</time> ' '</time>\n' '\t\t\t<div class="sg__door-time door-time">Doors: 7:00 pm | ' 'Show: 8:00 pm</div>\n' '\t\t <div class="sg__age-resriction age-restriction">All ' 'Ages</div>\n' '\t\t <div class="sg__location location">\n' ' District Music Hall<div class="city-region">Norwalk, ' 'CT</div> </div>\n' ' </div>\n' '\n' ' <div class="sg__cta">\n' '\t <div class="sg__cta-wrap">\n' '\t\t\t\t\n' '\t\t\t\t<div class="sg__buttons">\n' '\t\t\t\t\t <div class="sg__booknow booknow ">\n' '\t\t<a id="wfea-popup-booknow-6a30cf3574eec-1982796051365" ' 'href="https://www.eventbrite.com/e/the-wood-brothers-tickets-1982796051365" ' 'aria-label="TICKETS » on Eventbrite for Event Detail Page" ' 'class="book-now__link"><button>TICKETS »</button></a> </div>\n' '\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t</div>\n' '\n' '\t\t\t\t\t </div>\n' '\t </div>\n' ' </div>\n' '</article>\n' '\n' '<summary class="sg-details-summary">\n' '\t<div class="sg__content entry-content">\n' ' <div class="sg__excerpt excerpt">\n' '\t\t<div>with Viv & Riley</div><div style="margin-top: ' '20px"><div style="margin: 20px 10px;font-size: 15px;line-height: ' '22px;font-weight: 400;text-align: left"><p>This event is General ' 'Admission Standing Room Only on the Floor, and Reserved Seated ' 'in the Balcony.</p><p>The Wood Brothers have partnered with ' 'American Friends of Canadian Conservation so that $1 per ticket ' 'will support The Nature Trust of British Columbia (NTBC) in ' 'their efforts to conserve ecologically-rich wetlands and protect ' 'irreplaceable land from development. Every $1 donated will be ' 'matched by the U.S. Fish and Wildlife Service with $2 so more ' 'endangered wetlands can be saved. If you’d like to learn more, ' 'please visit <a ' 'href="https://conservecanada.org/portfolio-item/the-nature-trust-of-british-columbia/" ' 'title="https://conservecanada.org/portfolio-item/the-nature-trust-of-british-columbia/" ' 'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">this link</a>.</p></div><div style="margin: 20px ' '10px;font-size: 15px;line-height: 22px;font-weight: ' '400;text-align: left"><h3>THE WOOD BROTHERS</h3><p>Dubbed ' '"masters of soulful folk" by Paste, The Wood Brothers formed ' 'after brothers Chris and Oliver Wood pursued separate musical ' 'careers for 15 years. Chris already had legions of devoted fans ' 'for his incomparable work as one-third of Medeski Martin & ' 'Wood, while Oliver’s band King Johnson built a loyal following ' 'in the South. With drummer Jano Rix added as a permanent third ' 'member, The Wood Brothers have evolved into one of roots music’s ' 'most revered acts, playing sold out shows across North America, ' 'garnering a Grammy Award nomination and releasing nine studio ' 'albums, including their forthcoming release, <em></em><em>Puff ' 'of Smoke</em>, out August ' '1.</p><p><strong></strong><strong>Links: </strong><a ' 'href="https://www.thewoodbros.com/" ' 'title="https://www.thewoodbros.com/" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Official Website</a> | <a ' 'href="https://www.facebook.com/thewoodbrothers" ' 'title="https://www.facebook.com/thewoodbrothers" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Facebook</a> | <a ' 'href="https://www.instagram.com/thewoodbros/" ' 'title="https://www.instagram.com/thewoodbros/" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Instagram</a> | <a ' 'href="https://twitter.com/thewoodbrothers" ' 'title="https://twitter.com/thewoodbrothers" target="_blank" ' 'data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Twitter</a> | <a ' 'href="https://open.spotify.com/artist/6FxuPrpa8phaP3Xn73emhT?autoplay=true" ' 'title="https://open.spotify.com/artist/6FxuPrpa8phaP3Xn73emhT?autoplay=true" ' 'target="_blank" data-msys-clicktrack="0" rel="nofollow noopener ' 'noreferrer">Spotify</a></p></div></div> </div>\n' '</div></summary> </section>'} 2026-06-16 06:21:11 [scrapy.core.engine] INFO: Closing spider (finished) 2026-06-16 06:21:11 [scrapy.extensions.feedexport] INFO: Stored csv feed (19 items) in: output/2026/06/16/district_music_hall.csv 2026-06-16 06:21:11 [scrapy.statscollectors] INFO: Dumping Scrapy stats: {'downloader/request_bytes': 12311, 'downloader/request_count': 39, 'downloader/request_method_count/GET': 39, 'downloader/response_bytes': 423435, 'downloader/response_count': 39, 'downloader/response_status_count/200': 20, 'downloader/response_status_count/301': 19, 'elapsed_time_seconds': 25.632912, 'feedexport/success_count/FileFeedStorage': 1, 'finish_reason': 'finished', 'finish_time': datetime.datetime(2026, 6, 16, 4, 21, 11, 915456, tzinfo=datetime.timezone.utc), 'httpcompression/response_bytes': 1422096, 'httpcompression/response_count': 20, 'item_scraped_count': 19, 'items_per_minute': 45.6, 'log_count/DEBUG': 96, 'log_count/INFO': 3, 'memusage/max': 93163520, 'memusage/startup': 93032448, 'request_depth_max': 1, 'response_received_count': 20, 'responses_per_minute': 48.0, 'scheduler/dequeued': 39, 'scheduler/dequeued/memory': 39, 'scheduler/enqueued': 39, 'scheduler/enqueued/memory': 39, 'start_time': datetime.datetime(2026, 6, 16, 4, 20, 46, 282544, tzinfo=datetime.timezone.utc)} 2026-06-16 06:21:11 [scrapy.core.engine] INFO: Spider closed (finished) -
Log
-
Source
http://127.0.0.1:6800/logs/event_scrapers/district_music_hall/ba31e6d2693a11f18a2c0050565fa5d9.log
-
-
source log last update time 2026-06-16 06:21:11 last update timestamp 1781583671 downloader / request bytes 12311 downloader / request count 39 downloader / request method count / GET 39 downloader / response bytes 423435 downloader / response count 39 downloader / response status count / 200 20 downloader / response status count / 301 19 elapsed time seconds 25.632912 feedexport / success count / FileFeedStorage 1 finish reason finished finish time datetime.datetime(2026, 6, 16, 4, 21, 11, 915456, tzinfo=datetime.timezone.utc) httpcompression / response bytes 1422096 httpcompression / response count 20 item scraped count 19 items per minute 45.6 log count / DEBUG 96 log count / INFO 3 memusage / max 93163520 memusage / startup 93032448 request depth max 1 response received count 20 responses per minute 48.0 scheduler / dequeued 39 scheduler / dequeued / memory 39 scheduler / enqueued 39 scheduler / enqueued / memory 39 start time datetime.datetime(2026, 6, 16, 4, 20, 46, 282544, tzinfo=datetime.timezone.utc)