-
Head
2026-06-16 09:21:00 [scrapy.utils.log] INFO: Scrapy 2.14.1 started (bot: event_scrapers)
2026-06-16 09:21:00 [scrapy.utils.log] INFO: Versions:
{'lxml': '6.0.2',
'libxml2': '2.14.6',
'cssselect': '1.3.0',
'parsel': '1.10.0',
'w3lib': '2.0.0',
'Twisted': '25.5.0',
'Python': '3.12.3 (main, Mar 23 2026, 19:04:32) [GCC 13.3.0]',
'pyOpenSSL': '25.3.0 (OpenSSL 3.5.4 30 Sep 2025)',
'cryptography': '46.0.3',
'Platform': 'Linux-6.8.0-90-generic-x86_64-with-glibc2.39'}
2026-06-16 09:21:00 [scrapy.crawler] DEBUG: Using AsyncCrawlerProcess
2026-06-16 09:21:00 [asyncio] DEBUG: Using selector: EpollSelector
2026-06-16 09:21:00 [scrapy.addons] INFO: Enabled addons:
[]
2026-06-16 09:21:00 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.asyncioreactor.AsyncioSelectorReactor
2026-06-16 09:21:00 [scrapy.utils.log] DEBUG: Using asyncio event loop: asyncio.unix_events._UnixSelectorEventLoop
2026-06-16 09:21:00 [scrapy.extensions.telnet] INFO: Telnet Password: 6d44dc9e4bbbbdd1
2026-06-16 09:21:00 [scrapy.middleware] INFO: Enabled extensions:
['scrapy.extensions.corestats.CoreStats',
'scrapy.extensions.logcount.LogCount',
'scrapy.extensions.telnet.TelnetConsole',
'scrapy.extensions.memusage.MemoryUsage',
'scrapy.extensions.feedexport.FeedExporter',
'scrapy.extensions.logstats.LogStats']
2026-06-16 09:21:00 [scrapy.crawler] INFO: Overridden settings:
{'BOT_NAME': 'event_scrapers',
'FEED_EXPORT_ENCODING': 'utf-8',
'FEED_URI_PARAMS': <function _feed_uri_params at 0x757fe6848540>,
'LOG_FILE': '/root/event-list-scraping/logs/event_scrapers/lachattown_farm/e82d8fc7695311f193310050565fa5d9.log',
'NEWSPIDER_MODULE': 'event_scrapers.spiders',
'REQUEST_FINGERPRINTER_CLASS': 'scrapy_zyte_api.ScrapyZyteAPIRequestFingerprinter',
'SPIDER_MODULES': ['event_scrapers.spiders']}
2026-06-16 09:21:00 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with 'ff9baec'
2026-06-16 09:21:00 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with 'ff9baec'
2026-06-16 09:21:00 [scrapy.middleware] INFO: Enabled downloader middlewares:
['scrapy.downloadermiddlewares.offsite.OffsiteMiddleware',
'scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware',
'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware',
'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware',
'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware',
'scrapy.downloadermiddlewares.retry.RetryMiddleware',
'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware',
'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware',
'scrapy.downloadermiddlewares.redirect.RedirectMiddleware',
'scrapy_zyte_api.ScrapyZyteAPIDownloaderMiddleware',
'scrapy.downloadermiddlewares.cookies.CookiesMiddleware',
'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware',
'scrapy.downloadermiddlewares.stats.DownloaderStats']
2026-06-16 09:21:00 [scrapy.middleware] INFO: Enabled spider middlewares:
['scrapy.spidermiddlewares.start.StartSpiderMiddleware',
'scrapy.spidermiddlewares.httperror.HttpErrorMiddleware',
'scrapy_zyte_api.ScrapyZyteAPISpiderMiddleware',
'scrapy.spidermiddlewares.referer.RefererMiddleware',
'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware',
'scrapy.spidermiddlewares.depth.DepthMiddleware',
'scrapy_zyte_api.ScrapyZyteAPIRefererSpiderMiddleware']
2026-06-16 09:21:00 [scrapy.middleware] INFO: Enabled item pipelines:
['event_scrapers.pipelines.EventScrapersPipeline']
2026-06-16 09:21:00 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/pipelines/__init__.py:47: ScrapyDeprecationWarning: EventScrapersPipeline.process_item() requires a spider argument, this is deprecated and the argument will not be passed in future Scrapy versions. If you need to access the spider instance you can save the crawler instance passed to from_crawler() and use its spider attribute.
self._check_mw_method_spider_arg(pipe.process_item)
2026-06-16 09:21:00 [scrapy.core.engine] INFO: Spider opened
2026-06-16 09:21:01 [py.warnings] WARNING: /root/.venv/lib/python3.12/site-packages/scrapy/core/spidermw.py:490: ScrapyDeprecationWarning: event_scrapers.spiders.lachattown_farm.ListingSpider defines the deprecated start_requests() method. start_requests() has been deprecated in favor of a new method, start(), to support asynchronous code execution. start_requests() will stop being called in a future version of Scrapy. If you use Scrapy 2.13 or higher only, replace start_requests() with start(); note that start() is a coroutine (async def). If you need to maintain compatibility with lower Scrapy versions, when overriding start_requests() in a spider class, override start() as well; you can use super() to reuse the inherited start() implementation without copy-pasting. See the release notes of Scrapy 2.13 for details: https://docs.scrapy.org/en/2.13/news.html
warn(
2026-06-16 09:21:01 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
2026-06-16 09:21:01 [scrapy.extensions.telnet] INFO: Telnet console listening on 127.0.0.1:6023
2026-06-16 09:21:02 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://www.lachattownfarm.org/index.php> (referer: None)
2026-06-16 09:21:02 [scrapy.downloadermiddlewares.offsite] DEBUG: Filtered offsite request to 'lachat-town-farm.jumbula.com': <GET https://lachat-town-farm.jumbula.com/2026Fall/TinySproutsFall2026>
2026-06-16 09:21:02 [scrapy_zyte_api._params] WARNING: Cookies are enabled for request <GET https://www.lachattownfarm.org/register.php?eventid=2116>, and there are cookies in the cookiejar, but ZYTE_API_EXPERIMENTAL_COOKIES_ENABLED is False, so automatic mapping will not map cookies for this or any other request. To silence this warning, disable cookies for all requests that use automatic mapping, either with the COOKIES_ENABLED setting or with the dont_merge_cookies request metadata key.
2026-06-16 09:21:03 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://www.lachattownfarm.org/register.php?eventid=2116> (referer: https://www.lachattownfarm.org/index.php)
2026-06-16 09:21:03 [urllib3.connectionpool] DEBUG: Starting new HTTP connection (1): 144.91.120.141:80
2026-06-16 09:21:03 [urllib3.connectionpool] DEBUG: http://144.91.120.141:80 "POST /api/v1/raw-events/ HTTP/1.1" 201 3141
2026-06-16 09:21:03 [scrapy.core.scraper] DEBUG: Scraped from <200 https://www.lachattownfarm.org/register.php?eventid=2116>
{'event_url': 'https://www.lachattownfarm.org/register.php?eventid=2116',
'platform': 'Lachatt own Farm',
'platform_hash': '2f1c873a619548b03f9b4cdc2146e010',
'raw_body': '<div class="col-md-12 custom-column-bg py-5 mb-1 mb-md-0 '
'appear-animation" data-appear-animation="fadeInTopShorter">\n'
'\t\t\t\t\t\t\t\t<div class="overflow-hidden mb-3">\t\t\t\t<h2 '
'class="text-color-dark font-weight-semi-bold text-6 '
'negative-ls-1 mb-0">Summer CIT Program</h2>\t\t\t\t\t\t\t</div>\n'
' <div> \t<div '
'class="row">\n'
' \t\t<div '
'class="col-md-4">\t\t\t\t\t\t\t\t<div class="owl-carousel '
'owl-theme nav-inside mb-0" data-plugin-options="{\'items\': 1, '
"'margin': 10, 'animateOut': 'fadeOut', 'autoplay': true, "
'\'autoplayTimeout\': 5000, \'loop\': true}"><div><img '
'class="img-fluid" '
'src="img/events/cit-program.jpg"></div>\t\t\t\t\t\t'
'</div>\t\t\t\t\t</div> \t\t<div '
'class="col-md-8"><h2 class="mb-1 text-5 text-primary">For rising '
'8th graders and up!</h2><p class="mb-1 '
'mr-2"><strong><strong>Summer 2026</strong></strong></p><p '
'class="">Lachat Town Farm is looking for volunteers to help '
'support the 2026 Summer Farmhands programs. The CIT program is a '
'great opportunity for teens to build their resumes, gain '
-
Tail
'class="text-decoration-none text-color-light '
'custom-fontsize-5">123-456-7890</a></span>---->\n'
'\t\t\t\t\t\t\t<span class="d-block text-color-light opacity-7 '
'custom-fontsize-5"><a href="mailto:info@lachattownfarm.org" '
'class="text-decoration-none text-color-light opacity-7 '
'custom-fontsize-5">info@lachattownfarm.org</a><br>106 Godfrey '
'Road West<br>Weston, CT 06883</span>\n'
'\t\t\t\t\t\t\t<ul class="header-social-icons social-icons d-none '
'd-sm-block ml-1 mt-4">\n'
'\t\t\t\t\t\t\t\t<li class="social-icons-facebook"><a '
'href="https://www.facebook.com/lachattownfarm/" target="_blank" '
'title="Facebook"><i class="fab fa-facebook-f"></i></a></li>\n'
'\t\t\t\t\t\t\t\t<li class="social-icons-instagram"><a '
'href="http://www.instagram.com/" target="_blank" '
'title="Instagram"><i class="fab fa-instagram"></i></a></li>\n'
'<!---\t\t\t\t\t\t\t\t<li class="social-icons-twitter"><a '
'href="http://www.twitter.com/" target="_blank" '
'title="Twitter"><i class="fab fa-twitter"></i></a></li>---->\n'
'\t\t\t\t\t\t\t</ul>\n'
'\t\t\t\t\t\t</div>\n'
'<!--------\n'
'\t\t\t\t<div class="col-md-3 col-lg-3">\n'
'\t\t\t\t\t<h2 class=" text-color-light custom-fontsize-6 '
'mb-1">FOLLOW US</h2>\n'
'\t\t\t\t\t\t<ul class="header-social-icons social-icons d-none '
'd-sm-block ml-1 mt-4">\n'
'\t\t\t\t\t\t\t<li class="social-icons-facebook"><a '
'href="https://www.facebook.com/lachattownfarm/" target="_blank" '
'title="Facebook"><i class="fab fa-facebook-f"></i></a></li>\n'
'\t\t\t\t\t\t\t<li class="social-icons-instagram"><a '
'href="http://www.instagram.com/" target="_blank" '
'title="Instagram"><i class="fab fa-instagram"></i></a></li>\n'
'\t\t\t\t\t\t\t<li class="social-icons-twitter"><a '
'href="http://www.twitter.com/" target="_blank" '
'title="Twitter"><i class="fab fa-twitter"></i></a></li>\n'
'\t\t\t\t\t\t</ul>\n'
'\t\t\t\t\t</div>\n'
'<!----->\n'
'\t\t\t\t\t</div>\n'
'\t\t\t\t</div>\n'
'\t\t\t\t<!----->\n'
' <div class="footer-copyright '
'custom-bg-color-footer-2 border-top-0 mt-0">\n'
'\t\t\t\t\t<div class="container">\n'
'\t\t\t\t\t\t<hr class="solid opacity-1 mb-0">\n'
'\t\t\t\t\t\t<div class="row">\n'
'\t\t\t\t\t\t\t<div class="col mt-4 mb-4">\n'
'\t\t\t\t\t\t<!--\t\t<p class="text-center text-2 mb-0">© '
'Copyright 2021. All Rights Reserved.</p>--->\n'
'\t\t\t\t\t\t\t</div>\n'
'\t\t\t\t\t\t</div>\n'
'\t\t\t\t\t</div>\n'
'\t\t\t\t</div>\n'
'\t\t\t</footer>\n'
'\n'
'\t\t<!-- Vendor -->\n'
'\t\t<script src="vendor/jquery/jquery.min.js"></script>\n'
'\t\t<script '
'src="vendor/jquery.appear/jquery.appear.min.js"></script>\n'
'\t\t<script '
'src="vendor/jquery.easing/jquery.easing.min.js"></script>\n'
'\t\t<script '
'src="vendor/jquery.cookie/jquery.cookie.min.js"></script>\n'
'\t\t<script src="vendor/popper/umd/popper.min.js"></script>\n'
'\t\t<script '
'src="vendor/bootstrap/js/bootstrap.min.js"></script>\n'
'\t\t<script '
'src="vendor/jquery.validation/jquery.validate.min.js"></script>\n'
'\t\t<script '
'src="vendor/jquery.easy-pie-chart/jquery.easypiechart.min.js"></script>\n'
'\t\t<script '
'src="vendor/jquery.gmap/jquery.gmap.min.js"></script>\n'
'\t\t<script src="vendor/lazysizes/lazysizes.min.js"></script>\n'
'\t\t<script '
'src="vendor/isotope/jquery.isotope.min.js"></script>\n'
'\t\t<script '
'src="vendor/owl.carousel/owl.carousel.min.js"></script>\n'
'\t\t<script '
'src="vendor/magnific-popup/jquery.magnific-popup.min.js"></script>\n'
'\t\t<script src="vendor/vide/jquery.vide.min.js"></script>\n'
'\t\t<script src="vendor/vivus/vivus.min.js"></script>\n'
'\n'
'\t\t<!-- Theme Base, Components and Settings -->\n'
'\t\t<script src="js/theme.js"></script>\n'
'\t\t<!-- Current Page Vendor and Views -->\n'
'\t\t<script src="js/views/view.contact.js"></script>\n'
'\t\t<!-- Current Page Vendor and Views -->\n'
'\t\t<script '
'src="vendor/rs-plugin/js/jquery.themepunch.tools.min.js"></script>\n'
'\t\t<script '
'src="vendor/rs-plugin/js/jquery.themepunch.revolution.min.js"></script>\n'
'\n'
'\t\t<!-- Current Page Vendor and Views -->\n'
'\t\t<script src="js/views/view.contact.js"></script>\n'
'\n'
'\t\t<!-- Theme Custom -->\n'
'\t\t<script src="js/custom.js"></script>\n'
'\n'
'\t\t<!-- Theme Initialization Files -->\n'
'\t\t<script src="js/theme.init.js"></script>\n'
'\n'
'\t\t<script src="js/examples.portfolio.js"></script>\t\t\n'
'\t\t\n'
'\t\t<script src="js/views/view.home.js"></script>\n'
'\n'
'\t</div></div>'}
2026-06-16 09:21:07 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://www.lachattownfarm.org/register.php?eventid=2115> (referer: https://www.lachattownfarm.org/index.php)
2026-06-16 09:21:07 [urllib3.connectionpool] DEBUG: Starting new HTTP connection (1): 144.91.120.141:80
2026-06-16 09:21:07 [urllib3.connectionpool] DEBUG: http://144.91.120.141:80 "POST /api/v1/raw-events/ HTTP/1.1" 201 2861
2026-06-16 09:21:07 [scrapy.core.scraper] DEBUG: Scraped from <200 https://www.lachattownfarm.org/register.php?eventid=2115>
{'event_url': 'https://www.lachattownfarm.org/register.php?eventid=2115',
'platform': 'Lachatt own Farm',
'platform_hash': '2f1c873a619548b03f9b4cdc2146e010',
'raw_body': '<div class="col-md-12 custom-column-bg py-5 mb-1 mb-md-0 '
'appear-animation" data-appear-animation="fadeInTopShorter">\n'
'\t\t\t\t\t\t\t\t<div class="overflow-hidden mb-3">\t\t\t\t<h2 '
'class="text-color-dark font-weight-semi-bold text-6 '
'negative-ls-1 mb-0">High School Interns - Join '
'Us!</h2>\t\t\t\t\t\t\t</div>\n'
' <div> \t<div '
'class="row">\n'
' \t\t<div '
'class="col-md-4">\t\t\t\t\t\t\t\t<div class="owl-carousel '
'owl-theme nav-inside mb-0" data-plugin-options="{\'items\': 1, '
"'margin': 10, 'animateOut': 'fadeOut', 'autoplay': true, "
'\'autoplayTimeout\': 5000, \'loop\': true}"><div><img '
'class="img-fluid" '
'src="img/events/after-school-intern.jpg"></div>\t\t\t\t\t\t'
'</div>\t\t\t\t\t</div> \t\t<div '
'class="col-md-8"><p class="mb-1 mr-2"><strong><strong>Spring '
'2026</strong></strong></p><p class="mb-0">.</p><p '
'class="">Lachat Town Farm is looking for High School volunteers '
'to help support after school programs for grades K-5. Interns '
'will have the opportunity to work with experienced program '
'leaders in topics related to cooking, nature, and art. The '
'ideal intern has a love of children and the outdoors, and a '
'willingness to get their hands dirty!</p><p class="">Interns are '
'asked to be at the farm from 3:15-5:15pm each day that class is '
'running. Students who don’t have transportation can take Bus 1 '
'from WHS to the Farm. The session begins January 5 and each '
'class runs 1 day a week for 8 weeks. Please view the class '
'listings (link below) for exact dates the programs will run, as '
'start/end dates vary with the school calendar. </p><p '
'class="">Please submit applications as early as possible, '
'applications will continue to be considered on a rolling basis '
'as long as there is availability. Questions can be directed to '
'childrensprograms@lachattownfarm.org.</p><p class=""><a '
'href="https://docs.google.com/forms/d/e/1FAIpQLSfde93606FeQnI1VHV-LINncM4WatW9iR_YbfPV2-N5dfsDJA/viewform?usp=sharing&ouid=113912751658580906530" '
'target="_blank">APPLY TO HELP WITH AFTER SCHOOL PROGRAMS '
'HERE!</a></p><p class=""><a '
'href="https://lachat-town-farm.jumbula.com/spring-2026" '
'target="_blank">Additional details about these programs '
'(including exact dates) can be found here.</a></p> <div '
'class="row"><div class="col-md-12">\n'
'\n'
'\n'
'\n'
'\n'
'<ul class="list list-icons">\t\t\t\t\t\t\t </ul></div>\n'
'\t\t\t\t\t\t\t</div>\n'
'\t\t\t\t\t\t</div>\n'
'\t\t\t\t\t</div>\n'
'\t\t\t\t\n'
'\n'
'\t\t\t</div> \n'
'\t\t</div>'}
2026-06-16 09:21:07 [scrapy.core.engine] INFO: Closing spider (finished)
2026-06-16 09:21:07 [scrapy.extensions.feedexport] INFO: Stored csv feed (10 items) in: output/2026/06/16/lachattown_farm.csv
2026-06-16 09:21:07 [scrapy.statscollectors] INFO: Dumping Scrapy stats:
{'downloader/request_bytes': 9115,
'downloader/request_count': 11,
'downloader/request_method_count/GET': 11,
'downloader/response_bytes': 56876,
'downloader/response_count': 11,
'downloader/response_status_count/200': 11,
'elapsed_time_seconds': 6.586977,
'feedexport/success_count/FileFeedStorage': 1,
'finish_reason': 'finished',
'finish_time': datetime.datetime(2026, 6, 16, 7, 21, 7, 623908, tzinfo=datetime.timezone.utc),
'httpcompression/response_bytes': 248903,
'httpcompression/response_count': 11,
'item_scraped_count': 10,
'items_per_minute': 100.0,
'log_count/DEBUG': 42,
'log_count/ERROR': 1,
'log_count/INFO': 3,
'log_count/WARNING': 1,
'memusage/max': 92958720,
'memusage/startup': 92958720,
'offsite/domains': 1,
'offsite/filtered': 1,
'request_depth_max': 1,
'response_received_count': 11,
'responses_per_minute': 110.0,
'scheduler/dequeued': 11,
'scheduler/dequeued/memory': 11,
'scheduler/enqueued': 11,
'scheduler/enqueued/memory': 11,
'start_time': datetime.datetime(2026, 6, 16, 7, 21, 1, 36931, tzinfo=datetime.timezone.utc)}
2026-06-16 09:21:07 [scrapy.core.engine] INFO: Spider closed (finished)
-
Source
http://127.0.0.1:6800/logs/event_scrapers/lachattown_farm/e82d8fc7695311f193310050565fa5d9.log