I need to locate and extract a specific web element from a webpage. The challenge is that I cannot rely on selectors (e.g., CSS or XPath), since I don’t know the exact HTML structure of the page in advance. Instead, I can only describe the target element, for example: a `div`, `table`, `list`, or `grid` containing a structured set of items that represent users, participants, etc. (some kind of catalog).
For simpler pages, I’ve successfully used the FirecrawlScrapeWebsiteTool to scrape the entire page content and passed the output to a second agent, which could then locate the element based on my description.
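Roughly, that working setup looks like this (the role/goal strings and the URL are placeholders of mine, and FIRECRAWL_API_KEY is assumed to be set in the environment):

```python
# Rough sketch of my two-agent setup for simpler pages.
from crewai import Agent, Task, Crew
from crewai_tools import FirecrawlScrapeWebsiteTool

scraper = Agent(
    role="Scraper",
    goal="Fetch the full content of the target page",
    backstory="Fetches raw page content for downstream analysis.",
    tools=[FirecrawlScrapeWebsiteTool(url="https://example.com/participants")],
)

locator = Agent(
    role="Element locator",
    goal="Identify the container holding the catalog of users/participants",
    backstory="Reads raw HTML and spots structured, repeated item lists.",
)

scrape_task = Task(
    description="Scrape the page and return its full content.",
    expected_output="The raw page content.",
    agent=scraper,
)

locate_task = Task(
    description=(
        "From the scraped content, find the div/table/list/grid element "
        "that contains the set of user/participant entries."
    ),
    expected_output="The matching container element's markup.",
    agent=locator,
    context=[scrape_task],
)

Crew(agents=[scraper, locator], tasks=[scrape_task, locate_task]).kickoff()
```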
However, this approach fails on larger pages, where the HTML runs to thousands of lines: the scraped content exceeds the LLM’s context window, making it impractical to process in one pass.
I’ve tried several tools, including:
- WebsiteSearchTool
- SeleniumScrapingTool
- ScrapeWebsiteTool
- FirecrawlScrapeWebsiteTool
But none of them has reliably and efficiently located the element.
I also tried saving the page’s HTML markup to a local file and reading it back with the FileReadTool, planning to locate the desired element in the saved file. However, I kept hitting this error:
```
File reading error: 'charmap' codec can't decode byte 0x98 in position ...
```
The 'charmap' codec is Python’s name for the Windows default encoding (cp1252), so this likely means the FileReadTool opens the file with the platform default rather than UTF-8 and chokes on non-ASCII bytes in the saved HTML.
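For what it’s worth, saving and re-reading the file in plain Python with an explicit encoding should sidestep the decode error (the URL below is a placeholder):

```python
# Minimal sketch: write and read the page with an explicit encoding so the
# platform default ('charmap'/cp1252 on Windows) never comes into play.
import requests

resp = requests.get("https://example.com/participants")  # placeholder URL
resp.encoding = resp.apparent_encoding  # let requests sniff the charset

with open("page.html", "w", encoding="utf-8") as f:
    f.write(resp.text)

# errors="replace" keeps a single bad byte from aborting the whole read
with open("page.html", "r", encoding="utf-8", errors="replace") as f:
    html = f.read()
```

That only addresses the decoding, though; the file is still far too large to hand to an agent whole.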
So I would appreciate it if someone who has already solved a similar problem could suggest:
- What’s the most efficient way to solve my task?
- Should I focus on locating the element directly on the page in real-time, possibly using a tool that can interpret the page dynamically?
- If saving the HTML and processing it offline is the better approach, what tool can I use to accurately read and process the HTML file? (A rough sketch of what I had in mind follows below.)
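For context, this is the kind of offline pre-filtering I was considering for the second option: shrink the saved HTML down to candidate container elements first, so only a fraction of the page has to fit into the context window. The tag list and the minimum-children heuristic are guesses on my part, not a proven recipe:

```python
# Sketch: pre-filter the saved HTML down to candidate "catalog" containers
# (div/table/ul/ol) before handing anything to an agent. The tag names and
# the >= 5 direct-children threshold are assumptions, not a fixed rule.
from bs4 import BeautifulSoup

with open("page.html", "r", encoding="utf-8", errors="replace") as f:
    soup = BeautifulSoup(f.read(), "html.parser")

candidates = []
for el in soup.find_all(["div", "table", "ul", "ol"]):
    # Several similar direct children is a rough signal for repeated
    # user/participant entries.
    if len(el.find_all(recursive=False)) >= 5:
        candidates.append(el)

# Only the (much smaller) candidate snippets would go to the locator agent.
snippets = [str(c)[:2000] for c in candidates]
```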