Turn a site crawl export into a clean text catalog that highlights the best pages for AI and SEO work. Ideal for content and SEO teams that want a fast way to prepare curated links with titles and short descriptions.
The flow starts with a simple form where you enter the site name, a short summary, and upload a CSV export from your crawler. The file is parsed, then mapped to seven key fields: URL, title, description, status, indexability, content type, and word count. A filter keeps only pages that return a 200 status, are indexable, and have an HTML text content type. You can also enable an optional AI step that uses OpenAI to classify each page as useful content or other content. Each page is formatted as a single line, then all lines are combined and saved as a downloadable text file. You can swap the last node for one that uploads the file to cloud storage.
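The filter and formatting steps described above can be sketched outside the workflow in a few lines of Python. This is a minimal illustration, not the template's actual node code: the column names (url, title, description, status, indexability, content_type), the "Indexable" value, and the line layout are all assumptions chosen for the example, so adjust them to match your crawler's export.

```python
import csv
import io


def cell(row: dict, key: str) -> str:
    """Return a trimmed cell value, treating missing cells as empty."""
    return (row.get(key) or "").strip()


def is_catalog_page(row: dict) -> bool:
    """Keep pages that return 200, are indexable, and are HTML text.

    The column names and the 'indexable' value are assumptions.
    """
    return (
        cell(row, "status") == "200"
        and cell(row, "indexability").lower() == "indexable"
        and cell(row, "content_type").lower().startswith("text/html")
    )


def format_line(row: dict) -> str:
    """Format one page as a single line (hypothetical layout)."""
    url = cell(row, "url")
    title = cell(row, "title") or url  # fall back to the URL if no title
    description = cell(row, "description")
    line = f"- {title} ({url})"
    return f"{line}: {description}" if description else line


def build_catalog(site_name: str, summary: str, csv_text: str) -> str:
    """Parse the CSV, filter the rows, and join the lines into one text file."""
    rows = csv.DictReader(io.StringIO(csv_text))
    lines = [format_line(row) for row in rows if is_catalog_page(row)]
    return "\n".join([site_name, summary, "", *lines]) + "\n"
```

Calling build_catalog with the site name, the summary, and the CSV contents returns the text you would write to the downloadable file. Redirects, non-indexable pages, and non-HTML assets such as images are dropped before formatting.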
Use a CSV export that includes internal URLs, ideally the internal HTML export. The mapping handles multiple languages, so non-English exports still work. Replacing manual sorting with a guided flow lets teams build a clean list in minutes and reuse the same steps for new sites or larger crawls.
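One way a mapping can cope with exports in different languages is an alias table that translates each known column header to a single field name. The sketch below assumes this approach; the template may do it differently. The English and German header names are hypothetical examples, not headers confirmed for any particular crawler.

```python
# Hypothetical header aliases: field name -> known column headers (lowercase).
HEADER_ALIASES = {
    "url": {"address", "adresse"},
    "title": {"title", "titel"},
    "description": {"meta description", "meta-beschreibung"},
    "status": {"status code", "statuscode"},
    "indexability": {"indexability", "indexierbarkeit"},
    "content_type": {"content type", "inhaltstyp"},
    "word_count": {"word count", "wortanzahl"},
}

# Reverse lookup: lowercase header -> field name.
ALIAS_TO_FIELD = {
    alias: field
    for field, aliases in HEADER_ALIASES.items()
    for alias in aliases
}


def normalize_row(raw: dict) -> dict:
    """Map a raw CSV row to the seven fields, ignoring unknown columns."""
    mapped = {}
    for header, value in raw.items():
        field = ALIAS_TO_FIELD.get((header or "").strip().lower())
        if field and field not in mapped:  # first matching column wins
            mapped[field] = (value or "").strip()
    return mapped
```

Supporting another export language then only means adding its header names to the alias table, while the rest of the flow keeps working with the same seven field names.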
These templates were sourced from publicly available materials across the web, including n8n’s official website, YouTube, and public GitHub repositories. We have consolidated and categorized them for easy search and filtering, and supplemented them with links to integrations, step-by-step setup instructions, and personalized support in the Futurise community. Content in this library is provided for education, evaluation, and internal use. Users are responsible for checking and complying with the license terms with the author of the templates before commercial use or redistribution. Where an original author was identified, attribution has been provided. Some templates did not include author information. If you know who created this template, please let us know so we can add the appropriate credit and reference link. If you are the author and would like this template removed from the library, email us at info@futurise.com and we will remove it promptly.