User-agent: * # who can crawl pages on this website Clean-param: utm # remove duplicate pages with any UTM parameters Sitemap: https://example.com/sitemap # map for indexable pages Host: https://example.com/ # main mirror of this website