BazQux bot

What is BazQux?

BazQux is a specialized web fetcher designed to retrieve metadata for link preview generation across various platforms. It's operated by BazQux and functions as a content retrieval tool for its RSS reader service. The bot identifies itself in server logs with a user agent string that typically follows this format: Mozilla/5.0 (compatible; BazQux/2.4; +https://bazqux.com/fetcher; 2 subscribers). This string includes a compatibility declaration, software version identifier, service documentation link, and information about subscriber count.

As a dedicated fetcher, BazQux operates through on-demand requests triggered by user interactions in applications that integrate its services. It's designed to comply with web standards while playing a role in content presentation ecosystems.

Why is BazQux crawling my site?

BazQux crawls websites primarily to retrieve content for its RSS reader service. When users of the BazQux Reader subscribe to content from your site, the service needs to fetch that content to display it to subscribers. The crawler visits your site to gather article content, images, and metadata that help create a complete reading experience for its users.

The frequency of visits depends on how many BazQux users have subscribed to your content and how often you update your site. Sites with more subscribers or frequent updates will see more regular visits. The crawler is triggered when new content is published or when users request updates through the reader interface.

These crawling activities are generally considered authorized as they serve a legitimate purpose of content syndication, similar to how other RSS readers and content aggregation services operate.

What is the purpose of BazQux?

BazQux serves as the content retrieval mechanism for the BazQux Reader, an RSS feed reader service. Its primary functions include content preview generation through title extraction, thumbnail image acquisition from target resources, and structured data parsing for rich media displays.

The data collected by BazQux is used to provide BazQux Reader users with a streamlined reading experience, allowing them to consume content from multiple sources in a single interface. For content creators and website owners, this service can extend the reach of your content to audiences who prefer consuming information through RSS readers rather than visiting individual websites directly.

Website owners benefit from having their content distributed to interested readers who might not otherwise visit their site regularly. However, the trade-off is that some users may read your content exclusively through the reader interface rather than visiting your website directly.

How do I block BazQux?

BazQux respects the standard robots.txt protocol, making it straightforward to control its access to your content. If you wish to block BazQux from crawling specific sections of your site or your entire site, you can add directives to your robots.txt file. Here's how to implement these controls:

User-agent: BazQux
Disallow: /private-path/

This example would prevent BazQux from accessing any content in the "/private-path/" directory while allowing it to crawl the rest of your site. If you want to block BazQux from your entire site, you can use:

User-agent: BazQux
Disallow: /

Keep in mind that blocking BazQux means that users of the BazQux Reader won't be able to access your content through that service. This could potentially reduce your content's reach among readers who rely on RSS aggregators. If you publish content for public consumption and want to maximize its distribution, you might consider allowing BazQux to access at least your public-facing content.

For more granular control over how your content appears in preview contexts, you can implement Open Graph meta tags that explicitly define preview titles and images. This approach allows you to maintain visibility while controlling exactly how your content is represented.

Something incorrect or have feedback?
Share feedback
BazQux bot logo

Operated by

Data fetcher

Documentation

Go to docs

AI model training

Not used to train AI or LLMs

Acts on behalf of user

Yes, behavior is triggered by a real user action

Obeys directives

Yes, obeys robots.txt rules

User Agent

Mozilla/5.0 (compatible; BazQux/2.4; +https://bazqux.com/fetcher; 2 subscribers)