LinkedInBot
What is LinkedInBot?
LinkedInBot is a web crawler developed and operated by LinkedIn Corporation, the professional networking platform owned by Microsoft. It systematically browses websites to collect information for LinkedIn's services, gathering data about companies, job listings, and professional content across the web to enhance the platform's functionality.
When LinkedInBot visits your website, it identifies itself through user-agent strings such as:
LinkedInBot/1.0 (compatible; Mozilla/5.0; Apache-HttpClient http://www.linkedin.com)
LinkedInBot/1.0 (compatible; Mozilla/5.0; Jakarta Commons-HttpClient/3.1 +http://www.linkedin.com)
These identifiers allow server administrators to recognize when LinkedIn's crawler is accessing their site.
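As a rough illustration of how an administrator might recognize these requests, the sketch below checks a User-Agent header for the LinkedInBot token. The function name is_linkedinbot is hypothetical, and the match is a plain case-insensitive substring test, which is an assumption rather than a method LinkedIn documents. A User-Agent header can be spoofed, so a match only indicates that the client claims to be LinkedInBot.

```python
def is_linkedinbot(user_agent):
    """Return True if the User-Agent value contains the LinkedInBot token.

    Hypothetical helper: a case-insensitive substring match, chosen so that
    both user-agent variants quoted above are recognized.
    """
    # Treat a missing header (None) the same as an empty string.
    return "linkedinbot" in (user_agent or "").lower()


if __name__ == "__main__":
    samples = [
        "LinkedInBot/1.0 (compatible; Mozilla/5.0; Apache-HttpClient http://www.linkedin.com)",
        "LinkedInBot/1.0 (compatible; Mozilla/5.0; Jakarta Commons-HttpClient/3.1 +http://www.linkedin.com)",
        "Mozilla/5.0 (X11; Linux x86_64) Firefox/126.0",  # ordinary browser, for contrast
    ]
    for ua in samples:
        print(is_linkedinbot(ua), ua)
```

A check like this could be applied to the User-Agent field of each access-log entry to estimate how often the crawler visits.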
The bot operates like other major web crawlers by following links, analyzing page content, and sending information back to LinkedIn's servers. It's designed to respect standard web protocols and crawling conventions, including robots.txt directives.
Why is LinkedInBot crawling my site?
LinkedInBot typically crawls websites to gather information relevant to professional networking and business intelligence. If your site contains job listings, company information, professional articles, or business profiles, LinkedIn's crawler may visit to index this content.
The bot is particularly interested in content that could enhance LinkedIn's database of companies, jobs, and professional information. It may visit your site to:
- Update information about your company's profile on LinkedIn
- Index job postings that could appear in LinkedIn's job search
- Gather professional content that might be relevant to LinkedIn users
- Collect data about businesses and professional services
The frequency of visits depends on how often your content changes and its relevance to LinkedIn's services. Sites with frequent job postings or company updates may experience more regular crawling activity.
What is the purpose of LinkedInBot?
LinkedInBot serves LinkedIn's core mission of connecting professionals and businesses. The data it collects helps populate and maintain LinkedIn's vast database of companies, job opportunities, and professional content.
The bot's primary functions include:
- Enhancing company profiles on LinkedIn with current information
- Keeping job listings up-to-date in LinkedIn's job search engine
- Gathering business intelligence that helps LinkedIn provide relevant recommendations to users
- Improving LinkedIn's ability to connect professionals with appropriate opportunities
For website owners, particularly those managing corporate sites or job boards, LinkedInBot's activities can be beneficial. When the bot correctly indexes your content, it can increase your company's visibility on LinkedIn's platform, potentially driving professional engagement and qualified job applicants to your organization.
How do I block LinkedInBot?
If you prefer to limit or block LinkedInBot from crawling your site, you can use standard robots.txt directives. LinkedIn's crawler is designed to respect robots.txt instructions, making this the simplest method for controlling access.
To block LinkedInBot completely, add the following to your robots.txt file:
User-agent: LinkedInBot
Disallow: /
To allow LinkedInBot to crawl only specific sections of your site, you can use more selective directives:
User-agent: LinkedInBot
Allow: /public/
Allow: /jobs/
Disallow: /
This example would allow LinkedIn's crawler to access only the "/public/" and "/jobs/" directories while blocking access to all other content.
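Before deploying either rule set, it can help to confirm that it behaves as intended. The sketch below feeds both robots.txt examples from this section to Python's standard urllib.robotparser module and asks whether LinkedInBot may fetch a few URLs. The helper name can_linkedinbot_fetch and the example.com URLs are hypothetical, and the result reflects how Python's parser reads the rules, not necessarily how LinkedIn's crawler evaluates them.

```python
from urllib.robotparser import RobotFileParser

# The two rule sets shown in this section.
BLOCK_ALL = """\
User-agent: LinkedInBot
Disallow: /
"""

SELECTIVE = """\
User-agent: LinkedInBot
Allow: /public/
Allow: /jobs/
Disallow: /
"""


def can_linkedinbot_fetch(robots_txt, url):
    """Hypothetical helper: parse robots.txt text and test one URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch("LinkedInBot", url)


if __name__ == "__main__":
    # example.com paths are placeholders for illustration only.
    checks = [
        (BLOCK_ALL, "https://example.com/"),
        (SELECTIVE, "https://example.com/public/about.html"),
        (SELECTIVE, "https://example.com/jobs/123"),
        (SELECTIVE, "https://example.com/private/report.pdf"),
    ]
    for rules, url in checks:
        print(can_linkedinbot_fetch(rules, url), url)
```

Python's parser applies the first matching rule in file order, while some crawlers prefer the longest matching path. Listing the Allow lines before Disallow: / gives the same outcome under either interpretation.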
Before blocking LinkedInBot, consider the potential benefits of allowing LinkedIn to index your professional content. Blocking the bot might reduce your company's visibility on LinkedIn's platform and could limit your reach to potential employees, partners, or customers who use LinkedIn to discover businesses and opportunities. If you're concerned about specific sensitive content, consider using selective blocking rather than completely restricting the bot's access.