| Signed-off-by: Celeste Horgan <celeste@cncf.io> | ||
|---|---|---|
| .. | ||
| README.md | ||
		
			
				
				README.md
			
		
		
			
			
		
	
	Internal link checking tool
You can use htmltest to check for broken links in /content/en/. This is useful when refactoring sections of content, moving pages around, or renaming files or page headers.
How the tool works
htmltest scans links in the generated HTML files of the kubernetes website repository. It runs using a make command which does the following:
- Builds the site and generates output HTML in the /publicdirectory of your localkubernetes/websiterepository
- Pulls the wdjp/htmltestDocker image
- Mounts your local kubernetes/websiterepository to the Docker image
- Scans the files generated in the /publicdirectory and provides command line output when it encounters broken internal links
What it does and doesn't check
The link checker scans generated HTML files, not raw Markdown. The htmltest tool depends on a configuration file, .htmltest.yml, to determine which content to examine.
The link checker scans the following:
- All content generated from Markdown in /content/en/docsdirectory, excluding:- Generated API references, for example https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.18/
 
- All internal links, excluding:
- Empty hashes (<a href="#">or[title](#)) and empty hrefs (<a href="">or[title]())
- Internal links to images and other media files
 
- Empty hashes (
The link checker does not scan the following:
- Links included in the top and side nav bars, footer links, or links in a page's <head>section, such as links to CSS stylesheets, scripts, and meta information
- Top level pages and their children, for example: /training,/community,/case-studies/adidas
- Blog posts
- API Reference documentation, for example: https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.18/
- Localizations
Prerequisites and installation
You must install
Running the link checker
To run the link checker:
- 
Navigate to the root directory of your local kubernetes/websiterepository.
- 
Run the following command: 
make docker-internal-linkcheck
Understanding the output
If the link checker finds broken links, the output is similar to the following:
tasks/access-kubernetes-api/custom-resources/index.html
  hash does not exist --- tasks/access-kubernetes-api/custom-resources/index.html --> #preserving-unknown-fields
  hash does not exist --- tasks/access-kubernetes-api/custom-resources/index.html --> #preserving-unknown-fields
This is one set of broken links. The log adds an output for each page with broken links.
In this output, the file with broken links is tasks/access-kubernetes-api/custom-resources.md.
The tool gives a reason: hash does not exist. In most cases, you can ignore this.
The target URL is #preserving-unknown-fields.
One way to fix this is to:
- Navigate to the Markdown file with a broken link.
- Using a text editor, do a full-text search (usually Ctrl+F or Command+F) for the broken link's URL, #preserving-unknown-fields.
- Fix the link. For a broken page hash (or anchor) link, check whether the topic was renamed or removed.
Run htmltest to verify that broken links are fixed.