generate-sitemap

Generate an XML sitemap for a GitHub Pages site using GitHub Actions

MIT License

Stars
71
Committers
4

Bot releases are visible (Hide)

generate-sitemap - generate-sitemap, v1.10.1 Latest Release

Published by cicirello 4 months ago

[1.10.1] - 2024-06-08

Fixed

  • Escape characters that must be escaped in XML.

Dependencies

  • Bump cicirello/pyaction from 4.26.0 to 4.30.0
generate-sitemap - generate-sitemap, v1.10.0

Published by cicirello 11 months ago

[1.10.0] - 2023-11-15

Added

  • Ability to specify list of paths to exclude from sitemap, via new input exclude-paths.

Dependencies

  • Bump cicirello/pyaction from 4.25.0 to 4.26.0
generate-sitemap - generate-sitemap, v1.9.2

Published by cicirello about 1 year ago

[1.9.2] - 2023-10-05

Fixed

  • Fix minor bug in regex used to detect if a page has a meta robots noindex directive in head.

CI/CD

  • Bump Python to 3.12 in CI/CD workflows when running unit tests.

Dependencies

  • Bump cicirello/pyaction from 4.14.1 to 4.25.0, including upgrading Python within the Docker container to 3.12.
generate-sitemap - generate-sitemap, v1.9.1

Published by cicirello almost 2 years ago

[1.9.1] - 2023-01-16

Fixed

  • Case-insensitive check for <meta name="robots" content="noindex"> in head of html files.
  • Correct handling of <meta content="noindex" name="robots"> (i.e., content before name).

Dependencies

  • Bump cicirello/pyaction from 4.12.0 to 4.14.1
generate-sitemap - generate-sitemap, v1.9.0

Published by cicirello almost 2 years ago

[1.9.0] - 2022-10-25

Added

  • Option to include dates only in the lastmod fields of XML sitemaps. Default includes full date-time.

CI/CD

  • Bump Python to 3.11 in CI/CD workflows.

Dependencies

  • Bump cicirello/pyaction from 4.11.1 to 4.12.0, which includes upgrading Python in the Docker container to 3.11.0.
generate-sitemap - generate-sitemap, v1.8.6

Published by cicirello almost 2 years ago

[1.8.6] - 2022-10-24

Fixed

  • Some users may be using the action on a self-hosted runner not yet updated to a version supporting the
    new GitHub Actions GITHUB_OUTPUT environment file. This patch adds backwards compatibility for those
    users (e.g., it falls back to using the deprecated set-output if GITHUB_OUTPUT doesn't exist).

Dependencies

  • Bump cicirello/pyaction from 4.11.0 to 4.11.1
generate-sitemap - generate-sitemap, v1.8.5

Published by cicirello almost 2 years ago

[1.8.5] - 2022-10-20

Fixed

  • Replaced the usage of GitHub Action's deprecated set-output with the new $GITHUB_OUTPUT env file.
  • Handle alternate casing of boolean inputs in GitHub workflow YAML files (previously expected lowercase).
  • Refactored entrypoint for improved maintainability, and ease of planned new functionality.

Dependencies

  • Bumped pyaction from 4.7.0 to 4.11.0, which includes upgrading Python in the Docker container to 3.10.7.
generate-sitemap - generate-sitemap, v1.8.4

Published by cicirello about 2 years ago

[1.8.4] - 2022-08-03

Changed

  • Refactored index.html dropping logic to ease support for additional dropped index filenames.

Fixed

Dependencies

  • Bumped base docker image cicirello/pyaction from 4.3.1 to 4.7.0.

New Contributors

generate-sitemap - generate-sitemap, v1.8.3

Published by cicirello over 2 years ago

[1.8.3] - 2022-04-22

Fixed

  • Corrected check for robots noindex directive in case when non-utf8 characters present in an html file.
  • Disabled pycache to protect against potential future bug. Currently no imports so no pycache created, but if future versions import local py modules, a pycache would be created during run in repo. Disabled creation of pycache now to avoid.

Dependencies

  • Bumped base Docker image cicirello/pyaction from 4.2.0 to 4.3.1.
generate-sitemap - generate-sitemap, v1.8.2

Published by cicirello over 2 years ago

[1.8.2] - 2022-03-04

Changed

  • Bumped Python to 3.10.
  • Bumped base Docker image to pyaction 4.2.0.
  • Now pulls base Docker image from the GitHub Container Registry rather
    than Docker Hub under the assumption that pulling from GitHub from
    GitHub Actions is likely faster than from Docker Hub.
generate-sitemap - generate-sitemap, v1.8.1

Published by cicirello about 3 years ago

[1.8.1] - 2021-07-29

Changed

  • Upgraded base docker image to cicirello/pyaction:4.
generate-sitemap - generate-sitemap, v1.8.0

Published by cicirello over 3 years ago

[1.8.0] - 2021-06-28

Added

  • Added option to exclude .html from URLs listed in the sitemap
    for html files. GitHub Pages automatically serves a corresponding
    html file if a user browses to a page with a URL with no file extension.
    This new option to the generate-sitemap action enables your sitemap to
    match this behavior if you prefer the extension-less look of URLs. There
    is a new action input, drop-html-extension, to control this behavior.

Changed

  • Use major release tag when pulling base docker image (e.g.,
    automatically get non-breaking changes to base image, such as
    bug fixes, etc without need to update Dockerfile).
generate-sitemap - generate-sitemap, v1.7.2

Published by cicirello over 3 years ago

[1.7.2] - 2021-05-13

Changed

  • Switched tag used to pull base Docker image from latest to the
    specific release that is the current latest, to enable testing
    against base image updates prior to releases. This is a purely
    non-functional change.

Fixed

  • Bug involving missing lastmod dates for website files created by
    the workflow, but not yet committed. These are now set using the
    current date and time.
generate-sitemap - generate-sitemap, v1.7.1

Published by cicirello over 3 years ago

[1.7.1] - 2021-05-06

Changed

  • Refactored to improve code maintainability.

CI/CD

  • Introduced major version tag.
generate-sitemap - generate-sitemap, v1.7.0

Published by cicirello over 3 years ago

[1.7.0] - 2021-4-26

Added

  • New action input, additional-extensions, that enables adding
    other indexable file types to the sitemap.

CI/CD

  • Enabled CodeQL code scanning on all push/pull-request events.
generate-sitemap - generate-sitemap, v1.6.2

Published by cicirello over 3 years ago

[1.6.2] - 2021-3-10

Changed

  • Improved the documentation (otherwise, this release is
    functionally equivalent to the previous release).
generate-sitemap - generate-sitemap, v1.6.1

Published by cicirello about 4 years ago

[1.6.1] - 2020-9-24

Fixed

  • Bug in generating URL for files with names ending in "index.html" but not exactly equal to "index.html", such as "aindex.html". Previous version would incorrectly truncate this to just "a", dropping the "index.html". This version now correctly identifies "index.html" files.
generate-sitemap - generate-sitemap, v1.6.0

Published by cicirello about 4 years ago

[1.6.0] - 2020-9-21

Added

  • Support for robots.txt: In addition to the previous functionality of excluding html URL's that contain <meta name="robots" content="noindex"> directives, the generate-sitemap GitHub action now parses a robots.txt file, if present at the root of the website, excluding any URLs from the sitemap that match Disallow: rules for User-agent: *.
generate-sitemap - generate-sitemap, v1.5.0

Published by cicirello about 4 years ago

generate-sitemap, v1.5.0

This action generates a sitemap for a website hosted on GitHub Pages. It supports both xml and txt sitemaps. When generating an xml sitemap, it uses the last commit date of each file to generate the <lastmod> tag in the sitemap entry. It can include html as well as pdf files in the sitemap, and has inputs to control the included file types (defaults include both html and pdf files in the sitemap). It skips html files that contain <meta name="robots" content="noindex">. It otherwise does not currently attempt to respect a robots.txt file. The sitemap entries are sorted in a consistent order (primary sort is by depth of page in site, and URLs at same depth are then sorted alphabetically).

[1.5.0] - 2020-9-14

Changed

  • Minor refactoring of python, and optimized action load time by using a prebuilt base docker image that includes exactly what is needed (git and python).
generate-sitemap - generate-sitemap, v1.4.0

Published by cicirello about 4 years ago

generate-sitemap, v1.4.0

This action generates a sitemap for a website hosted on GitHub Pages. It supports both xml and txt sitemaps. When generating an xml sitemap, it uses the last commit date of each file to generate the <lastmod> tag in the sitemap entry. It can include html as well as pdf files in the sitemap, and has inputs to control the included file types (defaults include both html and pdf files in the sitemap). It skips over html files that contain <meta name="robots" content="noindex">. It otherwise does not currently attempt to respect a robots.txt file. The sitemap entries are sorted in a consistent order (primary sort is by depth of page in site, and URLs at same depth are then sorted alphabetically).

[1.4.0] - 2020-9-11

Changed

  • Completely re-implemented in Python to enable more easily adding planned future functionality.
Package Rankings
Top 3.65% on Github actions
Badges
Extracted from project README
cicirello/generate-sitemap - Generate XML sitemaps for static websites in GitHub Actions GitHub release (latest by date) Count of Action Users build CodeQL GitHub GitHub top language GitHub Sponsors Liberapay Ko-Fi
Related Projects