domain.com/this-page-here
can serve entirely different content from domain.com/this-page-here/
depending on the server (and application) configuration.

Pretty much the only fully reliable way to deduplicate URLs is to compare the content they return. But that content can change from page load to page load, so comparing it is a whole other problem.
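
As a rough illustration of comparing content rather than URL strings, here is a minimal Python sketch. It assumes the page bodies have already been fetched, and the names `content_fingerprint` and `group_by_content` and the sample pages are made up for the example. The tag stripping is deliberately crude, and an exact hash like this only catches pages whose visible text is identical.

```python
import hashlib
import re


def content_fingerprint(html: str) -> str:
    """Hash a page body after crude tag removal and whitespace collapsing."""
    text = re.sub(r"<[^>]+>", " ", html)  # naive tag stripping, not a real HTML parser
    text = re.sub(r"\s+", " ", text).strip().lower()
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def group_by_content(pages: dict[str, str]) -> dict[str, list[str]]:
    """Group URLs by the fingerprint of the body each one returned."""
    groups: dict[str, list[str]] = {}
    for url, body in pages.items():
        groups.setdefault(content_fingerprint(body), []).append(url)
    return groups


# Hypothetical, already-fetched bodies (no network access in this sketch).
pages = {
    "domain.com/this-page-here": "<h1>Hello</h1>\n<p>Same page</p>",
    "domain.com/this-page-here/": "<h1>Hello</h1> <p>Same   page</p>",
    "domain.com/other": "<h1>Different</h1>",
}

# Keep only the groups that contain more than one URL.
duplicates = [urls for urls in group_by_content(pages).values() if len(urls) > 1]
```

Anything that varies per load (timestamps, ads, session tokens) would defeat an exact hash, so a real deduplicator would probably need some form of near-duplicate comparison instead.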