We installed 301 URL Tracker on a site a few weeks ago, and it all seem to work fine, untill we got complaints that the site became very slow... which was true. After a few weeks it took at least 20 seconds to completely show the homepage. It turns out that 301 URL Tracker was the problem. The table 'icUrlTracker' in the database contains now over 400.000 records. But what surprised me the most was that a lot of those records containted path like /images/ or /fonts/.
I would expect that images and other files shouldn't be handled by this tracker, but someway it does... Is this a bug? And is there a way to ignore those complete paths?
Yes I've ran into this myself with a large website we've built.
There were literally millions of 404's logged in the icUrlTracker table, so 400.000 doesn't sound that bad ;-)
It's worth pointing out that if you are working with legacy sites, installing this package isn't a silver bullet solution for all your 404's, they don't just go away once you've installed it. I found this package to be immensely useful in helping to clear up many years of URL abuse caused by unregulated content changes, but you have to be prepared initially to invest time regularly dealing with those 404's with well thought out 301,302 or 410 responses. Eventually the 404's dry up. The recent addition of the 'Delete all 404 enties' button also helped me stay ahead of performance issues.
Images and other files tracked by url tracker?
We installed 301 URL Tracker on a site a few weeks ago, and it all seem to work fine, untill we got complaints that the site became very slow... which was true. After a few weeks it took at least 20 seconds to completely show the homepage. It turns out that 301 URL Tracker was the problem. The table 'icUrlTracker' in the database contains now over 400.000 records. But what surprised me the most was that a lot of those records containted path like /images/ or /fonts/.
I would expect that images and other files shouldn't be handled by this tracker, but someway it does... Is this a bug? And is there a way to ignore those complete paths?
Some examples:
Id OldUrl OldUrlQueryString OldRegex RedirectRootNodeId RedirectNodeId RedirectUrl RedirectHttpCode RedirectPassThroughQueryString Notes Is404 Referrer Inserted ForceRedirect
1 images/icons/svg/icon_arrow_right_white.png NULL NULL 1141 NULL NULL 301 1 NULL 1 https://www.xxx.nl/actueel/nieuws/dressuur/stephanie-kooijman-presteert-goed-in-chio-rotterdam-cup/ 2014-02-17 15:53:39.177 0
2 images/icon_search.png NULL NULL 1141 NULL NULL 301 1 NULL 1 https://www.xxx.nl/actueel/nieuws/xxxxxx/ 2014-02-17 15:53:39.283 0
7 fonts/2686E9_0_0.woff NULL NULL 1141 NULL NULL 301 1 NULL 1 https://www.xxxx.nl/xxxxx/ 2014-02-17 15:53:55.667 0
94 css/images/ui-bg_flat_75_ffffff_40x100.png NULL NULL 1141 NULL NULL 301 1 NULL 1 https://www.xxxx.nl/kenniscentrum/?q=xxxx 2014-02-17 15:57:47.257 0
Yes I've ran into this myself with a large website we've built.
There were literally millions of 404's logged in the icUrlTracker table, so 400.000 doesn't sound that bad ;-)
It's not a bug though; all 404's are logged by default, with some exceptions.
I think this would be really helpful for these situations: http://our.umbraco.org/projects/developer-tools/301-url-tracker/version-2/48922-404PathToIgnore-setting
Also I'm thinking about giving warnings in the umbraco back-office when the table is becoming large, but how does one define 'large'?
Ok thanks, so I guess we need to wait for the next version then.
For now I disabled the package. The whole server became slow, including their cloud environment... So, looking forward to the next update ;-)
It's worth pointing out that if you are working with legacy sites, installing this package isn't a silver bullet solution for all your 404's, they don't just go away once you've installed it. I found this package to be immensely useful in helping to clear up many years of URL abuse caused by unregulated content changes, but you have to be prepared initially to invest time regularly dealing with those 404's with well thought out 301,302 or 410 responses. Eventually the 404's dry up. The recent addition of the 'Delete all 404 enties' button also helped me stay ahead of performance issues.
is working on a reply...
This forum is in read-only mode while we transition to the new forum.
You can continue this topic on the new forum by tapping the "Continue discussion" link below.