Go to main content

Textpattern CMS support forum

You are not logged in. Register | Login | Help

#1 2006-05-21 22:27:32

Sootah
Member
Registered: 2006-05-04
Posts: 27

Issue with permalinks and 404 errors

The SE crawlers are hitting my site nicely, but there’s an issue:

http://www.tweaksforgeeks.com/category/Internet/HIVEConsole/HIVEConsole/Barts_PE_McAfee_Setup.html

That doesn’t exist. It doesn’t even <i>kinda</i> exist. That’s some weird combo of a page that I have on my site, as well as a directory I have on my site, as well as the textpattern structure.

Notice how it shows a page? That should be a 404 error!

Legit URL: http://www.tweaksforgeeks.com/Barts_PE_McAfee_Setup.html
Legit URL: http://www.tweaksforgeeks.com/HIVEConsole

Where that other odd combo is coming from I don’t know, nor do I know why it doesn’t give a 404 error. I’m using the /section/id/title style of permalink and only have the auto excerpt plugin and a captcha plugin for comments installed.

Offline

#2 2006-05-21 22:46:18

zem
Developer Emeritus
From: Melbourne, Australia
Registered: 2004-04-08
Posts: 2,579

Re: Issue with permalinks and 404 errors

What version of Textpattern are you running? A similar URL works correctly for me (404 error) with recent revs.


Alex

Offline

#3 2006-05-21 23:09:51

Sootah
Member
Registered: 2006-05-04
Posts: 27

Re: Issue with permalinks and 404 errors

Latest version

Offline

#4 2006-05-22 01:40:06

zem
Developer Emeritus
From: Melbourne, Australia
Registered: 2004-04-08
Posts: 2,579

Re: Issue with permalinks and 404 errors

Ah, I think I can guess what’s going on: you have a category named ‘Internet’. /category/Internet is a valid Textpattern URL for that category. Textpattern ignores the extra /HIVEConsole/HIVEConsole/Barts_PE_McAfee_Setup.html junk after the category name, and just shows the Internet category (which contains no articles, by the look of it).

You could delete that category, if it’s superfluous – that should trigger a 404 error.


Alex

Offline

#5 2006-05-22 05:23:08

Sootah
Member
Registered: 2006-05-04
Posts: 27

Re: Issue with permalinks and 404 errors

I deleted the Internet category and now it just shows the main page when I try to go to that URL.

I really really want it to 404 because I’m pretty sure it’ll start dicking with my search engine results/placement if every URL that is invalid just starts showing results for stuff (and the same results at that)

Offline

#6 2006-05-22 05:28:39

Sootah
Member
Registered: 2006-05-04
Posts: 27

Re: Issue with permalinks and 404 errors

Interesting.. Textpattern handles the /category/ folder in an odd way. The URL: http://www.tweaksforgeeks.com/Internet/HIVEConsole/HIVEConsole/Barts_PE_McAfee_Setup.html will throw a 404. But if it’s http://www.tweaksforgeeks.com/category/really/anything/can-go-here

Then it just displays the category if it matches something, or displays everything if it doesn’t. I really don’t want it to do this.

Offline

#7 2006-05-22 05:31:11

Sootah
Member
Registered: 2006-05-04
Posts: 27

Re: Issue with permalinks and 404 errors

Here’s come crawling from the logs:

<code>
5/22 1:33 am crawl-66-249-72-48.googlebot.com category/Internet/HIVEConsole/HIVEConsole/Dell_A94…
0_Print_Spool_Hangs.html
5/22 1:33 am crawl-66-249-72-48.googlebot.com category/Windows-XP/HIVEConsole/HIVEConsole/Extrac…
t_Windows_XP_CD_Key.html
5/22 1:33 am crawl-66-249-72-48.googlebot.com category/Windows-Vista/HIVEConsole/HIVEConsole/msn…
metal.dll_0x00000485.html
5/22 1:33 am crawl-66-249-72-48.googlebot.com category/Tips/HIVEConsole/HIVEConsole/Windows_Upda…
te_Error_0x800a0007.HTML
5/22 1:33 am crawl-66-249-72-48.googlebot.com category/News/HIVEConsole/HIVEConsole/Windows_Upda…
te_Error_0x800a0007.HTML
5/22 1:33 am crawl-66-249-72-48.googlebot.com category/Tips/HIVEConsole/HIVEConsole/Windows_Upda…
te_Error_0x800A01AE.html
5/22 1:33 am crawl-66-249-72-48.googlebot.com category/Windows-Vista/HIVEConsole/HIVEConsole/Mak…
e_Win_XP_Boot_Floppy.html
5/22 1:33 am crawl-66-249-72-48.googlebot.com category/Windows-XP/HIVEConsole/HIVEConsole/Clean_…
up_MSConfig_Entries.html
5/22 1:33 am crawl-66-249-72-48.googlebot.com category/News/HIVEConsole/HIVEConsole/Windows_Upda…
te_Error_0x800A01AE.html
5/22 1:33 am crawl-66-249-72-48.googlebot.com category/Internet/HIVEConsole/HIVEConsole/Cant_Acc…
ess_Secure_Web_Site.html
</code>

Why EVER is it doing that? There are NO links to pages like that. Those pages exist, the .html part, but NONE of them are in the HIVEConsole directory. They’re all in the root.

Offline

#8 2006-05-22 21:58:53

Mary
Sock Enthusiast
Registered: 2004-06-27
Posts: 6,236

Re: Issue with permalinks and 404 errors

That’s weird (both the strange urls and the category mess). Do your logs state the referer url(s)?

Offline

#9 2006-05-22 22:36:09

zem
Developer Emeritus
From: Melbourne, Australia
Registered: 2004-04-08
Posts: 2,579

Re: Issue with permalinks and 404 errors

Crawlers almost never provide referrer strings.

The links might not be there at the moment, but they were at some point, perhaps while you were editing a page. Most likely you had some relative links (“HIVEConsole/HIVEConsole/Barts_PE_McAfee_Setup.html”) that were visible on the /category/Internet page.


Alex

Offline

Board footer

Powered by FluxBB