Textpattern CMS support forum
You are not logged in. Register | Login | Help
- Topics: Active | Unanswered
Pages: 1
consistent urls
Google penalises duplicate content and unfortunately the inconsistency of the urls produced by txp tags is not helping…
For the coming xmas I’d wish that Santa Wet and Santa Ruud to give us some consistency and prevent bad Google from punishing us.
the tags in question?
<txp:category_list />, <txp:section_list /> and any other which I cannot think of just now…
Till then, if anyone can think of a generic htaccess rule to redirect messy to clean, let us know:)
Last edited by colak (2008-12-27 17:54:47)
Yiannis
——————————
NeMe  | hblack.art | EMAP | A Sea change | Toolkit of Care
I do my best editing after I click on the submit button.
Offline
#2 2008-12-27 17:51:01
- els
- Moderator
 
- From: The Netherlands
- Registered: 2004-06-06
- Posts: 7,458
Re: consistent urls
Yiannis, category_list only produces (partly) messy URLs when using section pages (/section-name/?c=category-name), and you can solve that with gbp_permanent_links. But what is wrong with section_list?
Offline
Re: consistent urls
Hi Els, you are right section_list behaves just fine. Edited my original post. What I would like to see is consistency. This is not about faking site structures.
As we all know <txp:category_list /> produces /?c=category-name whereas <txp:category link="1" title="1" /> produces /category/category-name/
It is this inconsistency I am referring to. What I would like is to see is the category_list producing clean urls like the category tag. Alternatively for the category tag to produce messy urls like the category_list.
zem_redirect handles this but as Alex is not accepting any more subscribers to his plugin, I believe that at least this behaviour should be part of the core.
Yiannis
——————————
NeMe  | hblack.art | EMAP | A Sea change | Toolkit of Care
I do my best editing after I click on the submit button.
Offline
Re: consistent urls
As we all know <txp:category_list /> produces /?c=category-name
Only if the section is also mentioned in the URL, otherwise it does create /category/catname URLs.
Offline
Re: consistent urls
I do not think that Textpattern contains any inherent duplicate content generator as long as you manage to get your meta name="robots" content="noindex" right, for instance by emitting this element on category pages, search result pages, or article lists in general. 
IMHO, site architectures and corresponding URL schemata tend to differ too much to cast any fixed behaviour into core.
Offline
Re: consistent urls
Um… but we do have a fixed behaviour. All those links are created by the same function.
I cannot reproduce the problem Yiannis describes.
Offline
Re: consistent urls
re: duplicated content: 
on Google Webmaster Central blog: Demystifying the “duplicate content penalty”
I agree with wet: some manual work here is better than trying to find a fixed-in-the-core and long term solution.
I’m doodling since some weeks ago about opening a thread (yeah, in the “Feature ideas” category…) with this subject:
“adding /section/category/title URLs to core… or ‘Do we really need subsections?’”
Short version: I don’t think subsections are really the big thing that TXP lacks. And beyond TXP, subsections (old school subsections, in the sense of a folder) have proven not to be the panacea.
I mean, look at Gmail: it doesn’t have subfolders (“subsections”) where mails are trapped forever. GMail destroyed the metaphor of subfolders, by using labels (aka category, tags) and having just one first level of folders (“sections”, like Inbox, Sent, Drafts, etc).
Categories + some new URL schemas are the way to go.
That’s why I didn’t open the thread: I’m waiting for the new gbp_permanent_links version. :D
Offline
Re: consistent urls
True, we do have a fixed behaviour. What I meant above is you’d probably achieve similar content on some sites by either linking to the homepage or by linking to the “articles” section with a certain category, and the like.
As this duplication depends on the site’s architecture, we can do almost nothing about this duplication in the core. It’s up to the site’s designer to sprinkle meta name="robots" content="noindex" all over the place for category/search result/single article/… pages whereever she sees fit.
Offline
Re: consistent urls
colak wrote:
Till then, if anyone can think of a generic htaccess rule to redirect messy to clean, let us know:)
Rah_metas includes redirect and dublicate content related attributes :)
Rahform.biz’s rah_metas page wrote:
messy_to_clean_redirect
Use messy to clean redirect. Use this only if you use clean urls. With messy urls this only causes eternal loop.
Example:messy_to_clean_redirect="1"Default:""
Currently it redirects two core GETs, s and id. But not c as Els already explained why :)
Last edited by Gocom (2008-12-28 02:13:41)
Offline
Pages: 1




