Textpattern CMS support forum
You are not logged in. Register | Login | Help
- Topics: Active | Unanswered
#1 2004-06-28 06:13:15
- zem
- Developer Emeritus
- From: Melbourne, Australia
- Registered: 2004-04-08
- Posts: 2,579
[plugin] [ORPHAN] zem_redirect
Hi,
Textpattern articles are currently reachable via several different URL paths. For example:
http://example.com/article/5/
http://example.com/article/5/article-title
http://example.com/article/5/old-article-title-before-i-renamed-it-to-something-shorter
External factors can multiply these (e.g. www.example.com vs. example.com).
This can cause problems with search engines and link aggregators, since they don’t have any way of knowing that all of these URLs represent the same page (as opposed to multiple pages with identical content).
txp:zem_redirect is designed to help reduce the problem a bit. It checks the URL used to fetch each article; if it doesn’t match the correct URL for the article (including the article-title and full hostname as specified in the Textpattern config), it’ll return a 301 Moved Permanently response to direct the user’s browser to the correct URL. It also works as a 404 handler for articles that don’t exist or are not visible.
Just plonk the tag <txp:zem_redirect /> at the top of your article page, and it’ll do the rest. It must be the very first tag on the page, before the doctype. It has no effect in messy URL mode, or on pages other than individual articles.
warning – if you’re already using URL rewriting or other clever redirection, there’s a chance this could cause circular redirects. Test carefully, and don’t say I didn’t warn ye.
http://vigilant.tv/documents/tp/zem_redirect-0.5.txt
z.
As this plugin is no longer supported we provide it here
a:10:{s:4:"name";s:12:"zem_redirect";s:6:"author";s:11:"Alex Shiels";s:10:"author_uri";s:26:"http://thresholdstate.com/";s:7:"version";s:5:"1.2.1";s:11:"description";s:59:"Redirect Pro: handle incorrect, outdated and duplicate URLs";s:4:"help";s:297:"<p>This plug-in's help documents are available to paying subscribers only. <strong>New subscriptions are currently unavailable.</strong> Current subscribers can access the help documents by visiting: <a href="http://thresholdstate.com/subscribers/ ">http://thresholdstate.com/subscribers/ </a></p>";s:4:"code";s:5670:"/*

Copyright 2005-2007 Alex Shiels http://thresholdstate.com/

This program is free software; you can redistribute it and/or
modify it under the terms of the GNU General Public License
version 2 as published by the Free Software Foundation.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

*/

function zem_redirect_fix_url($url, $debug=0) {
	$parts = parse_url($url);
	extract($parts);
	$q = preg_replace('@&amp;@', '&', @$query);
	$q = $q ? '?'.$q : '';

	$host = empty($scheme) ? serverSet('HTTP_HOST') : $host;
	$scheme = empty($scheme) ? 'http' : $scheme;
	$port = empty($port) ? '' : ':'.$port;

	return $scheme.'://'.$host.$port.'/'.ltrim($path, '/').$q;
}

function zem_redirect_send_headers($location, $type='', $debug=0) {
	global $prefs, $pretext;
	extract($prefs);

	if ($production_status == 'debug' or $debug) {
		echo '<div style="color:#000;background-color:#fee;">'
			.'zem_redirect: from <a href="'.$pretext['request_uri'].'">'.$pretext['request_uri'].'</a> to <a href="'.$location.'">'.$location.'</a></div>'.n;
		return $location;
	}
	else {
		if ($production_status == 'live') {
			if (!$type)
				$type = '301 Moved Permanently';
		}
		else
			$type = '302 Found';

		txp_status_header($type);
		header("Location: $location");
		ob_flush();
		exit;
	}

}


	function zem_redirect_url($test=0) {
		global $pretext, $siteurl, $path_from_root, $prefs;

		extract($prefs);

 		if (empty($permlink_mode)
 			or $permlink_mode == 'messy'
			or strcasecmp(serverSet('REQUEST_METHOD'), 'POST') == 0
			or gps('txpreview'))
 			return;


		$actual_host = serverSet("HTTP_HOST");
		$actual_path = ($test !== 0 ? $test : $pretext['request_uri']);
		$actual_url = 'http://'.$actual_host.$actual_path;
		$canonical_url = '';

		# other unknown query parameters might be used by plugins
		$get = gpsa(array_keys($_GET));
		if (!$get) $get = array();
		unset($get['id']);
		unset($get['s']);
		unset($get['c']);

		// remove bogus GET paramaters inserted by rss_unlimited_categories
		global $plugins;
		if (@in_array('rss_unlimited_categories', $plugins)) {
			unset($get['request_uri']);
			unset($get['qs']);
		}

		if (gps('atom') or gps('rss')) {
			$canonical_url = pagelinkurl($get);
		}
		elseif ($pretext['status'] == 404) {
			// url with subdir stripped
			$url = @$pretext['req'];

			if (preg_match('@/(\d{4})/(\d{2})/(\d{2})/([^/?]*)/?@', $url, $m)) {
				$when = $m[1].'-'.$m[2].'-'.$m[3];
				$rs = lookupByDateTitle($when,$m[4]);
				if (!$rs)
					$rs = lookupByTitle($m[4]);
			}
			elseif (preg_match('@/([^/]+)/(\d+)/([^/?]+)/?@', $url, $m)) {
				$rs = lookupByID($m[2]);
			}
			elseif (preg_match('@/([^/?]+)/([^/?]+)/?@', $url, $m)) {
				if (is_numeric($m[2]))
					$rs = lookupByID($m[2]);
				else {
					$rs = lookupByTitleSection($m[2], $m[1]);
					if (!$rs)
						$rs = lookupByTitle($m[2]);
				}
			}
			elseif (preg_match('@/([^/?]+)/?@', $url, $m)) {
				$rs = lookupByTitle($m[1]);
			}

			if ($rs) {
				$canonical_url = permlinkurl_id($rs['ID']) . join_qs($get);
			}
			else {
				return;
			}
		}
		elseif ($pretext['s'] == 'file_download') {
			return;
		}
		elseif ($pretext['month'] and $permlink_mode == 'year_month_day_title') {
			return;
		}
		elseif ($pretext["id"]) {
			# Article page
			$parts = parse_url($pretext['req']);
			if (isset($parts['path'])) {
				$pathinfo = pathinfo($parts['path']);
				if (!empty($pathinfo['extension']) and $pathinfo['basename'] != 'index.php') {
					return;
				}
			}
			$id = safe_field("ID","textpattern","ID='".doSlash($pretext["id"])."' and Status IN ('4', '5') limit 1");
			if ($id) {
				$canonical_url = permlinkurl_id($id);
			}

			# reattach messy URL parameters, if any
			$canonical_url .= join_qs($get);
		}
		else {
			# List page
			$canonical_url = pagelinkurl($get, array('s'=>$pretext['s'], 'c'=>$pretext['c']));
		}

		if ($canonical_url) {

			# fix up ampersands
			$canonical_url = zem_redirect_fix_url($canonical_url);
			if ($actual_url != $canonical_url) {
				return zem_redirect_send_headers($canonical_url, '301 Moved Permanently', $test !== 0);
			}
		}

	}


function zem_redirect_handler($event, $step) {
	if ($event == 'textpattern')
		zem_redirect_url();
}

register_callback('zem_redirect_handler', 'textpattern');

if (txpinterface == 'public' and (gps('atom') or gps('rss'))) {
	zem_redirect_url();
}


function zem_redirect($atts) {
	global $pretext;

	extract(lAtts(array(
		'to' => '',
		'from' => '',
		'debug' => 0,
		'type' => '302',
	), $atts));

	$to = trim($to);
	$from = trim($from);


	$dest = '';
	if ($from and $to) {
		$from = addcslashes($from, '@');
		$to = addcslashes($to, '@');
		if (preg_match('@'.$from.'@', $pretext['request_uri'])) {
			$out = preg_replace('@'.$from.'@', $to, $pretext['request_uri']);
			$dest = zem_redirect_fix_url($out, $debug);
		}
	}
	elseif ($to) {
		$dest = zem_redirect_fix_url($to, $debug);
	}
	else {
		trigger_error('No destination specified');
	}

	$actual_host = serverSet("HTTP_HOST");
	$actual_path = ($test !== 0 ? $test : $pretext['request_uri']);
	$actual_url = 'http://'.$actual_host.$actual_path;

	if ($dest and $dest != $actual_url) {
		$status = ($type == '301' ? '301 Moved Permanently' : '302 Found');
		return zem_redirect_send_headers($dest, $status, $debug);
	}

}
";s:4:"type";s:1:"0";s:5:"order";s:1:"5";s:3:"md5";s:32:"d3e364be665cd83c521eaf87b247432f";}
Last edited by zem (2004-07-10 02:53:25)
Alex
Offline
Re: [plugin] [ORPHAN] zem_redirect
Hi zem,
thank you very much. That sounds like a great plugin. :)
I’ll definitely give it a try.
Offline
Re: [plugin] [ORPHAN] zem_redirect
This is a great idea for a plugin. I can’t wait to try it out as well.
Offline
#4 2004-06-28 21:12:54
- DougBTX
- Archived Plugin Author
- Registered: 2004-05-23
- Posts: 22
Re: [plugin] [ORPHAN] zem_redirect
belle
Offline
Re: [plugin] [ORPHAN] zem_redirect
works great! no more ‘redirect 301’! thanks, zem!
Offline
#6 2004-07-04 01:14:50
- zem
- Developer Emeritus
- From: Melbourne, Australia
- Registered: 2004-04-08
- Posts: 2,579
Re: [plugin] [ORPHAN] zem_redirect
There’s a new version up:
http://vigilant.tv/documents/tp/zem_redirect-0.2.txt
Changes in this version:
- Now supports the article URL title feature
- Works with section and default pages (adds a trailing slash if neccesary, etc)
Last edited by zem (2004-07-04 01:15:05)
Alex
Offline
#7 2004-07-04 01:41:40
- zem
- Developer Emeritus
- From: Melbourne, Australia
- Registered: 2004-04-08
- Posts: 2,579
Re: [plugin] [ORPHAN] zem_redirect
And again:
http://vigilant.tv/documents/tp/zem_redirect-0.3.txt
Changes:
- Returns a 404 response for non-existant or unpublished articles
- Calls exit() immediately after redirect or 404, to stop further output
Alex
Offline
Re: [plugin] [ORPHAN] zem_redirect
I can’t seem to get it to work. The thing keeps on redirecting itself whenever i try to access an individual article. Any ideas?
Offline
Re: [plugin] [ORPHAN] zem_redirect
Excellent plugin, zem. You might want consider letting people specify an article number (through a plugin parameter, preferably) containing an error message and redirect to it in case of a 404.
Also, you should check for ‘query’ index in a way that doesn’t generate the ‘Undefined indexes’ notice, e. g.
if (!empty($parts["query"]))
Who’s gonna textdrive you home tonight?
Offline
#10 2004-07-05 02:11:49
- zem
- Developer Emeritus
- From: Melbourne, Australia
- Registered: 2004-04-08
- Posts: 2,579
Re: [plugin] [ORPHAN] zem_redirect
ThrokFeroth, do you have any other URL redirection or rewriting configured? Is your “web domain” setting correct?
Mamash – thanks for the suggestions. I’ll put together a new version that supports a few attributes to control its behaviour.
Alex
Offline
Re: [plugin] [ORPHAN] zem_redirect
Firstly, I use my default template for my Thoughts section. However if I add the redirect, the thing will redirect me to http://relativemind.net/default which is not a section at all, but merely the page template name.
Next, if I create a template called Thoughts and add the redirect, the thing exceeds redirection limits everytime I try to access the individual article.
Well, I do have a redirect that redirects all who access http://www.relativemind.net to http://relativemind.net. Is this the thing which causes it not to work?
Offline
Re: [plugin] [ORPHAN] zem_redirect
i had the same problem with it constantly appending /default/ to the URL. but i’m not really worried about catching 404s anyway, so i won’t worry about it right now.
Listen to Kenneth
Offline