Textpattern CMS support forum

#1 2007-07-27 15:41:03

Michael
Member
From: Vienna/Austria
Registered: 2004-03-25
Posts: 147
Website

WP-Import: Article-bodies stop where umlaut should appear

What I did

  1. I imported my TXP articles into WordPress (to give it a try), using the Textile plugin there to display those articles correctly. (See here)
  2. I then installed TXP again to return to this CMS and imported all entries from the WP database (the original TXP database is no longer available).

The Problem

Now I see all my entries imported into TXP (even comments and so on), but each article stops whenever there is an umlaut.

If the original sentence is “Ich wundere mich darüber sehr”, the imported sentence is “Ich wundere mich dar” (and the article-body stops here) – the import script stopped before the umlaut… Compare, for example, these two articles:

The Question

How can I fix this problem?

Thanks a lot for any help!

Michael

Last edited by Michael (2007-07-27 18:23:41)

Offline

#2 2007-07-28 10:20:30

Michael
Member
From: Vienna/Austria
Registered: 2004-03-25
Posts: 147
Website

Re: WP-Import: Article-bodies stop where umlaut should appear

Anyone?

Offline

#3 2007-07-28 10:44:43

Sencer
Archived Developer
From: cgn, de
Registered: 2004-03-23
Posts: 1,803
Website

Re: WP-Import: Article-bodies stop where umlaut should appear

What is the character set of the WordPress tables? And what is the character set of the Textpattern tables? You can find out with a tool like phpMyAdmin; it should be something like “latin” or “utf8”. Be careful to check in the right place (the columns and tables), as MySQL allows the character set to be set in a dozen different places.
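
To illustrate why the character set matters here, the following is a minimal Python sketch of one possible mechanism, not a confirmed diagnosis of this thread's problem. It assumes the article text was stored or transferred as Latin-1 bytes and then read as UTF-8, with everything from the first invalid byte onward being discarded. The function name `truncate_at_invalid_utf8` is hypothetical and is not part of Textpattern or WordPress.

```python
def truncate_at_invalid_utf8(raw: bytes) -> str:
    """Return the longest leading part of `raw` that is valid UTF-8.

    This mimics a consumer that silently drops everything from the
    first invalid byte onward (an assumption made for illustration).
    """
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the index of the first byte that could not be decoded.
        return raw[:err.start].decode("utf-8")


original = "Ich wundere mich darüber sehr"

# In Latin-1 the "ü" is the single byte 0xFC, which is never valid in UTF-8.
latin1_bytes = original.encode("latin-1")
print(truncate_at_invalid_utf8(latin1_bytes))   # Ich wundere mich dar

# The same text encoded as real UTF-8 survives intact.
utf8_bytes = original.encode("utf-8")
print(truncate_at_invalid_utf8(utf8_bytes))     # Ich wundere mich darüber sehr
```

If this is what is happening, the text would be cut exactly at the first umlaut, which matches the symptom described in the first post.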

Offline

#4 2007-07-28 13:35:33

Michael
Member
From: Vienna/Austria
Registered: 2004-03-25
Posts: 147
Website

Re: WP-Import: Article-bodies stop where umlaut should appear

Both TXP and WP are on the same server, and both use databases on that server.

WP-Database: everything “utf8_general_ci” (except the Text-Link-Ads plugin tables, but I don't think they matter…)
TXP-Database: everything “utf8_general_ci”

Offline

#5 2007-07-28 15:30:14

Sencer
Archived Developer
From: cgn, de
Registered: 2004-03-23
Posts: 1,803
Website

Re: WP-Import: Article-bodies stop where umlaut should appear

Can you post or link to your high-level diagnostics from the Textpattern administration panel? Did you ever have any issues on your WordPress install with Unicode or certain characters?

Offline

#6 2007-07-28 15:39:37

Michael
Member
From: Vienna/Austria
Registered: 2004-03-25
Posts: 147
Website

Re: WP-Import: Article-bodies stop where umlaut should appear

Here it is:

------------------------------------

Textpattern version: 4.0.5 (r2466)
Last update: 2007-07-27 17:42:43/2007-07-26 22:38:18
Document Root: /BLA/domains/3th.be/html (/BLA/domains/3th.be/html)
$path_to_site: /BLA/domains/3th.be/html/neu
Textpattern path: /BLA/domains/3th.be/html/neu/textpattern
URL scheme: year_month_day_title
Temporary directory: /BLA/domains/3th.be/html/neu/textpattern/tmp
Site URL: 3th.be/neu
PHP version: 4.4.7
GD graphics library: bundled (2.0.28 compatible); supported image formats: GIF, JPG, PNG.
Local server time: 2007-07-28 08:37:02
MySQL: 4.1.11-Debian_4sarge7
Locale: de_DE.UTF-8
Server: Apache/2.0.54
PHP Server API: cgi-fcgi
RFC 2616 headers:
Server operating system: Linux 2.6.20.2-1

Contents of the .htaccess file:
------------------------------------
#DirectoryIndex index.php index.html

#Options +FollowSymLinks
#Options -Indexes

<IfModule mod_rewrite.c>
RewriteEngine On
#RewriteBase /relative/web/path/

RewriteCond %{REQUEST_FILENAME} -f [OR]
RewriteCond %{REQUEST_FILENAME} -d
RewriteRule ^(.+) - [PT,L]

RewriteRule ^(.*) index.php
</IfModule>

#php_value register_globals 0
------------------------------------

Charset (default/config): latin1/utf8
character_set_client: utf8
character_set_connection: utf8
character_set_database: utf8
character_set_results: utf8
character_set_server: latin1
character_set_system: utf8
character_sets_dir: /usr/share/mysql/charsets/
17 Tables: -

PHP extensions: zip, xslt, xmlrpc/0.51, xml, wddx, tokenizer/0.1, standard/4.4.7, sockets, session, pspell, posix, pgsql, overload, mysql, mime_magic/0.1, mhash, mcrypt, mcal, mbstring, ldap, imap, iconv, gettext, gd, ftp, filepro, exif/1.4 $Id: exif.c,v 1.118.2.37.2.7 2007/01/09 11:38:04 tony2001 Exp $, domxml/20020815, dbx, dba, curl, ctype, crack, calendar, bz2, bcmath, zlib/1.1, pcre, openssl, Zend Optimizer

pretext_data: array ( ‘id’ => ‘’, ‘s’ => ‘’, ‘c’ => ‘’, ‘q’ => ‘’, ‘pg’ => ‘’, ‘p’ => ‘’, ‘month’ => ‘’, ‘author’ => ‘’, ‘request_uri’ => ‘/neu/5fcca1b503f194ba985b95d8d30bab10/?txpcleantest=1’, ‘qs’ => ‘txpcleantest=1’, ‘subpath’ => ‘\\/neu\\/’, ‘req’ => ‘/5fcca1b503f194ba985b95d8d30bab10/?txpcleantest=1’,
)

/include/txp_category.php: r2243 (3706fea923cd77f7053f7803de169df4)
/include/txp_plugin.php: r1917 (c63f72f33986c08367672fc9fe7b42dd)
/include/txp_auth.php: r2356 (33255ec1ea1a825163c78272496d8783)
/include/txp_form.php: r1913 (ecea3fecf9d7d1f8088cda67f097eceb)
/include/txp_section.php: r1891 (1f0121b3e2969d94bc8a7fb98bfdfbd5)
/include/txp_tag.php: r2260 (1bd67bdb9dcfb72e34ea967e39406216)
/include/txp_list.php: r2450 (997a3b1bec7115bf49b76f62b28da146)
/include/txp_page.php: r2099 (56bde34b6c7bcb9123ac91e73065e894)
/include/txp_discuss.php: r2451 (91e0b29ef39a9471ae5c78d0b1bba086)
/include/txp_prefs.php: r2405 (a4b76476930b2376199f23fbfd5f1ac9)
/include/txp_log.php: r2439 (16730c34e2a437dd88b8f5cc7eff8218)
/include/txp_preview.php: r1238 (696728f35f3557b648c011bb4d6496c3)
/include/txp_image.php: r2439 (9fac6ed0d9d4c3d8196492051f38dc9a)
/include/txp_article.php: r2453 (bdac8fcac5df2f93f10afa7e50c3fb6f)
/include/txp_css.php: r2403 (4e8c52bb1cf5bfe2e2f0640892f9b92e)
/include/txp_admin.php: r2403 (f8700a3d453ece08e7f137b47c967eda)
/include/txp_link.php: r2463 (0a0171bf606296106332d3fdcb83a678)
/include/txp_diag.php: r2361 (dccf3269049dd25e59afdd7ad8d235cd)
/include/txp_file.php: r2403 (e62abd5fcadabe629322ed17135d89eb)
/include/txp_import.php: r1238 (70a6207c0f3604ecfc4b20369986c4d7)
/lib/admin_config.php: r1747 (a2eb09f94d7902a6e95750fc4abcea17)
/lib/txplib_misc.php: r2464 (615afd44a10311f1c0b7852d9bc15d24)
/lib/taglib.php: r1535 (9b519f9dc88791e5ee8eacc029dd6975)
/lib/txplib_head.php: r2404 (2e067b25997cf67cddbdd365570e69d5)
/lib/classTextile.php: r2462 (a031e2ea894e339711c601f230c5ee71)
/lib/txplib_html.php: r2403 (97e173da3058b438513df67fd7d1ceca)
/lib/txplib_db.php: r2406 (5ed67642f805639b54e381fb22efd208)
/lib/IXRClass.php: r765 (137b91497628f0058a2fca9eba5c3b7f)
/lib/txplib_forms.php: r2403 (438a734b52acef40b36d8a3ba23987e8)
/lib/class.thumb.php: r2329 (b2a2fda54371dbd6c40ba553941f090e)
/lib/constants.php: r2361 (ab6d51668fab1e3c98e7d520b1a59f0f)
/lib/txplib_update.php: r1239 (10f28a986d23187b436369dc29ab552f)
/lib/txplib_wrapper.php: r2286 (419125ec74a17a70bf1e86ebfcd45253)
/publish/taghandlers.php: r2444 (cc9de8f2018b01398a2ba542c5f5bdc6)
/publish/atom.php: r2402 (46c4402717f695fde0d49d806adfa4c4)
/publish/log.php: r1637 (5254d0f3942086bc55723923307a51db)
/publish/comment.php: r2460 (2d1ae1dec0784f044e7005fa5ed50930)
/publish/search.php: r1748 (8c86ebcb5be08e214d81ca15a32164ca)
/publish/rss.php: r2393 (09aac29bf22ffa71c1e118e851cff3c3)
/publish.php: r2436 (7087864f1e7c6efe096d3b8e07c350b1)
/index.php: r2466 (30ecf35de5c1edc6ef68e780c8c79daa)
/css.php: r944 (8beba8f83a091068723435cdcdc02f2f)

Last edited by Michael (2007-07-28 15:40:16)

Offline

#7 2007-07-29 07:25:40

Sencer
Archived Developer
From: cgn, de
Registered: 2004-03-23
Posts: 1,803
Website

Re: WP-Import: Article-bodies stop where umlaut should appear

All of that indicates that things should work out OK, so I am a bit surprised. Maybe someone more knowledgeable about WordPress has an idea what could trigger it.
If not, I could offer to look at the dump and make sure the problem doesn’t lie there, if you are OK with sending me a dump of the WP database.

Offline

#8 2007-07-29 09:51:25

Michael
Member
From: Vienna/Austria
Registered: 2004-03-25
Posts: 147
Website

Re: WP-Import: Article-bodies stop where umlaut should appear

No problem, I can do that. Just tell me two things, please:

  • Which options do I have to use to export the dump and
  • Where should I send it to?

Last edited by Michael (2007-07-29 12:32:47)

Offline

#9 2007-09-09 10:53:46

Rei
Member
From: MY / SG (GMT +08)
Registered: 2007-08-31
Posts: 14

Re: WP-Import: Article-bodies stop where umlaut should appear

“each article stops whenever there is an umlaut” – yes, I had the same problem too.
In the end I had to use the wp-export-mt.php script to export the database in MovableType format (very important: save the text file in UTF-8), and the “import from MovableType file” then went well. See my other related reply.
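
For the "save the text file in UTF-8" step, here is a minimal Python sketch of one way to re-save an export file as UTF-8. It assumes that any file which is not already valid UTF-8 is Latin-1; that fallback is a guess, not something stated in this thread. The function name `reencode_to_utf8` and the file names in the usage note are hypothetical.

```python
from pathlib import Path


def reencode_to_utf8(src: Path, dst: Path, fallback: str = "latin-1") -> str:
    """Write the contents of `src` to `dst` as UTF-8.

    Returns the encoding that was used to read `src`. The file is tried
    as UTF-8 first; if that fails, `fallback` is used instead (assumed
    to be Latin-1 here, adjust it if your export uses something else).
    """
    raw = src.read_bytes()
    try:
        text = raw.decode("utf-8")
        used = "utf-8"
    except UnicodeDecodeError:
        text = raw.decode(fallback)
        used = fallback
    # Write bytes directly so line endings are left untouched.
    dst.write_bytes(text.encode("utf-8"))
    return used
```

It could be called as `reencode_to_utf8(Path("wp-export.txt"), Path("wp-export-utf8.txt"))`, with the resulting file then fed to the MovableType import. Keep the original file until the imported articles have been checked.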

Offline
