Textpattern CMS support forum
WP-Import: Article-bodies stop where umlaut should appear
What I did
- I imported TXP-articles into Wordpress (to give it a try), using the textile-plugin there to display those articles correctly. (See here)
- I installed TXP to go back to this CMS and imported all entries from the WP-Database (TXP database no longer available).
The Problem
Now I see all my entries imported into TXP (even comments and so on), but each article stops whenever there is an umlaut.
If the original sentence is “Ich wundere mich darüber sehr”, the imported sentence is “Ich wundere mich dar” (and the article-body stops here) – the import script stopped before the umlaut… Compare, for example, these two articles:
- Arrivederci on Wordpress (as it should be)
- Arrivederci on Textpattern (cut off before the umlaut)
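A cut-off exactly at the first umlaut is what one would expect if Latin-1 bytes reached something that insists on UTF-8 and keeps only the valid leading part. That is only one plausible explanation; nothing in this thread confirms it as the cause here. A minimal Python sketch of the effect, where the helper name `utf8_valid_prefix` is made up for illustration:

```python
def utf8_valid_prefix(raw: bytes) -> str:
    """Return the longest leading part of raw that decodes as UTF-8."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the offset where the first invalid sequence begins.
        return raw[:err.start].decode("utf-8")


sentence = "Ich wundere mich darüber sehr"
latin1_bytes = sentence.encode("latin-1")  # "ü" is the single byte 0xFC
utf8_bytes = sentence.encode("utf-8")      # "ü" is the two bytes 0xC3 0xBC

print(utf8_valid_prefix(latin1_bytes))  # Ich wundere mich dar
print(utf8_valid_prefix(utf8_bytes))    # Ich wundere mich darüber sehr
```

The Latin-1 version loses everything from the umlaut onward, which matches the truncated sentence quoted above.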
The Question
How can I fix this problem?
Thanks a lot for any help!
Michael
Last edited by Michael (2007-07-27 18:23:41)
Re: WP-Import: Article-bodies stop where umlaut should appear
What is the character set of the WordPress tables, and what is the character set of the Textpattern tables? You can find out with a tool like phpMyAdmin; it should be something like “latin” or “utf8”. Be careful to check in the right place (the columns and tables), as MySQL allows the character set to be set for a dozen things in different places.
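For anyone who prefers a query window over clicking through phpMyAdmin, the same check can be done with a few standard MySQL statements. The sketch below only builds the statements as strings; the function name `charset_checks` is hypothetical, and the table names `wp_posts` and `textpattern` are assumed defaults that may differ on a given install (for example with a table prefix).

```python
def charset_checks(tables):
    """Build MySQL statements that reveal character sets at several levels."""
    # Server, database and connection settings.
    statements = ["SHOW VARIABLES LIKE 'character_set%';"]
    for table in tables:
        # Table default character set.
        statements.append(f"SHOW CREATE TABLE `{table}`;")
        # Per-column collation, which can differ from the table default.
        statements.append(f"SHOW FULL COLUMNS FROM `{table}`;")
    return statements


# Assumed default table names; adjust to the actual install.
for stmt in charset_checks(["wp_posts", "textpattern"]):
    print(stmt)
```

Running the printed statements against both databases shows the server, table and column levels side by side, so a mismatch at any one level stands out.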
Re: WP-Import: Article-bodies stop where umlaut should appear
Both TXP and WP are on the same server, and both use databases on the same server.
WP-Database: everything “utf8_general_ci” (except the Text-Link-Ads plugin tables, but I don’t think they matter…)
TXP-Database: everything “utf8_general_ci”
Re: WP-Import: Article-bodies stop where umlaut should appear
Can you post or link to the high-level diagnostics from the Textpattern administration panel? Did you ever have any issues with Unicode or certain characters on your WordPress install?
Re: WP-Import: Article-bodies stop where umlaut should appear
Here it is:
——————-
Textpattern version: 4.0.5 (r2466)
Last update: 2007-07-27 17:42:43/2007-07-26 22:38:18
Document Root: /BLA/domains/3th.be/html (/BLA/domains/3th.be/html)
$path_to_site: /BLA/domains/3th.be/html/neu
Textpattern path: /BLA/domains/3th.be/html/neu/textpattern
URL scheme: year_month_day_title
Temporary directory: /BLA/domains/3th.be/html/neu/textpattern/tmp
Site URL: 3th.be/neu
PHP version: 4.4.7
GD graphics library: bundled (2.0.28 compatible); supported image formats: GIF, JPG, PNG.
Server local time: 2007-07-28 08:37:02
MySQL: 4.1.11-Debian_4sarge7
Locale: de_DE.UTF-8
Server: Apache/2.0.54
PHP Server API: cgi-fcgi
RFC 2616 headers:
Server operating system: Linux 2.6.20.2-1
Contents of the .htaccess file:
————————————
#DirectoryIndex index.php index.html
#Options +FollowSymLinks
#Options -Indexes
<IfModule mod_rewrite.c>
RewriteEngine On
#RewriteBase /relative/web/path/
RewriteCond %{REQUEST_FILENAME} -f [OR]
RewriteCond %{REQUEST_FILENAME} -d
RewriteRule ^(.+) - [PT,L]
RewriteRule ^(.*) index.php
</IfModule>
#php_value register_globals 0
————————————
Charset (default/config): latin1/utf8
character_set_client: utf8
character_set_connection: utf8
character_set_database: utf8
character_set_results: utf8
character_set_server: latin1
character_set_system: utf8
character_sets_dir: /usr/share/mysql/charsets/
17 Tables: -
PHP extensions: zip, xslt, xmlrpc/0.51, xml, wddx, tokenizer/0.1, standard/4.4.7, sockets, session, pspell, posix, pgsql, overload, mysql, mime_magic/0.1, mhash, mcrypt, mcal, mbstring, ldap, imap, iconv, gettext, gd, ftp, filepro, exif/1.4 $Id: exif.c,v 1.118.2.37.2.7 2007/01/09 11:38:04 tony2001 Exp $, domxml/20020815, dbx, dba, curl, ctype, crack, calendar, bz2, bcmath, zlib/1.1, pcre, openssl, Zend Optimizer
pretext_data: array (
'id' => '',
's' => '',
'c' => '',
'q' => '',
'pg' => '',
'p' => '',
'month' => '',
'author' => '',
'request_uri' => '/neu/5fcca1b503f194ba985b95d8d30bab10/?txpcleantest=1',
'qs' => 'txpcleantest=1',
'subpath' => '\\/neu\\/',
'req' => '/5fcca1b503f194ba985b95d8d30bab10/?txpcleantest=1',
)
/include/txp_category.php: r2243 (3706fea923cd77f7053f7803de169df4)
/include/txp_plugin.php: r1917 (c63f72f33986c08367672fc9fe7b42dd)
/include/txp_auth.php: r2356 (33255ec1ea1a825163c78272496d8783)
/include/txp_form.php: r1913 (ecea3fecf9d7d1f8088cda67f097eceb)
/include/txp_section.php: r1891 (1f0121b3e2969d94bc8a7fb98bfdfbd5)
/include/txp_tag.php: r2260 (1bd67bdb9dcfb72e34ea967e39406216)
/include/txp_list.php: r2450 (997a3b1bec7115bf49b76f62b28da146)
/include/txp_page.php: r2099 (56bde34b6c7bcb9123ac91e73065e894)
/include/txp_discuss.php: r2451 (91e0b29ef39a9471ae5c78d0b1bba086)
/include/txp_prefs.php: r2405 (a4b76476930b2376199f23fbfd5f1ac9)
/include/txp_log.php: r2439 (16730c34e2a437dd88b8f5cc7eff8218)
/include/txp_preview.php: r1238 (696728f35f3557b648c011bb4d6496c3)
/include/txp_image.php: r2439 (9fac6ed0d9d4c3d8196492051f38dc9a)
/include/txp_article.php: r2453 (bdac8fcac5df2f93f10afa7e50c3fb6f)
/include/txp_css.php: r2403 (4e8c52bb1cf5bfe2e2f0640892f9b92e)
/include/txp_admin.php: r2403 (f8700a3d453ece08e7f137b47c967eda)
/include/txp_link.php: r2463 (0a0171bf606296106332d3fdcb83a678)
/include/txp_diag.php: r2361 (dccf3269049dd25e59afdd7ad8d235cd)
/include/txp_file.php: r2403 (e62abd5fcadabe629322ed17135d89eb)
/include/txp_import.php: r1238 (70a6207c0f3604ecfc4b20369986c4d7)
/lib/admin_config.php: r1747 (a2eb09f94d7902a6e95750fc4abcea17)
/lib/txplib_misc.php: r2464 (615afd44a10311f1c0b7852d9bc15d24)
/lib/taglib.php: r1535 (9b519f9dc88791e5ee8eacc029dd6975)
/lib/txplib_head.php: r2404 (2e067b25997cf67cddbdd365570e69d5)
/lib/classTextile.php: r2462 (a031e2ea894e339711c601f230c5ee71)
/lib/txplib_html.php: r2403 (97e173da3058b438513df67fd7d1ceca)
/lib/txplib_db.php: r2406 (5ed67642f805639b54e381fb22efd208)
/lib/IXRClass.php: r765 (137b91497628f0058a2fca9eba5c3b7f)
/lib/txplib_forms.php: r2403 (438a734b52acef40b36d8a3ba23987e8)
/lib/class.thumb.php: r2329 (b2a2fda54371dbd6c40ba553941f090e)
/lib/constants.php: r2361 (ab6d51668fab1e3c98e7d520b1a59f0f)
/lib/txplib_update.php: r1239 (10f28a986d23187b436369dc29ab552f)
/lib/txplib_wrapper.php: r2286 (419125ec74a17a70bf1e86ebfcd45253)
/publish/taghandlers.php: r2444 (cc9de8f2018b01398a2ba542c5f5bdc6)
/publish/atom.php: r2402 (46c4402717f695fde0d49d806adfa4c4)
/publish/log.php: r1637 (5254d0f3942086bc55723923307a51db)
/publish/comment.php: r2460 (2d1ae1dec0784f044e7005fa5ed50930)
/publish/search.php: r1748 (8c86ebcb5be08e214d81ca15a32164ca)
/publish/rss.php: r2393 (09aac29bf22ffa71c1e118e851cff3c3)
/publish.php: r2436 (7087864f1e7c6efe096d3b8e07c350b1)
/index.php: r2466 (30ecf35de5c1edc6ef68e780c8c79daa)
/css.php: r944 (8beba8f83a091068723435cdcdc02f2f)
Last edited by Michael (2007-07-28 15:40:16)
Re: WP-Import: Article-bodies stop where umlaut should appear
All of that indicates that things should work out OK, so I am a bit surprised. Maybe someone more knowledgeable about WordPress has an idea what could trigger it.
If not, I could offer to look at the dump and make sure the problem isn’t there, if you are OK with sending me a dump of the WP database.
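A dump can also be pre-checked locally before sending it anywhere. The following is a rough sketch, not something anyone in this thread ran: it reports which lines of a dump contain bytes that are not valid UTF-8. The function name `lines_with_invalid_utf8` and the sample data are invented for illustration.

```python
def lines_with_invalid_utf8(data: bytes):
    """Return the 1-based numbers of lines that are not valid UTF-8."""
    bad = []
    for number, line in enumerate(data.split(b"\n"), start=1):
        try:
            line.decode("utf-8")
        except UnicodeDecodeError:
            bad.append(number)
    return bad


# Hypothetical two-line dump: line 1 holds a UTF-8 "ü", line 2 a Latin-1 "ü".
sample = b"INSERT 1 'dar\xc3\xbcber';\nINSERT 2 'dar\xfcber';\n"
print(lines_with_invalid_utf8(sample))  # [2]
```

For a real file, the bytes could be read with `open(path, "rb").read()`; an empty result would suggest the dump itself is clean UTF-8 and the problem lies elsewhere.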
Re: WP-Import: Article-bodies stop where umlaut should appear
No problem, I can do that. Just tell me two things, please:
- Which options do I have to use to export the dump and
- Where should I send it to?
Last edited by Michael (2007-07-29 12:32:47)
#9 2007-09-09 10:53:46
- Rei
- Member
- From: MY / SG (GMT +08)
- Registered: 2007-08-31
- Posts: 14
Re: WP-Import: Article-bodies stop where umlaut should appear
“each article stops whenever there is an umlaut” – yes, I had the same problem.
In the end I had to use the wp-export-mt.php script to export the database in MovableType format (very important: save the text file in UTF-8), and the “import from MovableType file” then went well. See my other related reply.
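If the editor at hand makes it hard to tell which encoding the export file was saved in, the "save as UTF-8" step can be scripted. This is a minimal sketch under stated assumptions: the function name `resave_as_utf8` is hypothetical, and the fallback to Windows-1252 assumes that a non-UTF-8 export contains Western European text, which may not hold for every site.

```python
import tempfile
from pathlib import Path


def resave_as_utf8(src, dst, fallback="cp1252"):
    """Write a UTF-8 copy of src; return the encoding the input was read as."""
    raw = Path(src).read_bytes()
    try:
        text = raw.decode("utf-8")
        used = "utf-8"
    except UnicodeDecodeError:
        # Assumption: anything that is not UTF-8 is Western European text.
        text = raw.decode(fallback, errors="replace")
        used = fallback
    Path(dst).write_bytes(text.encode("utf-8"))
    return used


# Small demonstration with a temporary file standing in for the export.
with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / "export.txt"
    dst = Path(tmp) / "export-utf8.txt"
    src.write_bytes("darüber".encode("cp1252"))
    print(resave_as_utf8(src, dst))
    print(dst.read_bytes() == "darüber".encode("utf-8"))
```

Writing to a separate output file keeps the original export untouched, so a wrong guess about the fallback encoding can be corrected by simply running it again.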