mailing list archives x-out non-ascii characters

Bug #647232 reported by Ddorda
This bug affects 14 people
Affects: Launchpad itself
Status: Fix Released
Importance: Low
Assigned to: Curtis Hovey

Bug Description

The Launchpad mailing list archive does not support non-Latin languages!
For example, see the ubuntu-il LoCo mailing list: https://lists.launchpad.net/ubuntu-il/

Martin Pool (mbp) wrote :

Strangely enough, some of the detail pages do work, e.g. <https://lists.launchpad.net/ubuntu-il/msg00033.html>, but others do not.

As a workaround you could register the list on gmane.org and use their archive.

summary: - Mailing list archive does not support non-latin languages
+ list archive summary page omits not-latin-1 characters
summary: - list archive summary page omits not-latin-1 characters
+ mailing list archives x-out non-ascii characters
Barry Warsaw (barry) wrote :

All public Launchpad mailing lists are already archived at mail-archive.com, so you might check there. We've talked about automatically registering lists with gmane but haven't done that yet.

Curtis Hovey (sinzui)
affects: launchpad → launchpad-registry
Changed in launchpad-registry:
importance: Undecided → Low
status: New → Triaged
tags: added: ml-archive-sucks
arjuna rao chavala (arjunaraoc) wrote :

Re: #2
mail-archive is still not perfect for UTF-8 mail: the subject line is garbled, with conjuncts broken, spaces added, and conjunct signs not displayed. See http://<email address hidden>/msg00003.html
If this is not fixed, it may be preferable for people to migrate to Google mailing lists.

arjuna rao chavala (arjunaraoc) wrote :

Sorry, I forgot to add: at the link given in my previous comment, the subject line is repeated in the body. Please note the difference between the two.

Barry Warsaw (barry) wrote :

@#3: Why not help make open source better, rather than move to a closed source system like google groups?

Curtis Hovey (sinzui) wrote :

I think the bug tag sums up MHonArc very well. It sucks. I do not think we can fix any of these bugs until we switch to an archiver that we can integrate with Launchpad. Users expect archived messages to look and behave like comments in Launchpad.

Barry Warsaw (barry) wrote :

The sad truth is that the art of open source archivers has pretty much stagnated. MHonArc is implemented in Perl and I for one will never touch it. If I had the resources and cycles, I would work to improve the most heinous bits of Pipermail, in rough order of importance:

* persistent urls
* better database backend
* skinning
* standalone performance

Stanislav Hanzhin (hanzhin-stas) wrote :

https://savannah.nongnu.org/bugs/index.php?32534 - I created this bug today in the MHonArc bug tracker. Please provide further assistance to the developers in solving this problem.

Stanislav Hanzhin (hanzhin-stas) wrote :

MHonArc maintainer Earl Hood asks us to provide the raw data of an example mail and the MHonArc configuration used on Launchpad so that he can help. See the discussion at https://savannah.nongnu.org/bugs/index.php?32534. If you want to try to solve this, I agree to provide the contents of the https://lists.launchpad.net/openerp-russian/msg00001.html message to the MHonArc developers.

Stanislav Hanzhin (hanzhin-stas) wrote :

https://savannah.nongnu.org/bugs/index.php?26577 - this report may suggest what is causing the bug.

tags: removed: ml-archive-sucks
Curtis Hovey (sinzui)
tags: added: ml-archive-sucks
Curtis Hovey (sinzui) wrote :

The tag is needed to find all the related bugs.

This is the template used to generate the mrc: http://bazaar.launchpad.net/~launchpad-pqm/launchpad/devel/view/head:/lib/lp/services/mailman/monkeypatches/lp-mhonarc-common.mrc

Curtis Hovey (sinzui) wrote :

I have tested a fix for this on the ubuntu-il archive. I think the re-encoding directive will solve 90%+ of the character display issues. Messages whose actual encoding does not match the charset declared in the email will still fail, but that case is certainly a sender issue, not an archive issue.
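
To illustrate the general idea behind re-encoding, here is a minimal Python sketch. It is not the MHonArc directive or the Launchpad configuration, which this thread does not quote; `body_as_utf8` and `make_raw` are hypothetical helpers, and the sketch assumes a single-part text message. It decodes the body with the charset the message declares, then re-encodes it as UTF-8. When the sender declares one charset but writes bytes in another, decoding fails, which matches the sender-side case described above.

```python
from email import message_from_bytes


def body_as_utf8(raw: bytes) -> bytes:
    """Decode a single-part body with its declared charset, re-encode as UTF-8."""
    msg = message_from_bytes(raw)
    payload = msg.get_payload(decode=True)  # undoes base64 / quoted-printable
    charset = msg.get_content_charset() or "ascii"
    # Raises UnicodeDecodeError if the bytes are not valid in the declared charset.
    return payload.decode(charset).encode("utf-8")


def make_raw(text: str, declared: str, actual: str) -> bytes:
    """Build a tiny test message (hypothetical helper, for illustration only)."""
    headers = (
        f"Content-Type: text/plain; charset={declared}\r\n"
        "Content-Transfer-Encoding: 8bit\r\n"
        "\r\n"
    ).encode("ascii")
    return headers + text.encode(actual)


# A correctly labelled KOI8-R message re-encodes cleanly.
good = make_raw("Привет", declared="koi8-r", actual="koi8-r")
converted = body_as_utf8(good)

# A message labelled UTF-8 but actually written in KOI8-R cannot be decoded.
bad = make_raw("Привет", declared="utf-8", actual="koi8-r")
try:
    body_as_utf8(bad)
    mismatch_failed = False
except UnicodeDecodeError:
    mismatch_failed = True
```

A real archiver would also have to handle multipart messages, missing or unknown charset labels, and encoded header fields such as the subject line.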

Changed in launchpad:
assignee: nobody → Curtis Hovey (sinzui)
status: Triaged → In Progress
Curtis Hovey (sinzui)
Changed in launchpad:
milestone: none → 11.04
serfus (serfus) wrote :

This is really great news!
I hope this will solve our problem for good.
Thank you, Curtis and everybody who worked on this.

Curtis Hovey (sinzui) wrote :

QA is blocked until we get lists.staging.launchpad.net fixed.

Curtis Hovey (sinzui)
Changed in launchpad:
status: In Progress → Fix Released
arjuna rao chavala (arjunaraoc) wrote :

Thanks everyone. It works fine for Telugu. I have moved the links to ubuntu mailing list archives from Telugu-l10n translation page.

Stanislav Hanzhin (hanzhin-stas) wrote :
William Grant (wgrant) wrote : Re: [Bug 647232] Re: mailing list archives x-out non-ascii characters

On 05/04/11 15:37, Stanislav Hanzhin wrote:
> Not working for russian. See http://lists.launchpad.net/openerp-russian/
> and http://lists.launchapad.net/lp-l10n-ru

We haven't regenerated all of the old archives yet. Could you confirm if
new emails are shown OK? We'll hopefully be able to regenerate the
archives within a week or so.

Stanislav Hanzhin (hanzhin-stas) wrote :

William, I confirm that new mails in the lists are displayed correctly.
