04-27-2014, 12:56 PM | #16 | |
Grand Sorcerer
Posts: 27,598
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
Quote:
I still recommend small, complex chunks be smartened and reviewed--individually--in order to understand exactly what to expect from calibre's smartening routines before tackling whole books. Last edited by DiapDealer; 04-27-2014 at 01:03 PM. |
|
04-27-2014, 01:01 PM | #17 | |
Ex-Helpdesk Junkie
Posts: 19,421
Karma: 85397180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
Quote:
At least the difference skips the step of wading through and tracking down all the punctuation before checking if it should be changed. |
|
Advert | |
|
04-27-2014, 01:15 PM | #18 | |
Grand Sorcerer
Posts: 27,598
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
Quote:
Roger wants to learn what can be expected from the smartening algorithm. I believe studying small test cases (that he crafts specifically for this purpose) is going to help him achieve that goal more easily (and systematically) than eyeballing entire books' worth of random punctuation differences will. But you're free to disagree. Last edited by DiapDealer; 04-27-2014 at 01:23 PM. |
|
04-27-2014, 02:23 PM | #19 |
Color me gone
Posts: 2,089
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
|
It depends on whether a native speaker wrote the algorithm, I think.
|
04-27-2014, 03:56 PM | #20 | |
Wizard
Posts: 2,608
Karma: 3000161
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
|
Quote:
- straight apostrophes were changed to curly ones - three points to horizontal ellipsis - emdash changed to endash The last one is not really convenient for my use. For dialogues we can use the former or the latter, but I prefer using emdash. And that's it. So, it appears either to be a very basic tool or a very poor book. I will now follow DiapDealer advice to get forcefully more information. Last edited by roger64; 04-27-2014 at 03:58 PM. |
|
Advert | |
|
04-28-2014, 07:12 AM | #21 |
Grand Sorcerer
Posts: 27,598
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
That last one doesn't sound right. The algorithm should leave existing emdash (or endash) characters alone. It should only alter double (and triple) regular dash/hyphen instances. Under no circumstances have I witnessed the smartening algorithm change existing emdash characters (u2014) to endash characters (u2013).
Is there some confusion about what constitutes an em|en dash? |
04-28-2014, 07:58 AM | #22 | |
US Navy, Retired
Posts: 9,863
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Nexus 7
|
Quote:
|
|
04-28-2014, 09:49 AM | #23 |
Wizard
Posts: 2,608
Karma: 3000161
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
|
Hi
No confusion. One of the good things wiith the calibre editor is that we can read the name of the sign on the lower right. And we can see the difference too. All my emdashes have consistently been replaced by endashes. However, all my tirets de dialogue (this is when I only use an emdash), according to French "arcane" rules, are followed by a no-break space (here an utf-8 sign): maybe this induced SP to take this unusual behaviour? I'll try to replicate with a shorter text and I'll post a test EPUB. — 2014 – 2013 Edit: I can't reproduce. I try again... Last edited by roger64; 04-28-2014 at 10:25 AM. |
04-28-2014, 10:35 AM | #24 |
Wizard
Posts: 2,608
Karma: 3000161
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
|
Apologies for my mistake.
Yesterday I really got ENdashes (there are still here today) but as the corrected book was not mine, the only explanation I can find today is that it had previously EN-dashes (which is slightly unusual). Last edited by roger64; 04-28-2014 at 10:38 AM. |
04-29-2014, 05:56 AM | #25 |
Village idiot
Posts: 157
Karma: 519566
Join Date: Mar 2014
Location: Belgium
Device: sony PRS T-1
|
I tried it on a Dutch book, and every ' becomes a ‘.
This is fine for the conversations, but stuff like 't becomes ‘t. So smarten is quite useless for me. |
04-29-2014, 07:05 AM | #26 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
@Jlius, sent me a PM. I might have something to help.
Sent from my Nexus 5 using Forum Fiend v1.2.5. |
04-29-2014, 08:30 AM | #27 |
Grand Sorcerer
Posts: 27,598
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
|
04-30-2014, 10:06 AM | #28 |
Wizard
Posts: 2,608
Karma: 3000161
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
|
A little trial, to make myself forgiven. Don't pay any attention to the meaning. There is none. The joint EPUB is uncorrected. You are supposed to smarten its pants.
This is what I got from this trial: - modified nnbsp &#_8239; entity was converted to its utf-8 counterpart (invisible \_u202F) three points to ellipsis as said before --- and -- were both converted to EN dash which is a little surprising - unmodified EM and EN dashes were maintained as such & ditto the trailing spaces (after AR and ligne) were left untouched the forward space (before Centrage) ditto the PARAGRAH SEPARATOR (before nombre) was left untouched the untouched section sign stopped the Compare algorithm (unrelated to SP) Last edited by roger64; 04-30-2014 at 10:13 AM. |
04-30-2014, 02:21 PM | #29 | |||
Grand Sorcerer
Posts: 27,598
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
Quote:
Quote:
Quote:
Last edited by DiapDealer; 04-30-2014 at 02:41 PM. |
|||
04-30-2014, 03:55 PM | #30 |
Wizard
Posts: 2,608
Karma: 3000161
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
|
Well, I'll really need to buy magnifying glasses.
What about the "section" sign? |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Punctuation - who knows where? | gmw | Writers' Corner | 13 | 08-03-2013 01:16 AM |
Strange punctuation | gafitz | Conversion | 8 | 01-15-2012 07:07 PM |
Punctuation problems | Halk | Calibre | 0 | 10-13-2011 09:02 PM |
Punctuation | Dresden | Calibre | 7 | 08-31-2010 05:14 AM |
Punctuation | jgray | Workshop | 10 | 04-14-2010 07:38 AM |