Remove mbstring as a dependency #40

philsturgeon · 2014-12-26T20:49:17Z

Half done with #38, but 2 of the 3 uses are removed.

Found some weird hack for string length that involves regex. I'll take it.
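For readers wondering what a regex-based length count can look like, here is a minimal sketch. It is not necessarily the exact hack used in this PR; the function name `utf8LengthRegex` is made up for illustration. It relies on the PCRE `u` modifier, under which `.` matches one whole UTF-8 code point rather than one byte.

```php
<?php
// Hypothetical helper (not from the PR): count UTF-8 code points without mbstring.
function utf8LengthRegex($string)
{
    // 'u' treats the subject as UTF-8; 's' lets '.' match newlines too.
    // preg_match_all() returns the number of matches, or false on invalid UTF-8.
    $count = preg_match_all('/./us', $string, $matches);

    // Assumption for this sketch: fall back to the byte length on invalid input.
    return $count === false ? strlen($string) : $count;
}
```

For example, `utf8LengthRegex("h\xC3\xA9llo")` counts five characters even though the string is six bytes long.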

Not sure what to do about mb_strtoupper() though.

https://github.com/thephpleague/commonmark/search?utf8=%E2%9C%93&q=mb_strtoupper&type=Code

GrahamCampbell · 2014-12-26T21:21:02Z

PHP 5.3 and HHVM aren't happy.

philsturgeon · 2014-12-26T21:23:05Z

I get emails about that. :)

colinodell · 2014-12-26T22:55:12Z

Thanks for getting this started! I've resolved the current failures and am working on mb_strtoupper now - I'll let you know how far I get.

cebe · 2014-12-27T00:06:54Z

src/Util/TextHelper.php

@@ -36,7 +36,8 @@ public static function detabLine($string)

        foreach ($parts as $part) {
            // Calculate number of spaces; insert them followed by the non-tab contents
-            $amount = 4 - mb_strlen($line, 'UTF-8') % 4;
+            $lineLength = strlen(utf8_decode($line));


Are you sure this works with non-ASCII characters? You should include some tests for this.

colinodell · 2014-12-27

Yep, it does. utf8_decode() converts each multi-byte UTF-8 character into a single-byte ISO-8859-1 character. Characters that can't be represented come out as ?, which is fine since we only need a count here.
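To make the difference concrete, here is a small sketch of the tab-stop calculation on a line containing non-ASCII text. It deliberately swaps in a different counting technique from the one in the diff (skipping UTF-8 continuation bytes instead of calling utf8_decode()), and both function names are hypothetical rather than taken from TextHelper.

```php
<?php
// Hypothetical helper: count code points by skipping UTF-8 continuation bytes,
// which always have the bit pattern 10xxxxxx.
function countCodePoints($string)
{
    $count = 0;
    for ($i = 0, $n = strlen($string); $i < $n; $i++) {
        if ((ord($string[$i]) & 0xC0) !== 0x80) {
            $count++;
        }
    }

    return $count;
}

// Hypothetical helper: number of spaces needed to reach the next tab stop.
function spacesToNextTabStop($line, $tabWidth = 4)
{
    return $tabWidth - countCodePoints($line) % $tabWidth;
}
```

With the three-character, five-byte string `"\xC3\xA9t\xC3\xA9"` ("été"), this gives 1 space to the next tab stop, whereas a plain byte-based strlen() would give 3.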

Per the spec, reference link labels are matched after normalizing them with a Unicode case fold, which mb_strtoupper() provides. Since not every system has the mbstring extension installed, we need to implement this logic ourselves by converting characters according to the case folding table: http://www.unicode.org/Public/UNIDATA/CaseFolding.txt
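A table-driven fold could be sketched roughly as follows. This is only an illustration of the idea, not the implementation that landed: the function names are hypothetical, and the table is a tiny hand-picked excerpt of CaseFolding.txt (the real table is far larger and would presumably be generated from the file).

```php
<?php
// Hypothetical excerpt of the case folding table, keyed by UTF-8 byte sequences.
function buildFoldTable()
{
    // ASCII A-Z fold to a-z.
    $table = array_combine(range('A', 'Z'), range('a', 'z'));

    $table["\xC3\x84"] = "\xC3\xA4"; // U+00C4 -> U+00E4 (A with diaeresis)
    $table["\xC3\x9F"] = 'ss';       // U+00DF -> "ss" (sharp s, full folding)
    $table["\xCE\x91"] = "\xCE\xB1"; // U+0391 -> U+03B1 (Greek alpha)

    return $table;
}

// Hypothetical helper: normalize a reference label for case-insensitive matching.
function foldCase($label)
{
    // strtr() with an array replaces every occurrence of each key in one pass.
    return strtr($label, buildFoldTable());
}
```

Two labels then match when their folded forms are identical, for example `foldCase('Foo') === foldCase('FOO')`. Because UTF-8 is self-synchronizing, the byte-wise replacement in strtr() cannot match in the middle of another character.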

Remove mbstring as a dependency

Killed 2 out of 3 mbstring uses.

82421eb

colinodell added 2 commits December 26, 2014 17:20

Different approach to counting UTF-8 string length

b81ab7b

Only convert code points within planes 0-2

bbbd67c

Before 82421eb, the mb_decode_numericentity function wouldn't convert code points beyond plane 2 (per the flags). This change essentially adds the same behavior.
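As a rough illustration of what "only planes 0-2" means without mbstring, the sketch below encodes a code point as UTF-8 by hand and refuses anything above U+2FFFF. The function name and the choice to return null for rejected values (including surrogates) are assumptions of this sketch, not taken from the commit.

```php
<?php
// Hypothetical helper: encode a Unicode code point as UTF-8, limited to planes 0-2.
function codePointToUtf8($codePoint)
{
    // Reject anything outside planes 0-2, and the surrogate range (assumption).
    if ($codePoint < 0 || $codePoint > 0x2FFFF
        || ($codePoint >= 0xD800 && $codePoint <= 0xDFFF)) {
        return null;
    }

    if ($codePoint < 0x80) {
        return chr($codePoint);
    }

    if ($codePoint < 0x800) {
        return chr(0xC0 | ($codePoint >> 6))
            . chr(0x80 | ($codePoint & 0x3F));
    }

    if ($codePoint < 0x10000) {
        return chr(0xE0 | ($codePoint >> 12))
            . chr(0x80 | (($codePoint >> 6) & 0x3F))
            . chr(0x80 | ($codePoint & 0x3F));
    }

    return chr(0xF0 | ($codePoint >> 18))
        . chr(0x80 | (($codePoint >> 12) & 0x3F))
        . chr(0x80 | (($codePoint >> 6) & 0x3F))
        . chr(0x80 | ($codePoint & 0x3F));
}
```

A caller decoding numeric entities could leave the entity text untouched whenever this returns null, which mirrors the "don't convert beyond plane 2" behavior described above.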

cebe reviewed Dec 27, 2014
colinodell added 4 commits December 26, 2014 19:28

Change mbstring requirement to a suggestion

c7331db

Update changelog

888cf76

Make scrutinizer happy

da667f2

colinodell added a commit that referenced this pull request Dec 27, 2014

Merge pull request #40 from thephpleague/rm-mbstring

ec8bcb9

Remove mbstring as a dependency

colinodell merged commit ec8bcb9 into master Dec 27, 2014

colinodell deleted the rm-mbstring branch December 27, 2014 00:41
