aboutsummaryrefslogtreecommitdiffstats
path: root/enc
Commit message (Collapse)AuthorAgeFilesLines
* Remove Unicode 13.0.0 related filesMartin Dürst2022-03-162-50521/+0
|
* update Unicode Version to 14.0.0 and Emoji version to 14.0Martin Dürst2022-03-132-0/+52210
|
* Revert "enc/depend: fix out-of-src build with --with-static-linked-ext" (#5616)Nobuyoshi Nakada2022-03-021-7/+1
| | | | This reverts commit 32ad8df9d1e07e1b2435a8890d070802fcd2989f, which broke out-of-src build with the pre-generated transcoder sources.
* enc/depend: fix out-of-src build with --with-static-linked-extYuta Saito2022-03-021-1/+7
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | When out-of-src build, at the beginning of a build, `make -f enc.mk srcs` generates trans C sources under build dir. On the other hand, enc/trans/*.o were built from trans C sources generated under srcdir due to the following auto-generated rules from enc/depend. ``` encsrcdir = ../src/enc ... enc/trans/big5.$(OBJEXT): $(encsrcdir)/trans/big5.c ``` Therefore, trans C sources are generated twice under srcdir and build dir during a build. Ideally, trans C sources have always been built before compilation of enc/trans/*.o because the source generation is prereq, so making enc/trans/*.o doesn't trigger trans C source generation and shouldn't require MINIRUBY as a make arg for enc.mk. However, the second trans C source gen is unintentionally triggered by enc/trans/*.o, so `make -f enc.mk libencs` requires MINIRUBY for now. When no `--with-static-linked-ext`, `make -f enc.mk libencs` is triggered from common.mk with MINIRUBY, so there is no problem. But when `--with-static-linked-ext`, libencs should be statically-linked to ruby, so `make -f enc.mk libencs` is triggered from exts.mk, and exts.mk invokes it without MINIRUBY. Therefore, when out-of-src build and with `--with-static-linked-ext`, the second trans C source gen fails due to missing MINIRUBY. This issue is deterministically reproducible without -j because common.mk's `main` rule also has libencs prerequisite. This patch supresses the second trans C source gen.
* [Feature #18249] Update dependenciesPeter Zhu2022-02-221-0/+4218
|
* [Feature #18249] Include ruby.h in extensions to have ABI versionPeter Zhu2022-02-222-0/+3
| | | | | All shared libraries must have `include/ruby/internal/abi.h` to include the ABI version. Including `ruby.h` will guarantee that.
* Update dependenciesNobuyoshi Nakada2021-11-211-67/+0
|
* ruby tool/update-deps --fix卜部昌平2021-10-051-850/+895
|
* Remove no longer needed include files (Unicode Version 12.1.0)Martin Dürst2021-07-092-49238/+0
|
* Add directory and include files for Unicode version 13.0.0Martin Dürst2021-07-082-0/+50521
| | | | | | - Add directory enc/unicode/13.0.0 - Add include files casefold.h and name2ctype.h for Unicode version 13.0.0
* Use $ignore_error defined in mkmf.rbNobuyoshi Nakada2021-07-031-9/+9
|
* dependency updates卜部昌平2021-04-131-65/+0
|
* transcode-tblgen.rb: make silent a little when just -vNobuyoshi Nakada2020-12-292-1/+2
|
* Add string encoding IBM720 alias CP720 (#3803)Lars Kanis2020-11-223-1/+126
| | | | | | The mapping table is generated from the ICU project: https://github.com/unicode-org/icu/blob/master/icu4c/source/data/mappings/ibm-720_P100-1997.ucm Fixes bug 16233 : https://bugs.ruby-lang.org/issues/16233
* sed -i '/rmodule.h/d'卜部昌平2020-08-271-27/+0
|
* sed -i '/r_cast.h/d'卜部昌平2020-08-271-27/+0
|
* sed -i '\,2/extern.h,d'卜部昌平2020-08-271-65/+0
|
* Use https instead of httpKazuhiro NISHIYAMA2020-07-2813-15/+15
|
* Encode ' as ' when using encode(xml: :attr)Jeremy Evans2020-07-101-1/+2
| | | | Fixes [Bug #16922]
* sed -i 's|ruby/impl|ruby/internal|'卜部昌平2020-05-111-5414/+5414
| | | | To fix build failures.
* sed -i s|ruby/3|ruby/impl|g卜部昌平2020-05-111-5414/+5414
| | | | This shall fix compile errors.
* Suppress warnings by gcc 10.1.0-RC-20200430Nobuyoshi Nakada2020-05-041-0/+4
| | | | | | | | | | | | | | | | | | | | | | * Folding results should not be empty. If `OnigCodePointCount(to->n)` were 0, `for` loop using `fn` wouldn't execute and `ncs` elements are not initialized. ``` enc/unicode.c:557:21: warning: 'ncs[0]' may be used uninitialized in this function [-Wmaybe-uninitialized] 557 | for (i = 0; i < ncs[0]; i++) { | ~~~^~~ ``` * Cast to `enum yytokentype` Additional enums for scanner events by ripper are not included in `yytokentype`. ``` ripper.y:7274:28: warning: implicit conversion from 'enum <anonymous>' to 'enum yytokentype' [-Wenum-conversion] ```
* Merge pull request #2991 from shyouhei/ruby.h卜部昌平2020-04-081-0/+6080
| | | Split ruby.h
* Suppress warnings: reserved for numbered parameterKazuki Tsujimoto2020-04-051-1/+1
|
* Added tooldir variableNobuyoshi Nakada2020-04-052-2/+3
|
* more on NULL versus functions.卜部昌平2020-02-071-5/+5
| | | | | | Function pointers are not void*. See also ce4ea956d24eab5089a143bba38126f2b11b55b6 8427fca49bd85205f5a8766292dd893f003c0e48
* update dependencies卜部昌平2019-12-261-0/+33
|
* decouple internal.h headers卜部昌平2019-12-261-1/+1
| | | | | | | | | | | | | | | | | | Saves comitters' daily life by avoid #include-ing everything from internal.h to make each file do so instead. This would significantly speed up incremental builds. We take the following inclusion order in this changeset: 1. "ruby/config.h", where _GNU_SOURCE is defined (must be the very first thing among everything). 2. RUBY_EXTCONF_H if any. 3. Standard C headers, sorted alphabetically. 4. Other system headers, maybe guarded by #ifdef 5. Everything else, sorted alphabetically. Exceptions are those win32-related headers, which tend not be self- containing (headers have inclusion order dependencies).
* enc/x_emoji.h: fixed dead-links [ci skip]Nobuyoshi Nakada2019-12-241-2/+2
| | | | English version pages seem no longer provided.
* Fixed misspellingsNobuyoshi Nakada2019-12-222-2/+2
| | | | | Fixed misspellings reported at [Bug #16437], missed and a new typo.
* Update dependenciesNobuyoshi Nakada2019-11-181-43/+0
|
* Init function is need to link staticallyNobuyoshi Nakada2019-08-101-0/+5
|
* Removed unnecessary headersNobuyoshi Nakada2019-08-101-32/+0
|
* Use ENC_REPLICATE to copy an encodingNobuyoshi Nakada2019-08-101-8/+1
|
* Revert "Removed unused includes"Yusuke Endoh2019-08-102-0/+34
| | | | | | | | | | | | | | | | | | This reverts commit c9eb8f82e9febeb634a23bec6aeea915eb25fe26. The change caused "implicit declaration" warning and actual segfault. ``` /tmp/ruby/v2/src/trunk-gc-asserts/enc/gb2312.c: In function ‘Init_gb2312’: /tmp/ruby/v2/src/trunk-gc-asserts/enc/gb2312.c:6:31: warning: implicit declaration of function ‘rb_enc_find’ [-Wimplicit-function-declaration] rb_enc_register("GB2312", rb_enc_find("EUC-KR")); ^~~~~~~~~~~ /tmp/ruby/v2/src/trunk-gc-asserts/enc/gb2312.c:6:31: warning: passing argument 2 of ‘rb_enc_register’ makes pointer from integer without a cast [-Wint-conversion] <command-line>:0:19: note: expected ‘OnigEncoding {aka const struct OnigEncodingTypeST *}’ but argument is of type ‘int’ /tmp/ruby/v2/src/trunk-gc-asserts/regenc.h:231:12: note: in expansion of macro ‘ONIG_ENC_REGISTER’ extern int ONIG_ENC_REGISTER(const char *, OnigEncoding); ^~~~~~~~~~~~~~~~~ ```
* Removed unused includesNobuyoshi Nakada2019-08-092-34/+0
|
* Include ruby/assert.h in ruby/ruby.h so that assertions can be thereNobuyoshi Nakada2019-07-141-0/+24
|
* Update dependencies for 369ff79394765ce198ac7cee872a8c739d895aaaTakashi Kokubun2019-07-141-0/+10
| | | | | Just copy-pasting diff from https://travis-ci.org/ruby/ruby/jobs/558407687
* add encoding conversion from/to CESU-8Martin Dürst2019-07-141-0/+85
| | | | | | | | | | | | | | | Add encoding conversion (transcoding) from UTF-8 to CESU-8 and back. CESU-8 is an encoding similar to UTF-8, but encodes codepoints above U+FFFF as two surrogates, these surrogates again being encoded as if they were UTF-8 codepoints. This preserves the same binary sorting order as in UTF-16. It is also somewhat similar (although not exactly identical) to an encoding used internally by Java. This completes issue #15995. enc/trans/cesu_8.trans: Add encoding conversion from/to CESU-8 test/ruby/test_transcode.rb: Add tests for above
* Update dependenciesNobuyoshi Nakada2019-07-091-0/+7
|
* remove UNREACHABLENARUSE, Yui2019-06-241-2/+0
|
* Add new encoding CESU-8 [Feature #15931]NARUSE, Yui2019-06-241-0/+456
|
* remove Unicode 12.0.0 related directory and generated filesduerst2019-04-052-48546/+0
| | | | | | This completes issue #15195. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67453 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* update to Unicode Version 12.1.0 (beta)duerst2019-04-052-0/+49238
| | | | | | | | | | | | | | | | | | | | | | Unicode Version 12.1.0 adds one single character, U+32FF SQUARE ERA NAME REIWA, for the new Japanese era starting on May 1st. 12.1.0 will be finalized only on May 7th, so we go with the beta version because further changes in the data we need are highly unlikely, and we want to make sure Ruby is ready for the new era. * common.mk: change UNICODE_VERSION to 12.1.0, UNICODE_BETA to YES * enc/unicode/12.1.0, enc/unicode/12.1.0/casefold.h, enc/unicode/12.1.0/name2ctype.h: add directory and generated data files for new version * lib/unicode_normalize/tables.rb: update for new character * test/ruby/test_regexp.rb: add test for character property age=12.1 * test/test_unicode_normalize.rb: add test for NFKC decomposition of new character This (mostly) completes issue #15195. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67441 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* delete directory and files related to Unicode version 11.0.0duerst2019-03-062-47428/+0
| | | | | | this completes and closes feature #15321 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67174 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* update Unicode version (and Emoji version) to 12.0.0duerst2019-03-062-0/+48546
| | | | | | | | | | | | | - common.mk: set UNICODE_VERSION and UNICODE_EMOJI_VERSION to 12.0.0 - lib/unicode_normalize/tables.rb: update table data to Unicode version 12.0.0 - enc/unicode/12.0.0/casefold.h, enc/unicode/12.0.0/name2ctype.h: add generated files for Unicode version 12.0.0 This is the main commit for #15321. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67169 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* Removed duplicate dependentsnobu2019-02-141-3/+0
| | | | git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67072 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* Update dependencies, internal.h includes ruby.hnobu2019-02-121-0/+1
| | | | git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67057 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* implement special behavior for Georgian for String#capitalizeduerst2018-12-091-1/+5
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The modern Georgian script is special in that it has an 'uppercase' variant called MTAVRULI which can be used for emphasis of whole words, for screamy headlines, and so on. However, in contrast to all other bicameral scripts, there is no usage of capitalizing the first letter in a word or a sentence. Words with mixed capitalization are not used at all. We therefore implement special behavior for String#capitalize. Formally, we define String#capitalize as first applying String#downcase for the whole string, then using titlecase on the first letter. Because Georgian defines titlecase as the identity function both for MTAVRULI ('uppercase') and Mkhedruli (lowercase), this results in String#capitalize being equivalent to String#downcase for Georgian. This avoids undesirable mixed case. * enc/unicode.c: Actual implementation * string.c: Add mention of this special case for documentation * test/ruby/enc/test_case_mapping.rb: Add two tests, a general one that uses String#capitalize on some (including nonsensical) combinations of MTAVRULI and Mkhedruli, and a canary test to detect the potential assignment of characters to the currently open slots (holes) at U+1CBB and U+1CBC. * test/ruby/enc/test_case_comprehensive.rb: Tweak generation of expectation data. Together with r65933, this closes issue #14839. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66300 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* delete Unicode 10.0.0 related files, no longer needed [#14802]duerst2018-12-092-45436/+0
| | | | | | | | | This line, and those below, will be ignored-- D enc/unicode/10.0.0 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66295 b2dd03c8-39d4-4d8f-98ff-823fe69b080e