ruby.git - rhe's working repository

	Commit message	Author	Age	Files	Lines
*	Right size the iseq coverage branches tmp array - initializes with 5 elements	Lourens Naudé	2019-10-29	1	-1/+1
\|
*	Pin keys of this st_table	Aaron Patterson	2019-10-28	1	-1/+1
\|
*	respect `param.flags.ruby2_keywords` at to_binary.	Koichi Sasada	2019-10-25	1	-1/+3
\|	`param.flags.ruby2_keywords` is not stored/loaded correctly by to_binary, so store and restore this flag correctly.
*	Define arguments forwarding as `ruby2_keywords` style	Nobuyoshi Nakada	2019-10-25	1	-0/+1
\|	Get rid of these redundant and useless warnings.

```
$ ruby -e 'def bar(a) a; end; def foo(...) bar(...) end; foo({})'
-e:1: warning: The last argument is used as the keyword parameter
-e:1: warning: for `foo' defined here
-e:1: warning: The keyword argument is passed as the last hash parameter
-e:1: warning: for `bar' defined here
```
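A minimal sketch of `...` argument forwarding, assuming Ruby 2.7 or later. `bar` and `foo` mirror the one-liner in the commit message; `show` and `relay` are hypothetical names added only for illustration.

```ruby
# Forwarding every argument with `...` (requires Ruby 2.7 or later).
def bar(a)
  a
end

def foo(...)
  bar(...)
end

# Hypothetical second pair showing that keywords are forwarded too.
def show(a, k: 0)
  [a, k]
end

def relay(...)
  show(...)
end

p foo({})          # a positional empty hash reaches `bar` unchanged
p relay(1, k: 2)   # positional and keyword arguments both arrive
```

Whether the warnings quoted above appear depends on the interpreter build, so the sketch checks only the forwarded values.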
*	Use CPDEBUG for debug code	Alan Wu	2019-10-24	1	-2/+2
\|
*	Combine call info and cache to speed up method invocation	Alan Wu	2019-10-24	1	-122/+117
\|	To perform a regular method call, the VM needs two structs, `rb_call_info` and `rb_call_cache`. At the moment, we allocate these two structures in separate buffers. In the worst case, the CPU needs to read 4 cache lines to complete a method call. Putting the two structures together reduces the maximum number of cache line reads to 2.

Combining the structures also saves 8 bytes per call site, as the current layout uses two separate pointers for the call info and the call cache. This saves about 2 MiB on Discourse.

This change improves the Optcarrot benchmark by at least 3%. For more details, see the attached bugs.ruby-lang.org ticket.

Complications:

- A new instruction attribute `comptime_sp_inc` is introduced to calculate SP increase at compile time without using call caches. At compile time, a `TS_CALLDATA` operand points to a call info struct, but at runtime, the same operand points to a call data struct. Instructions that explicitly define `sp_inc` also need to define `comptime_sp_inc`.
- MJIT code for copying call cache becomes slightly more complicated.
- This changes the bytecode format, which might break existing tools.

[Misc #16258]
*	Fix the exception when CPDEBUG	Nobuyoshi Nakada	2019-10-23	1	-1/+4
\|
*	Fix build for CPDEBUG=1	Alan Wu	2019-10-22	1	-1/+1
\|	The declarations went out-of-sync in dcfb7f6.
*	Right size the numtable in insn_make_insn_table to VM_INSTRUCTION_SIZE	Lourens Naudé	2019-10-11	1	-1/+1
\|
*	avoid overflow in integer multiplication	卜部昌平	2019-10-09	1	-16/+30
\|	This changeset basically replaces `ruby_xmalloc(x * y)` with `ruby_xmalloc2(x, y)`. Some convenience functions are also provided, for instance `rb_xmalloc_mul_add(x, y, z)`, which allocates x * y + z bytes.
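The actual change is in C. The following is only a Ruby model of the idea, written under the assumption of a 64-bit `size_t`. `SIZE_MAX`, `wrapped_mul`, `checked_mul` and `checked_mul_add` are hypothetical names for this sketch, not functions from the Ruby source.

```ruby
# Model of an unsigned 64-bit size_t (assumption: 64-bit platform).
SIZE_MAX = 2**64 - 1

# What a plain `x * y` does in C for size_t: it silently wraps around.
def wrapped_mul(x, y)
  (x * y) & SIZE_MAX
end

# Overflow-checked multiplication: refuse instead of wrapping.
def checked_mul(x, y)
  if x != 0 && y > SIZE_MAX / x
    raise ArgumentError, "integer overflow: #{x} * #{y}"
  end
  x * y
end

# Overflow-checked x * y + z, in the spirit of a "mul_add" helper.
def checked_mul_add(x, y, z)
  product = checked_mul(x, y)
  if z > SIZE_MAX - product
    raise ArgumentError, "integer overflow: #{product} + #{z}"
  end
  product + z
end

p wrapped_mul(2**63, 2)      # the wrapped size is far too small
p checked_mul_add(2, 3, 4)   # small sizes pass through unchanged
```

The division-based test avoids computing the overflowing product at all, which is what matters in C, where the wrapped value would otherwise be handed to the allocator.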
*	Make parser_params have parent_iseq instead of base_block	Yusuke Endoh	2019-10-04	1	-8/+4
\|	The parser needs to determine whether a local variable is defined in an outer scope. For this purpose, the "base_block" field kept the outer block. However, the whole block was actually unneeded; the parser used only base_block->iseq. So this change lets parser_params hold the iseq directly instead of the whole block.
*	Iseq#to_binary: dump flag for **nil (#2508)	Alan Wu	2019-10-02	1	-5/+7
\|	RUBY_ISEQ_DUMP_DEBUG=to_binary and the attached test case were failing. Dump the flag to make sure `**nil` can round-trip properly.
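A rough sketch of what round-tripping `**nil` means, assuming Ruby 2.7 or later on MRI. The method name `no_kw` is hypothetical, and this is not the test case attached to the commit.

```ruby
# A method that explicitly accepts no keywords (`**nil`, Ruby 2.7+).
source = <<~RUBY
  def no_kw(a, **nil)
    a
  end
  method(:no_kw).parameters
RUBY

iseq = RubyVM::InstructionSequence.compile(source)

# Dump to the binary format and load it back before evaluating.
loaded = RubyVM::InstructionSequence.load_from_binary(iseq.to_binary)
params = loaded.eval

p params   # the :nokey entry should survive the round trip
```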
*	Drop eliminated catch-entries	Nobuyoshi Nakada	2019-09-27	1	-0/+12
\|	Drop catch table entries used in eliminated block, as well as call_infos. [Bug #16184]
*	Adjusted spaces [ci skip]	Nobuyoshi Nakada	2019-09-27	1	-10/+11
\|
*	Replace `freeze_string` with `rb_fstring`	Aaron Patterson	2019-09-26	1	-14/+8
\|
*	Remove `iseq_add_mark_object_compile_time`	Aaron Patterson	2019-09-26	1	-37/+28
\|	This function is just a synonym for RB_OBJ_WRITTEN, so we can just directly call that.
*	Execute write barrier instead of adding to array	Aaron Patterson	2019-09-26	1	-1/+1
\|	We can mark everything via the instruction objects, so just execute the write barrier instead of appending to the array.
*	Pull `iseq_add_mark_object_compile_time` out of `freeze_string`	Aaron Patterson	2019-09-26	1	-4/+11
\|	`freeze_string` essentially called iseq_add_mark_object_compile_time. I need to know where all writes occur on the `rb_iseq_t`, so this commit separates the function calls so we can add write barriers in the right place.
*	Pull "mark object" up	Aaron Patterson	2019-09-26	1	-11/+18
\|	Move the "add mark object" function to the location where we should be calling RB_OBJ_WRITTEN. I'm going to add verification code next so we can make sure the objects we're adding to the array are also reachable from the mark function.
*	Scan the ISEQ arena for markables and mark them	Aaron Patterson	2019-09-26	1	-0/+51
\|	This commit scans the ISEQ arena for objects that can be marked and marks them. This should make the mark array unnecessary.
*	Allocate `INSN *` out of a separate arena	Aaron Patterson	2019-09-26	1	-1/+2
\|
*	Introduce a secondary arena	Aaron Patterson	2019-09-26	1	-1/+1
\|	We'll scan the secondary arena during GC mark. So, we should only allocate "markable" instruction linked list nodes out of the secondary arena.
*	Pass in arena to allocator	Aaron Patterson	2019-09-26	1	-4/+10
\|	This is so we can configure a new arena later.
*	Allows calling a private method only with bare `self`	Nobuyoshi Nakada	2019-09-20	1	-1/+10
\|
*	Allow calling a private accessor with `self.`	Nobuyoshi Nakada	2019-09-20	1	-4/+4
\|	[Feature #11297] [Feature #16123]
*	Allow calling a private method with `self.`	Dylan Thacker-Smith	2019-09-20	1	-0/+4
\|	This makes it consistent with calling private attribute assignment methods, which is currently allowed (e.g. `self.value =`). Calling a private method in this way can be useful when trying to assign the return value to a local variable with the same name. [Feature #11297] [Feature #16123]
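A small sketch of the use case described above, assuming Ruby 2.7 or later. The class `Counter` and its methods are hypothetical names chosen for illustration.

```ruby
class Counter
  def initialize
    @count = 0
  end

  def bump
    # The local variable shadows the method name, so `self.count`
    # is needed on the right-hand side to reach the private method.
    count = self.count + 1
    @count = count
  end

  private

  def count
    @count
  end
end

counter = Counter.new
p counter.bump

# Calling the private method from outside is still rejected.
begin
  counter.count
rescue NoMethodError => e
  puts "rejected: #{e.class}"
end
```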
*	Use EXPECT_NODE_NONULL	Nobuyoshi Nakada	2019-09-19	1	-2/+1
\|
*	Check COMPILE_RECV result	Nobuyoshi Nakada	2019-09-19	1	-4/+8
\|
*	Improve the output of `RubyVM::InstructionSequence#to_binary` (#2450)	NagayamaRyoga	2019-09-19	1	-619/+978
\|	The output of RubyVM::InstructionSequence#to_binary is extremely large. We have reduced the output of #to_binary by more than 70%. The execution speed of RubyVM::InstructionSequence.load_from_binary is about 7% slower, but when reading a binary from a file, it may be faster than on master. Since the Bootsnap gem uses #to_binary, this proposal reduces the compilation cache size of Rails projects to about 1/4. See details: [Feature #16163]
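For orientation, a minimal sketch of the dump/load cycle that these numbers refer to, assuming MRI. The compiled snippet is arbitrary, and the byte size printed depends on the Ruby version in use.

```ruby
# Compile a tiny program to an instruction sequence.
iseq = RubyVM::InstructionSequence.compile("1 + 2")

# Serialize it; this string is what a compilation cache would store.
binary = iseq.to_binary
puts "binary size: #{binary.bytesize} bytes"

# Load it back and run it without recompiling the source.
loaded = RubyVM::InstructionSequence.load_from_binary(binary)
result = loaded.eval
p result
```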
*	introduce IBF_(MAJOR\|MINOR)_VERSION.	Koichi Sasada	2019-09-13	1	-5/+13
\|	RubyVM::InstructionSequence#to_binary generates a binary representation of the bytecode. To check that a binary is compatible with the MRI loading it, we record a major/minor version in the binary and compare it at load time. However, development versions of MRI can change this format, and we cannot increment the minor version because it must stay consistent with Ruby's major/minor versions. To solve this issue, we introduce a new minor version scheme (binary's minor_version = ruby's minor * 10000 + dev ver) so that we can detect incompatibility with older development versions.
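The formula can be sketched as follows. `ibf_minor_version`, `loadable?` and `DEV_VERSION_LIMIT` are hypothetical names. The exact-equality check and the assumption that the dev version stays below 10000 are guesses about how such a scheme would be used, not details confirmed by the commit message.

```ruby
# Assumption: the dev counter must fit in the low four decimal digits.
DEV_VERSION_LIMIT = 10_000

# binary's minor_version = ruby's minor * 10000 + dev ver
def ibf_minor_version(ruby_minor, dev_version)
  unless (0...DEV_VERSION_LIMIT).cover?(dev_version)
    raise ArgumentError, "dev version must be in 0...#{DEV_VERSION_LIMIT}"
  end
  ruby_minor * DEV_VERSION_LIMIT + dev_version
end

# Assumption: a binary loads only when the combined numbers match.
def loadable?(binary_minor, runtime_minor)
  binary_minor == runtime_minor
end

older = ibf_minor_version(7, 1)
newer = ibf_minor_version(7, 2)
p older, newer
p loadable?(older, newer)   # same Ruby minor, different dev format
```

Folding the dev counter into the minor number lets two development builds with the same Ruby minor version still tell their binary formats apart.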
*	Fix a typo [ci skip]	Kazuhiro NISHIYAMA	2019-09-09	1	-1/+1
\|
*	compile.c (compile_hash): rewrite keyword splat handling	Yusuke Endoh	2019-09-08	1	-10/+22
\|	and add some comments. (I confirm that `foo(**{})` allocates no hash object.)
*	compile.c (compile_hash): rewrite the compilation algorithm	Yusuke Endoh	2019-09-08	1	-74/+108
\|	This is a similar refactoring to 8c908c989077c74eed26e02912b98362e509b8a3, but the target is compile_hash.
*	compile.c (NODE_OP_ASGN1): Remove unneeded DECL_ANCHOR	Yusuke Endoh	2019-09-08	1	-4/+1
\|
*	compile.c (keyword_node_p): Refactor out keyword node checks	Yusuke Endoh	2019-09-08	1	-4/+9
\|
*	compile.c (compile_hash): Remove redundant check for NODE_ZLIST	Yusuke Endoh	2019-09-08	1	-24/+6
\|	The NODE_ZLIST case is handled in compile_hash, so iseq_compile_each0 doesn't have to do the same check redundantly.
*	compile.c (compile_hash): Simplify the keyword handling	Yusuke Endoh	2019-09-08	1	-9/+6
\|	The length of NODE_LIST chain in NODE_HASH is always even because it represents key-value pairs. There is no need to check for the odd-length case.
*	compile.c (compile_hash): don't add a temporal array to mark_ary	Yusuke Endoh	2019-09-08	1	-4/+0
\|	The array is just a temporary buffer used to create a hash; it is not stored in the final iseq.
*	compile.c (compile_array): undef a temporal macro	Yusuke Endoh	2019-09-08	1	-1/+1
\|
*	* remove trailing spaces. [ci skip]	git	2019-09-07	1	-1/+1
\|
*	compile.c (compile_array): rewrite the compilation algorithm	Yusuke Endoh	2019-09-07	1	-56/+92
\|	The original code looks unnecessarily complicated (to me). Also, it creates a pre-allocated array only for the prefix of the array. The new code optimizes not only the prefix but also the subsequence that is longer than 0x40 elements.

    # not optimized
    10000000.times { [1+1, 1,2,3,4,...,63] } # 2.12 sec.
    # (1+1; push 1; push 2; ...; push 63; newarray 64; concatarray)

    # optimized
    10000000.times { [1+1, 1,2,3,4,...,63,64] } # 1.46 sec.
    # (1+1; newarray 1; putobject [1,2,3,...,64]; concatarray)
*	compile.c (compile_hash): refactoring	Yusuke Endoh	2019-09-07	1	-122/+112
\|	The same refactoring as b601b13c7267889bf394146353c5f2b0eb488278.
*	compile.c (compile_array): refactoring	Yusuke Endoh	2019-09-07	1	-69/+67
\|	The "popped" case is simple, so this change moves that branch to the start instead of scattering `if (popped)` branches through the main part. Also, the return value "len" is not used, so the function now returns just 0 or 1.
*	compile.c: Separate compile_list to two functions for Array and Hash	Yusuke Endoh	2019-09-07	1	-121/+160
\|	compile_list handled the compilation of both Array literals and Hash literals. I guess it was originally reasonable to handle them in one function, but now the compilation of Array is very different from Hash, so the function was complicated by many branches for Array and Hash. This change splits the function into two, one for Array and one for Hash.
*	compile.c (compile_list): allow an odd-length hidden array literal	Yusuke Endoh	2019-09-07	1	-10/+19
\|	An array literal [1,2,...,301] was compiled to the following iseq:

    duparray [1,2,...,300]
    putobject [301]
    concatarray

The Array literal optimization took every two elements, maybe because it must handle not only Array but also Hash. Now the optimization takes each element if it is an Array literal. So the new iseq is:

    duparray [1,2,...,301]
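One way to inspect how a particular interpreter compiles such a literal is sketched below, assuming MRI. The instruction listing differs between Ruby versions, so the sketch prints it rather than asserting any particular shape.

```ruby
# Build the source text of a 301-element array literal: "[1, 2, ..., 301]".
source = "[" + (1..301).to_a.join(", ") + "]"

iseq = RubyVM::InstructionSequence.compile(source)

# Show the instructions this interpreter emits for the literal.
puts iseq.disasm

# Whatever the instruction mix, the literal must evaluate to the same array.
value = iseq.eval
p value.length
```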
*	compile.c (compile_list): emit newarraykwsplat only at the last chunk	Yusuke Endoh	2019-09-07	1	-8/+3
\|	`[{}, {}, {}, ..., {}, *{}]` is wrongly created. A big array literal is created and concatenated for every 256 elements. The newarraykwsplat must be emitted only at the last chunk.
*	Rename some function/definition names that handles NODE_LIST	Yusuke Endoh	2019-09-07	1	-8/+8
\|	from array to list. Follow up to ac50ac03aeb210763730cdc45f230e236519223d
*	Rename NODE_ARRAY to NODE_LIST to reflect its actual use cases	Yusuke Endoh	2019-09-07	1	-30/+30
\|	and NODE_ZARRAY to NODE_ZLIST. NODE_ARRAY is used not only by an Array literal, but also the contents of Hash literals, method call arguments, dynamic string literals, etc. In addition, the structure of NODE_ARRAY is a linked list, not an array. This is very confusing, so I believe `NODE_LIST` is a better name.
*	Make m(**{}) mean call without keywords	Jeremy Evans	2019-09-05	1	-6/+24
\|	Previously, **{} was removed by the parser:

```
$ ruby --dump=parse -e '{**{}}'
@ NODE_SCOPE (line: 1, location: (1,0)-(1,6))
+- nd_tbl: (empty)
+- nd_args:
\|   (null node)
+- nd_body:
    @ NODE_HASH (line: 1, location: (1,0)-(1,6))*
    +- nd_brace: 1 (hash literal)
    +- nd_head:
        (null node)
```

Since it was removed by the parser, the compiler did not know about it, and `m(**{})` was therefore treated as `m()`.

This modifies the parser to not remove the `**{}`. A simple approach would be to just remove a few lines from the parser, but that would cause two hash allocations every time it was used. The approach taken here modifies both the parser and the compiler, and results in `**{}` not allocating any hashes in the usual case.

The basic idea is we use a literal node in the parser containing a frozen empty hash literal. In the compiler, we recognize when that is used, and if it is the only keyword present, we just push it onto the VM stack (no creation of a new hash or merging of keywords). If it is the first keyword present, we push a new empty hash onto the VM stack, so that later keywords can merge into it. If it is not the first keyword present, we can ignore it, since there is no reason to merge an empty hash into the existing hash.

Example instructions for `m(**{})`

Before (note ARGS_SIMPLE):

```
== disasm: #<ISeq:<main>@-e:1 (1,0)-(1,7)> (catch: FALSE)
0000 putself ( 1)[Li]
0001 opt_send_without_block <callinfo!mid:m, argc:0, FCALL\|ARGS_SIMPLE>, <callcache>
0004 leave
```

After (note putobject and KW_SPLAT):

```
== disasm: #<ISeq:<main>@-e:1 (1,0)-(1,7)> (catch: FALSE)
0000 putself ( 1)[Li]
0001 putobject {}
0003 opt_send_without_block <callinfo!mid:m, argc:1, FCALL\|KW_SPLAT>, <callcache>
0006 leave
```

Example instructions for `m(**h, **{})`

Before and After (no change):

```
== disasm: #<ISeq:<main>@-e:1 (1,0)-(1,12)> (catch: FALSE)
0000 putself ( 1)[Li]
0001 putspecialobject 1
0003 newhash 0
0005 putself
0006 opt_send_without_block <callinfo!mid:h, argc:0, FCALL\|VCALL\|ARGS_SIMPLE>, <callcache>
0009 opt_send_without_block <callinfo!mid:core#hash_merge_kwd, argc:2, ARGS_SIMPLE>, <callcache>
0012 opt_send_without_block <callinfo!mid:m, argc:1, FCALL\|KW_SPLAT>, <callcache>
0015 leave
```

Example instructions for `m(**{}, **h)`

Before:

```
== disasm: #<ISeq:<main>@-e:1 (1,0)-(1,12)> (catch: FALSE)
0000 putself ( 1)[Li]
0001 putspecialobject 1
0003 newhash 0
0005 putself
0006 opt_send_without_block <callinfo!mid:h, argc:0, FCALL\|VCALL\|ARGS_SIMPLE>, <callcache>
0009 opt_send_without_block <callinfo!mid:core#hash_merge_kwd, argc:2, ARGS_SIMPLE>, <callcache>
0012 opt_send_without_block <callinfo!mid:m, argc:1, FCALL\|KW_SPLAT>, <callcache>
0015 leave
```

After (basically the same except for the addition of swap):

```
== disasm: #<ISeq:<main>@-e:1 (1,0)-(1,12)> (catch: FALSE)
0000 putself ( 1)[Li]
0001 newhash 0
0003 putspecialobject 1
0005 swap
0006 putself
0007 opt_send_without_block <callinfo!mid:h, argc:0, FCALL\|VCALL\|ARGS_SIMPLE>, <callcache>
0010 opt_send_without_block <callinfo!mid:core#hash_merge_kwd, argc:2, ARGS_SIMPLE>, <callcache>
0013 opt_send_without_block <callinfo!mid:m, argc:1, FCALL\|KW_SPLAT>, <callcache>
0016 leave
```
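The behaviour the commit title describes can be observed from Ruby code roughly as follows, assuming Ruby 2.7 or later. The method names `positional_only` and `with_keywords` are hypothetical, and the disassembly printed at the end differs between Ruby versions.

```ruby
def positional_only(*args)
  args
end

def with_keywords(*args, **kwargs)
  [args, kwargs]
end

h = {}

# An empty keyword splat is a call without keywords, so no
# empty hash shows up as an extra positional argument.
p positional_only(**{})
p positional_only(**h)

# Combining empty splats still yields no keywords.
p with_keywords(**{}, **h)

# Inspect what this interpreter emits for the call (only compiled, not run).
puts RubyVM::InstructionSequence.compile("m(**{})").disasm
```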
*	Unify SUPPORT_JOKE and OPT_SUPPORT_JOKE	Takashi Kokubun	2019-09-03	1	-2/+2
\|	for simplicity and consistency. SUPPORT_JOKE now needs the OPT_ prefix to make the config visible in `RubyVM::VmOptsH`, which introduced the inconsistency. As it has never been available for override in configure (no #ifndef guard), it should be fine to rename the config.