Fix Unicode identifier parsing and related runtime Unicode handling #149
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary\n- Make lexer code-point aware for Unicode identifiers (surrogate pairs).\n- Validate identifiers using Unicode XID properties and UTF-8 byte length limits.\n- Fix
substr/lvalue substring operations to operate on Unicode code points.\n- Improve parser diagnostics (e.g.forloop variable error).\n- Fix strict-vars handling for special sort vars/to avoid aborting regex test files under strict.\n\n## Test results\n-make: PASS\n-perl dev/tools/perl_test_runner.pl perl5_t/t/comp/parser.t: improved (73/193)\n-perl dev/tools/perl_test_runner.pl perl5_t/t/uni/variables.t: improved (66764/66880)\n-perl dev/tools/perl_test_runner.pl perl5_t/t/re/pat_rt_report.t: unblocked (2379/2514)\n\n## Notes\nNo test files were modified.