…::Client:: CHARSET MAP[' latin1'] = Encoding:: UTF 8
This will solve your problems in Ruby but will not solve your problems in the database: MySQL will still sort and compare strings thinking they are latin1 and thus do so incorrectly. Here's an example:
CREATE TABLE names_latin1 ( name varchar ( 32 ) character SET latin1 ) ; INSERT INTO names_latin1 VALUES ( 'Martin Strauße' ) ; INSERT INTO names_latin1 VALUES ( 'Martin Straure' ) ; INSERT …
The first two bugs are important. Those using pt-archiver -bulk-insert with UTF-8 data should definitely upgrade. Those using pt-heartbeat -utc should also definitely upgrade.
Users may note the revival of the -show-all option in pt-query-digest. This had been removed in 2.2.1, but resulted in too much output in certain cases.
A new -recursion-method was added to pt-table-checksum: cluster. This method attempts to auto-discover cluster nodes, alleviating the need to specify cluster node DSNs…
UTF-8 is now the default encoding
The list of new features is more than we can cover here. If you really wanted to dig in you can check the Ruby changelog
Running 2.0 on Heroku
If you're interested in taking advantage of these new features give it a try on Heroku today. To run Ruby 2.0 on Heroku you'll need this line in your Gemfile : ruby "2.0.0"
Then commit to git: $ git add . $ git commit -m "Using Ruby …
The UTF-8 default encoding, which make many magic comments omissible
Built-in libraries
Enumerable#lazy and Enumerator::Lazy, for (possibly infinite) lazy stream
Enumerator#size and Range#size, for lazy size evaluation
#to_h, which is a new convention for conversion to Hash
Onigmo, which is a new regexp engine (a fork of Oniguruma)
Asynchronous exception handling API
Debug support
DTrace support, which enables run-time diagnosis in production
TracePoint, which …
…Unix, Plan 9 OS, The Unix Programming Environment book, UTF-8, and most recently the Go programming language .
Waza is the Japanese word for art and technique and it's where we celebrate craft and the creative process of software development with technical sessions and interactive artistic happenings .
In planning Waza 2013 we went back to reflect on last year's speakers. And we want to make the talks readily availble to anybody who could …
The default output encoding for RDoc is now UTF-8. Previously RDoc used the default external encoding which was determined from your locale. Issue #106 by Justin Baker.
RDoc:: RI::Store is now RDoc::Store so ri data generated by RDoc 4 cannot be read by earlier versions of RDoc. RDoc:: RI::Store exists as an alias of RDoc::Store so ri data from older versions can still be read. RDoc:: RI::Store will be removed in RDoc 5.
Tests that create RDoc:: CodeObjects on …
The default encoding for ruby scripts is now UTF-8 [#6679]. Some people report that it affects existing programs, such as some benchmark programs becoming very slow [ ruby-dev:46547].
Iconv was removed, which had already been deprecated when M17N was introduced in ruby 1.9. Use String#encode, etc. instead.
There is ABI breakage [ruby-core:48984]. We think that normal users can/should just reinstall extension libraries. You should be aware: DO NOT COPY .so OR .bundle FILES …
…v3.0007 which has a bug that causes Perl not to flag UTF-8 data as being UTF-8. Presuming that the MySQL table/column is using UTF-8, and the Perl MySQL connection is also using UTF-8, then a correct system returns:
PV = 0x9573840 "\343\203\213 \303\250"\0 [UTF8 "\x{30cb} \x{e8}"]
That's a Devel::Peek inside a Perl scalar variable which clearly shows that Perl has recognized and flagged the data at UTF-8. With DBD::mysql v3.0007 …
…gem we ran into some cases where the data being posted back to our controller had invalid UTF-8 bytes. For Griddler, our failing case needs to simulate the body of an email having an invalid byte, and encoded as UTF-8.
What are valid and invalid bytes? This table on Wikipedia tells us bytes 192, 193, and 245-255 are off limits. In ruby's string literal we can represent this by escaping one of those numbers: > "hi \255" => "hi \xAD"
There's …
The default encoding for ruby scripts is now UTF-8 [#6679] . Some people report that it affects existing programs, such as some benchmark programs becoming very slow [ruby-dev:46547] . The spec may change for the official release.
iconv was removed, which had already been deprecated when M17N was introduced in ruby 1.9. Use String#encode, etc. instead.
There is ABI breakage [ruby-core:48984] . We think that normal users can/should just reinstall …