Lingr API
Source Path: svn/trunk/plugins/multilingual_ferret_tools/test/multilingual_analyzer_test.rb (r87)
require 'rubygems'
require 'test/unit'
require File.dirname(__FILE__) + '/../init'
require 'active_support'
class MultilingualFerretTools::AnalyzerTest < Test::Unit::TestCase
  # Latin text is tokenized by word. "is" and "the" are expected to yield
  # no tokens (presumably they are filtered as stop words).
  def test_all_latin
    a = MultilingualFerretTools::Analyzer.new
    ts = a.token_stream 'foo', 'now is the time'
    assert_next_token ts, 'now', 0, 3
    assert_next_token ts, 'time', 11, 15
  end

  # Each non-Latin character becomes its own token. Offsets are byte
  # offsets: every character here is three bytes long in UTF-8.
  def test_all_non_latin
    a = MultilingualFerretTools::Analyzer.new
    ts = a.token_stream 'foo', "\xe6\x82\xaa\xe3\x81\x9d\xe3\x81\x86"
    assert_next_token ts, "\xe6\x82\xaa", 0, 3
    assert_next_token ts, "\xe3\x81\x9d", 3, 6
    assert_next_token ts, "\xe3\x81\x86", 6, 9
  end

  # Mixed input: per-character tokens for the non-Latin runs, word tokens
  # for the Latin run, and no token for "and".
  def test_mixed
    a = MultilingualFerretTools::Analyzer.new
    ts = a.token_stream 'foo', "\xe6\x82\xaa\xe3\x81\x9d\xe3\x81\x86 foo and bar \xe6\x82\xaa\xe3\x81\x9d\xe3\x81\x86"
    assert_next_token ts, "\xe6\x82\xaa", 0, 3
    assert_next_token ts, "\xe3\x81\x9d", 3, 6
    assert_next_token ts, "\xe3\x81\x86", 6, 9
    assert_next_token ts, 'foo', 10, 13
    assert_next_token ts, 'bar', 18, 21
    assert_next_token ts, "\xe6\x82\xaa", 22, 25
    assert_next_token ts, "\xe3\x81\x9d", 25, 28
    assert_next_token ts, "\xe3\x81\x86", 28, 31
  end
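
  private

  # Pulls the next token from the stream and checks its text and offsets.
  # Test::Unit's assert_equal takes (expected, actual), so the expected
  # value goes first to keep failure messages accurate.
  def assert_next_token(ts, text, start, _end)
    token = ts.next
    assert_equal text, token.text
    assert_equal start, token.start
    assert_equal _end, token.end
  end
end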
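
The behavior these tests pin down can be illustrated without Ferret or the plugin loaded. The following is a minimal sketch, not the plugin's implementation: `Token`, `STOP_WORDS`, and `sketch_tokens` are hypothetical names, and the stop-word list is an assumption inferred only from the words the tests expect to be skipped. It emits ASCII word runs as single tokens, emits every other non-ASCII character as its own token, and reports byte offsets.

```ruby
# Hypothetical stand-in for the analyzer under test; illustration only.
Token = Struct.new(:text, :start, :end)

# Assumed stop words, inferred from the tokens the tests skip.
STOP_WORDS = %w[is the and].freeze

def sketch_tokens(input)
  tokens = []
  offset = 0 # running byte offset into the input
  # Match a run of ASCII letters/digits, or else any single character.
  input.scan(/[A-Za-z0-9]+|./m) do |piece|
    start = offset
    offset += piece.bytesize
    # Skip ASCII whitespace and punctuation.
    next if piece.ascii_only? && piece !~ /[A-Za-z0-9]/
    # Drop the assumed stop words.
    next if STOP_WORDS.include?(piece.downcase)
    tokens << Token.new(piece, start, offset)
  end
  tokens
end

p sketch_tokens('now is the time').map(&:to_a)
# => [["now", 0, 3], ["time", 11, 15]]
```

Tracking offsets with `bytesize` rather than character counts is what makes each three-byte UTF-8 character advance the offset by 3, matching the 0/3/6/9 boundaries asserted above.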
Change log: r76 by dburkes on Oct 06, 2008: move to Google Code repository
File info
Size: 1435 bytes, 47 lines