2013-02-18 MeCab 0.996
- - configure script の不備によりこiconvへのリンクに失敗する問題を修正
+
- configure script の不備によりiconvへのリンクに失敗する問題を修正
- ユーザ辞書用CSVファイルのコストと左/右文脈IDを付与し, 新たなCSVファイルを生成する機能の追加
- 解析結果からLattice を作成する Lattice::set_result() メソッドを追加. 単体テスト時のスタブの作成等に利用可能
@@ -443,7 +443,7 @@
左から,
-表層形\t品詞,品詞細分類1,品詞細分類2,品詞細分類3,活用形,活用型,原形,読み,発音
+表層形\t品詞,品詞細分類1,品詞細分類2,品詞細分類3,活用型,活用形,原形,読み,発音
となっています。
diff --git a/learn.html b/learn.html
index ba27484..1ccc16f 100644
--- a/learn.html
+++ b/learn.html
@@ -312,7 +312,7 @@ rewrite.def
例
[unigram rewrite]
-# 読み,発音をとりのぞいて, 品詞1,2,3,4,活用形,活用型,原形,よみ を使う
+# 読み,発音をとりのぞいて, 品詞1,2,3,4,活用型,活用形,原形,よみ を使う
*,*,*,*,*,*,*,* $1,$2,$3,$4,$5,$6,$7,$8
# 読みがない場合は無視
*,*,*,*,*,*,* $1,$2,$3,$4,$5,$6,$7,*
@@ -597,11 +597,11 @@
-l オプションによって, どの素性のレベルを使って評価するか指定できます.
- - -l 0: 0 番目の素性のみを使って評価します.
-
- -l 4: 0〜4 番目の素性を使って評価します
+
- -l 0: 分かち書きの精度を評価します.
+
- -l 4: 1(先頭)〜4 番目の素性を使って評価します
- -l -1: 全レベルの素性を使って評価します
-
- -l "0 1 2" 0番目, 0〜1番目, 0〜4番目の3つの評価を表示します.
-
- -l "0 1 -1" 0番目, 0〜1番目, 全レベルの3つの評価を表示します.
+
- -l "0 1 4" 分かち書き, 1番目, 1〜4番目の3つの評価を表示します.
+
- -l "0 1 -1" 分かち書き, 1番目, 全レベルの3つの評価を表示します.
diff --git a/mecab.html b/mecab.html
index a74e555..28b2fcd 100644
--- a/mecab.html
+++ b/mecab.html
@@ -1,72 +1,155 @@
-Content-type: text/html
+Content-type: text/html; charset=UTF-8
Man page of MECAB
MECAB
-Section: MeCab (1)
Updated: July 2006
Index
+Section: User Commands (1)
Updated: February 2019
Index
Return to Main Contents
NAME
-mecab - manual page for mecab of 0.92
+mecab - Yet Another Part-of-Speech and Morphological Analyzer
SYNOPSIS
mecab
-[options] files
+[,options/] ,files/
DESCRIPTION
-MeCab: Yet Another Part-of-Speech and Morphological Analyzer
+Mecab is a morphological analysis system. It reads continuous text sentences
+such as Japanese ones from the standard input, segments them into morpheme
+sequences, and outputs them to the standard output with many additional pieces
+of information (pronunciation, semantic information, etc).
-COPYRIGHT
-
-Copyright © 2001-2006 Taku Kudo
-
+OPTIONS
-Copyright © 2004-2006 Nippon Telegraph and Telephone Corporation
-- -r, --rcfile=FILE
-
+
- -r, --rcfile=,FILE/
-
use FILE as resource file
-
- -d, --dicdir=DIR
-
+
- -d, --dicdir=,DIR/
-
set DIR as a system dicdir
-
- -u, --userdic=FILE
-
+
- -u, --userdic=,FILE/
-
use FILE as a user dictionary
-
- -l, --lattice-level=INT
-
-lattice information level (default 0)
+
- -l, --lattice-level=,INT/
-
+lattice information level (DEPRECATED)
+
- -D, --dictionary-info
-
+show dictionary information and exit
+
- -O, --output-format-type=,TYPE/
-
+set output format type (SEE OUTPUT FORMAT)
- -a, --all-morphs
-
-output all morphs (default false)
-
- -O, --output-format-type=TYPE
-
-set output format type (wakati,none,...)
+output all morphs(default false)
+
- -N, --nbest=,INT/
-
+output N best results (default 1)
- -p, --partial
-
-partial parsing mode
-
- -F, --node-format=STR
-
+partial parsing mode (default false)
+
- -m, --marginal
-
+output marginal probability (default false)
+
- -M, --max-grouping-size=,INT/
-
+maximum grouping size for unknown words (default 24)
+
- -F, --node-format=,STR/
-
use STR as the user-defined node format
-
- -U, --unk-format=STR
-
-use STR as the user-defined unk format
-
- -B, --bos-format=STR
-
-use STR as the user-defined bos format
-
- -E, --eos-format=STR
-
-use STR as the user-defined eos format
-
- -b, --input-buffer-size=INT
-
-set input buffer size (default BUF_SIZE)
+
- -U, --unk-format=,STR/
-
+use STR as the user-defined unknown node format
+
- -B, --bos-format=,STR/
-
+use STR as the user-defined beginning-of-sentence format
+
- -E, --eos-format=,STR/
-
+use STR as the user-defined end-of-sentence format
+
- -S, --eon-format=,STR/
-
+use STR as the user-defined end-of-NBest format
+
- -x, --unk-feature=,STR/
-
+use STR as the feature for unknown word
+
- -b, --input-buffer-size=,INT/
-
+set input buffer size (default 8192)
+
- -P, --dump-config
-
+dump MeCab parameters
- -C, --allocate-sentence
-
allocate new memory for input sentence
-
- -N, --nbest=INT
-
-output N best results (default 1)
-
- -t, --theta=FLOAT
-
+
- -t, --theta=,FLOAT/
-
set temparature parameter theta (default 0.75)
-
- -o, --output=FILE
-
+
- -c, --cost-factor=,INT/
-
+set cost factor (default 700)
+
- -o, --output=,FILE/
-
set the output file name
- -v, --version
-
show the version and exit.
- -h, --help
-
show this help and exit.
+
+
+OUTPUT FORMAT
+
+
+The default output format and the selectable output formats from the
+-O option argument are defined in the resource file.
+There are few special hard coded formats.
+
+
+- "" (null string)
-
+disable format setting of resource file. This is required to set user-defined
+format from the command line.
+
- wakati
-
+output each node separated by a space
+
- dump
-
+dump all node data in one line
+
- none
-
+no output
+
+
+See <https://taku910.github.io/mecab/format.html> for details of format
+definition.
+
+
+
DICTIONARY
+
+
+See
+
+- •
-
+<https://taku910.github.io/mecab/learn.html>
+
- •
-
+<https://taku910.github.io/mecab/dic-detail.html>
+
+
+
+for details of preparation and updating of the mecab dictionary.
+
+
+
EXAMPLE
+
+
+Output reading in KataKana with installed and fully configured UniDic.
+
+
$ mecab -O "" -F"%pS%f[9]" -U"%M" -E"\n" <input_file
+
+Output writing in Hiragana with installed and fully configured UniDic.
+
+
$ mecab -O "" -F"%pS%f[6]" -U"%M" -E"\n" <input_file \
+
+
+
|nkf --hiragana
+
+Output reading in KataKana with installed and fully configured Ipadic.
+
+
$ mecab -O yomi <input_file
+
+
+
COPYRIGHT
+
+Copyright(C) 2001-2012 Taku Kudo
+
+
+Copyright(C) 2004-2008 Nippon Telegraph and Telephone Corporation
+
+SEE ALSO
+
+Full documentation at: <https://taku910.github.io/mecab/>
+
Index
@@ -74,12 +157,17 @@ COPYRIGHT
NAME
SYNOPSIS
DESCRIPTION
-COPYRIGHT
+OPTIONS
+OUTPUT FORMAT
+DICTIONARY
+EXAMPLE
+COPYRIGHT
+SEE ALSO
This document was created by
man2html,
using the manual pages.
-Time: 15:16:13 GMT, July 09, 2006
+Time: 06:47:11 GMT, February 24, 2019
diff --git a/mecab/doc/dic.html b/mecab/doc/dic.html
index d903f4b..80ffed0 100644
--- a/mecab/doc/dic.html
+++ b/mecab/doc/dic.html
@@ -83,7 +83,7 @@ エントリのフォーマット (活用しない語)
左から,
-表層形,左文脈ID,右文脈ID,コスト,品詞,品詞細分類1,品詞細分類2,品詞細分類3,活用形,活用型,原形,読み,発音
+表層形,左文脈ID,右文脈ID,コスト,品詞,品詞細分類1,品詞細分類2,品詞細分類3,活用型,活用形,原形,読み,発音
です.
diff --git a/mecab/doc/index.html b/mecab/doc/index.html
index 0babaeb..64fd4a2 100644
--- a/mecab/doc/index.html
+++ b/mecab/doc/index.html
@@ -443,7 +443,7 @@
左から,
-表層形\t品詞,品詞細分類1,品詞細分類2,品詞細分類3,活用形,活用型,原形,読み,発音
+表層形\t品詞,品詞細分類1,品詞細分類2,品詞細分類3,活用型,活用形,原形,読み,発音
となっています。
diff --git a/mecab/doc/learn.html b/mecab/doc/learn.html
index ba27484..9f3c689 100644
--- a/mecab/doc/learn.html
+++ b/mecab/doc/learn.html
@@ -312,7 +312,7 @@ rewrite.def
例
[unigram rewrite]
-# 読み,発音をとりのぞいて, 品詞1,2,3,4,活用形,活用型,原形,よみ を使う
+# 読み,発音をとりのぞいて, 品詞1,2,3,4,活用型,活用形,原形,よみ を使う
*,*,*,*,*,*,*,* $1,$2,$3,$4,$5,$6,$7,$8
# 読みがない場合は無視
*,*,*,*,*,*,* $1,$2,$3,$4,$5,$6,$7,*
diff --git a/mecab/doc/mecab.html b/mecab/doc/mecab.html
index a74e555..28b2fcd 100644
--- a/mecab/doc/mecab.html
+++ b/mecab/doc/mecab.html
@@ -1,72 +1,155 @@
-Content-type: text/html
+Content-type: text/html; charset=UTF-8
Man page of MECAB
MECAB
-Section: MeCab (1)
Updated: July 2006
Index
+Section: User Commands (1)
Updated: February 2019
Index
Return to Main Contents
NAME
-mecab - manual page for mecab of 0.92
+mecab - Yet Another Part-of-Speech and Morphological Analyzer
SYNOPSIS
mecab
-[options] files
+[,options/] ,files/
DESCRIPTION
-MeCab: Yet Another Part-of-Speech and Morphological Analyzer
+Mecab is a morphological analysis system. It reads continuous text sentences
+such as Japanese ones from the standard input, segments them into morpheme
+sequences, and outputs them to the standard output with many additional pieces
+of information (pronunciation, semantic information, etc).
-COPYRIGHT
-
-Copyright © 2001-2006 Taku Kudo
-
+OPTIONS
-Copyright © 2004-2006 Nippon Telegraph and Telephone Corporation
-- -r, --rcfile=FILE
-
+
- -r, --rcfile=,FILE/
-
use FILE as resource file
-
- -d, --dicdir=DIR
-
+
- -d, --dicdir=,DIR/
-
set DIR as a system dicdir
-
- -u, --userdic=FILE
-
+
- -u, --userdic=,FILE/
-
use FILE as a user dictionary
-
- -l, --lattice-level=INT
-
-lattice information level (default 0)
+
- -l, --lattice-level=,INT/
-
+lattice information level (DEPRECATED)
+
- -D, --dictionary-info
-
+show dictionary information and exit
+
- -O, --output-format-type=,TYPE/
-
+set output format type (SEE OUTPUT FORMAT)
- -a, --all-morphs
-
-output all morphs (default false)
-
- -O, --output-format-type=TYPE
-
-set output format type (wakati,none,...)
+output all morphs(default false)
+
- -N, --nbest=,INT/
-
+output N best results (default 1)
- -p, --partial
-
-partial parsing mode
-
- -F, --node-format=STR
-
+partial parsing mode (default false)
+
- -m, --marginal
-
+output marginal probability (default false)
+
- -M, --max-grouping-size=,INT/
-
+maximum grouping size for unknown words (default 24)
+
- -F, --node-format=,STR/
-
use STR as the user-defined node format
-
- -U, --unk-format=STR
-
-use STR as the user-defined unk format
-
- -B, --bos-format=STR
-
-use STR as the user-defined bos format
-
- -E, --eos-format=STR
-
-use STR as the user-defined eos format
-
- -b, --input-buffer-size=INT
-
-set input buffer size (default BUF_SIZE)
+
- -U, --unk-format=,STR/
-
+use STR as the user-defined unknown node format
+
- -B, --bos-format=,STR/
-
+use STR as the user-defined beginning-of-sentence format
+
- -E, --eos-format=,STR/
-
+use STR as the user-defined end-of-sentence format
+
- -S, --eon-format=,STR/
-
+use STR as the user-defined end-of-NBest format
+
- -x, --unk-feature=,STR/
-
+use STR as the feature for unknown word
+
- -b, --input-buffer-size=,INT/
-
+set input buffer size (default 8192)
+
- -P, --dump-config
-
+dump MeCab parameters
- -C, --allocate-sentence
-
allocate new memory for input sentence
-
- -N, --nbest=INT
-
-output N best results (default 1)
-
- -t, --theta=FLOAT
-
+
- -t, --theta=,FLOAT/
-
set temparature parameter theta (default 0.75)
-
- -o, --output=FILE
-
+
- -c, --cost-factor=,INT/
-
+set cost factor (default 700)
+
- -o, --output=,FILE/
-
set the output file name
- -v, --version
-
show the version and exit.
- -h, --help
-
show this help and exit.
+
+
+OUTPUT FORMAT
+
+
+The default output format and the selectable output formats from the
+-O option argument are defined in the resource file.
+There are few special hard coded formats.
+
+
+- "" (null string)
-
+disable format setting of resource file. This is required to set user-defined
+format from the command line.
+
- wakati
-
+output each node separated by a space
+
- dump
-
+dump all node data in one line
+
- none
-
+no output
+
+
+See <https://taku910.github.io/mecab/format.html> for details of format
+definition.
+
+
+
DICTIONARY
+
+
+See
+
+- •
-
+<https://taku910.github.io/mecab/learn.html>
+
- •
-
+<https://taku910.github.io/mecab/dic-detail.html>
+
+
+
+for details of preparation and updating of the mecab dictionary.
+
+
+
EXAMPLE
+
+
+Output reading in KataKana with installed and fully configured UniDic.
+
+
$ mecab -O "" -F"%pS%f[9]" -U"%M" -E"\n" <input_file
+
+Output writing in Hiragana with installed and fully configured UniDic.
+
+
$ mecab -O "" -F"%pS%f[6]" -U"%M" -E"\n" <input_file \
+
+
+
|nkf --hiragana
+
+Output reading in KataKana with installed and fully configured Ipadic.
+
+
$ mecab -O yomi <input_file
+
+
+
COPYRIGHT
+
+Copyright(C) 2001-2012 Taku Kudo
+
+
+Copyright(C) 2004-2008 Nippon Telegraph and Telephone Corporation
+
+SEE ALSO
+
+Full documentation at: <https://taku910.github.io/mecab/>
+
Index
@@ -74,12 +157,17 @@ COPYRIGHT
NAME
SYNOPSIS
DESCRIPTION
-COPYRIGHT
+OPTIONS
+OUTPUT FORMAT
+DICTIONARY
+EXAMPLE
+COPYRIGHT
+SEE ALSO
This document was created by
man2html,
using the manual pages.
-Time: 15:16:13 GMT, July 09, 2006
+Time: 06:47:11 GMT, February 24, 2019
diff --git a/mecab/man/Makefile.am b/mecab/man/Makefile.am
index aac694b..ab6ca24 100644
--- a/mecab/man/Makefile.am
+++ b/mecab/man/Makefile.am
@@ -1,12 +1,10 @@
# Process this file with automake to produce Makefile.in
man_MANS = mecab.1
-EXTRA_DIST = $(man_MANS)
+EXTRA_DIST = $(man_MANS) mecab1.ins mecab2.ins meca13.ins man.sh
update-man:
@for man in $(man_MANS); do \
cmd=`basename $$man .1` && \
- LANG=C help2man --no-info ../src/$$cmd | \
- perl -pe s/\\bFSF\\b/MeCab/ | \
- perl -pe 's/(LT|lt)-//g' > $$man; \
+ LANG=C ./man.sh ../src/$$cmd > $$man; \
done
diff --git a/mecab/man/man.sh b/mecab/man/man.sh
new file mode 100755
index 0000000..04525ba
--- /dev/null
+++ b/mecab/man/man.sh
@@ -0,0 +1,31 @@
+#!/bin/sh
+# Process this file with automake to produce Makefile.in
+cmdarg=$1
+if [ -n "$cmdarg" ]; then
+ cmd=`basename $cmdarg .1`
+else
+ cmdarg="mecab"
+ cmd="mecab"
+fi
+man="${cmd}.1"
+
+# TOP PART while including DESCRIPTION contents
+LANG=C help2man \
+--no-info \
+--name='Yet Another Part\-of\-Speech and Morphological Analyzer' \
+--include="mecab1.ins" \
+$cmd |\
+sed -e '/^\.PP/,/^Copyright.*Nippon/d' |\
+sed -e 's/(wakati,.*$/(SEE OUTPUT FORMAT)/' > $man
+
+# OUTPUT FORMAT, DICTIONARY
+cat mecab2.ins >> $man
+
+# COPYRIGHT
+echo '.SH "COPYRIGHT' >> $man
+LANG=C help2man $cmdarg |grep -ie "^Copyright" |\
+sed -e 's/Kudo/Kudo\n.br/' - >> $man
+
+# SEE ALSO
+cat mecab3.ins >> $man
+
diff --git a/mecab/man/mecab.1 b/mecab/man/mecab.1
index 90ed1d6..8fd48db 100644
--- a/mecab/man/mecab.1
+++ b/mecab/man/mecab.1
@@ -1,77 +1,84 @@
-.\" DO NOT MODIFY THIS FILE! It was generated by help2man 1.23.
-.TH MECAB "1" "March 2009" "mecab of 0.98pre1" MeCab
+.\" DO NOT MODIFY THIS FILE! It was generated by help2man 1.47.8.
+.TH MECAB "1" "February 2019" "mecab of 0.996" "User Commands"
.SH NAME
-mecab \- manual page for mecab of 0.98pre1
+mecab \- Yet Another Part\-of\-Speech and Morphological Analyzer
.SH SYNOPSIS
.B mecab
-[\fIoptions\fR] \fIfiles\fR
+[\fI\,options\/\fR] \fI\,files\/\fR
.SH DESCRIPTION
-MeCab: Yet Another Part-of-Speech and Morphological Analyzer
-.PP
-Copyright(C) 2001-2008 Taku Kudo
-Copyright(C) 2004-2008 Nippon Telegraph and Telephone Corporation
+Mecab is a morphological analysis system. It reads continuous text sentences
+such as Japanese ones from the standard input, segments them into morpheme
+sequences, and outputs them to the standard output with many additional pieces
+of information (pronunciation, semantic information, etc).
+.SH OPTIONS
.TP
-\fB\-r\fR, \fB\-\-rcfile\fR=\fIFILE\fR
+\fB\-r\fR, \fB\-\-rcfile\fR=\fI\,FILE\/\fR
use FILE as resource file
.TP
-\fB\-d\fR, \fB\-\-dicdir\fR=\fIDIR\fR
+\fB\-d\fR, \fB\-\-dicdir\fR=\fI\,DIR\/\fR
set DIR as a system dicdir
.TP
-\fB\-u\fR, \fB\-\-userdic\fR=\fIFILE\fR
+\fB\-u\fR, \fB\-\-userdic\fR=\fI\,FILE\/\fR
use FILE as a user dictionary
.TP
-\fB\-l\fR, \fB\-\-lattice\-level\fR=\fIINT\fR
-lattice information level (default 0)
+\fB\-l\fR, \fB\-\-lattice\-level\fR=\fI\,INT\/\fR
+lattice information level (DEPRECATED)
.TP
\fB\-D\fR, \fB\-\-dictionary\-info\fR
show dictionary information and exit
.TP
+\fB\-O\fR, \fB\-\-output\-format\-type\fR=\fI\,TYPE\/\fR
+set output format type (SEE OUTPUT FORMAT)
+.TP
\fB\-a\fR, \fB\-\-all\-morphs\fR
output all morphs(default false)
.TP
-\fB\-O\fR, \fB\-\-output\-format\-type\fR=\fITYPE\fR
-set output format type (wakati,none,...)
+\fB\-N\fR, \fB\-\-nbest\fR=\fI\,INT\/\fR
+output N best results (default 1)
.TP
\fB\-p\fR, \fB\-\-partial\fR
-partial parsing mode
+partial parsing mode (default false)
+.TP
+\fB\-m\fR, \fB\-\-marginal\fR
+output marginal probability (default false)
+.TP
+\fB\-M\fR, \fB\-\-max\-grouping\-size\fR=\fI\,INT\/\fR
+maximum grouping size for unknown words (default 24)
+.TP
+\fB\-F\fR, \fB\-\-node\-format\fR=\fI\,STR\/\fR
+use STR as the user\-defined node format
.TP
-\fB\-F\fR, \fB\-\-node\-format\fR=\fISTR\fR
-use STR as the user-defined node format
+\fB\-U\fR, \fB\-\-unk\-format\fR=\fI\,STR\/\fR
+use STR as the user\-defined unknown node format
.TP
-\fB\-U\fR, \fB\-\-unk\-format\fR=\fISTR\fR
-use STR as the user-defined unk format
+\fB\-B\fR, \fB\-\-bos\-format\fR=\fI\,STR\/\fR
+use STR as the user\-defined beginning\-of\-sentence format
.TP
-\fB\-B\fR, \fB\-\-bos\-format\fR=\fISTR\fR
-use STR as the user-defined bos format
+\fB\-E\fR, \fB\-\-eos\-format\fR=\fI\,STR\/\fR
+use STR as the user\-defined end\-of\-sentence format
.TP
-\fB\-E\fR, \fB\-\-eos\-format\fR=\fISTR\fR
-use STR as the user-defined eos format
+\fB\-S\fR, \fB\-\-eon\-format\fR=\fI\,STR\/\fR
+use STR as the user\-defined end\-of\-NBest format
.TP
-\fB\-x\fR, \fB\-\-unk\-feature\fR=\fISTR\fR
+\fB\-x\fR, \fB\-\-unk\-feature\fR=\fI\,STR\/\fR
use STR as the feature for unknown word
.TP
-\fB\-b\fR, \fB\-\-input\-buffer\-size\fR=\fIINT\fR
+\fB\-b\fR, \fB\-\-input\-buffer\-size\fR=\fI\,INT\/\fR
set input buffer size (default 8192)
.TP
\fB\-P\fR, \fB\-\-dump\-config\fR
dump MeCab parameters
.TP
-\fB\-M\fR, \fB\-\-open\-mutable\-dictionary\fR
-open dictioanry with mutable mode (experimental)
-.TP
\fB\-C\fR, \fB\-\-allocate\-sentence\fR
allocate new memory for input sentence
.TP
-\fB\-N\fR, \fB\-\-nbest\fR=\fIINT\fR
-output N best results (default 1)
-.TP
-\fB\-t\fR, \fB\-\-theta\fR=\fIFLOAT\fR
+\fB\-t\fR, \fB\-\-theta\fR=\fI\,FLOAT\/\fR
set temparature parameter theta (default 0.75)
.TP
-\fB\-c\fR, \fB\-\-cost\-factor\fR=\fIINT\fR
+\fB\-c\fR, \fB\-\-cost\-factor\fR=\fI\,INT\/\fR
set cost factor (default 700)
.TP
-\fB\-o\fR, \fB\-\-output\fR=\fIFILE\fR
+\fB\-o\fR, \fB\-\-output\fR=\fI\,FILE\/\fR
set the output file name
.TP
\fB\-v\fR, \fB\-\-version\fR
@@ -79,3 +86,59 @@ show the version and exit.
.TP
\fB\-h\fR, \fB\-\-help\fR
show this help and exit.
+.SH "OUTPUT FORMAT"
+
+The default output format and the selectable output formats from the
+\fB\-O\fR option argument are defined in the resource file.
+There are few special hard coded formats.
+
+.TP
+\fB""\fR (null string)
+disable format setting of resource file. This is required to set user\-defined
+format from the command line.
+.TP
+\fBwakati\fR
+output each node separated by a space
+.TP
+\fBdump\fR
+dump all node data in one line
+.TP
+\fBnone\fR
+no output
+
+.PP
+See for details of format
+definition.
+
+.SH DICTIONARY
+
+See
+.IP \(bu 4
+
+.IP \(bu 4
+
+.PP
+for details of preparation and updating of the mecab dictionary.
+
+.SH EXAMPLE
+
+Output reading in KataKana with installed and fully configured UniDic.
+
+ $ mecab -O "" -F"%pS%f[9]" -U"%M" -E"\\n"
diff --git a/mecab/man/mecab1.ins b/mecab/man/mecab1.ins
new file mode 100644
index 0000000..bff917c
--- /dev/null
+++ b/mecab/man/mecab1.ins
@@ -0,0 +1,6 @@
+[DESCRIPTION]
+Mecab is a morphological analysis system. It reads continuous text sentences
+such as Japanese ones from the standard input, segments them into morpheme
+sequences, and outputs them to the standard output with many additional pieces
+of information (pronunciation, semantic information, etc).
+.SH OPTIONS
diff --git a/mecab/man/mecab2.ins b/mecab/man/mecab2.ins
new file mode 100644
index 0000000..c4f8ccb
--- /dev/null
+++ b/mecab/man/mecab2.ins
@@ -0,0 +1,50 @@
+.SH "OUTPUT FORMAT"
+
+The default output format and the selectable output formats from the
+\fB\-O\fR option argument are defined in the resource file.
+There are few special hard coded formats.
+
+.TP
+\fB""\fR (null string)
+disable format setting of resource file. This is required to set user\-defined
+format from the command line.
+.TP
+\fBwakati\fR
+output each node separated by a space
+.TP
+\fBdump\fR
+dump all node data in one line
+.TP
+\fBnone\fR
+no output
+
+.PP
+See for details of format
+definition.
+
+.SH DICTIONARY
+
+See
+.IP \(bu 4
+
+.IP \(bu 4
+
+.PP
+for details of preparation and updating of the mecab dictionary.
+
+.SH EXAMPLE
+
+Output reading in KataKana with installed and fully configured UniDic.
+
+ $ mecab -O "" -F"%pS%f[9]" -U"%M" -E"\\n"
diff --git a/mecab/python/setup.py b/mecab/python/setup.py
index 4486cbb..657945a 100644
--- a/mecab/python/setup.py
+++ b/mecab/python/setup.py
@@ -7,7 +7,7 @@ def cmd1(str):
return os.popen(str).readlines()[0][:-1]
def cmd2(str):
- return string.split (cmd1(str))
+ return cmd1(str).split()
setup(name = "mecab-python",
version = cmd1("mecab-config --version"),