Re: Extracting text from PDF (Re: dvipdfm vs. pdflatex?)

Name: Takashima
Date: 2007-05-03 03:22:02
IP address: 61.21.52.*

>>47849
For Greek fonts, if you add something like the list below to CB.enc and save the result as CBlig.enc, then Unicode support is possible with something like

otftotfm -e CBlig.enc -fkern -fliga -n llgrmn LinLibertine_Re.otf >>llgr.map

As for the variant umlaut, a superscript e is defined in Unicode, so I tried it in the hope that it would work somehow even without becoming a single composed character, but it does not work. (Perhaps because the font that supports it is ttf rather than otf?)

% LIGKERN uni1FFE uni1FC0 =: uni1FDF;
% LIGKERN uni1FFE tonos =: uni1FDE;
% LIGKERN uni1FFE uni1FEF =: uni1FDD;
% LIGKERN uni1FFE alpha =: uni1F01;
% LIGKERN uni1FFE epsilon =: uni1F11;
% LIGKERN uni1FFE eta =: uni1F21;
% LIGKERN uni1FFE rho =: uni1FE5;
% LIGKERN uni1FFE omicron =: uni1F41;
% LIGKERN uni1FFE omega =: uni1F61;
% LIGKERN uni1FFE iota =: uni1F31;
% LIGKERN uni1FFE upsilon =: uni1F51;
% LIGKERN uni1FFE {-150} Alpha;
% LIGKERN uni1FFE {-50} Omicron;
% LIGKERN uni1FFE {-50} uni03A9;
% LIGKERN uni1FBF uni1FC0 =: uni1FCF;
% LIGKERN uni1FBF tonos =: uni1FCE;
% LIGKERN uni1FBF uni1FEF =: uni1FCD;
% LIGKERN uni1FBF alpha =: uni1F00;
% LIGKERN uni1FBF epsilon =: uni1F10;
% LIGKERN uni1FBF eta =: uni1F20;
% LIGKERN uni1FBF rho =: uni1FE4;
% LIGKERN uni1FBF omicron =: uni1F40;
% LIGKERN uni1FBF omega =: uni1F60;
% LIGKERN uni1FBF iota =: uni1F30;
% LIGKERN uni1FBF upsilon =: uni1F50;
% LIGKERNX uni1FBF {-150} Alpha;
% LIGKERNX uni1FBF {-100} Omicron;
% LIGKERNX uni1FBF {-100} uni03A9;
% LIGKERN dieresis uni1FC0 =: uni1FC1;
% LIGKERN dieresis tonos =: dieresistonos;
% LIGKERN dieresis uni1FEF =: uni1FED;
% LIGKERN dieresis || =: quoteright;
% LIGKERN dieresis iota =: iotadieresis;
% LIGKERN dieresis upsilon =: upsilondieresis;
% LIGKERN dieresis Iota =: Iotadieresis;
% LIGKERN dieresis Upsilon =: Upsilondieresis;
% LIGKERN uni1FED iota =: uni1FD2;
% LIGKERN uni1FED upsilon =: uni1FE2;
% LIGKERN dieresistonos iota =: iotadieresistonos;
% LIGKERN dieresistonos upsilon =: upsilondieresistonos;
% LIGKERN uni1FC1 iota =: uni1FD7;
% LIGKERN uni1FC1 upsilon =: uni1FE7;
% LIGKERN uni1FEF dieresis =: uni1FED;
% LIGKERN uni1FEF uni1FEF =: quoteleft;
% LIGKERN uni1FEF uni1FFE =: uni1FDD;
% LIGKERN uni1FEF uni1FBF =: uni1FCD;
% LIGKERN uni1FEF alpha =: uni1F70;
% LIGKERN uni1FEF eta =: uni1F74;
% LIGKERN uni1FEF omega =: uni1F7C;
% LIGKERN uni1FEF epsilon =: uni1F72;
% LIGKERN uni1FEF omicron =: uni1F78;
% LIGKERN uni1FEF iota =: uni1F76;
% LIGKERN uni1FEF upsilon =: uni1F7A;
% LIGKERNX uni1FEF {-150} Alpha;
% LIGKERNX uni1FEF {-50} Omicron;
% LIGKERNX uni1FEF {-50} uni03A9;
% LIGKERN tonos dieresis =: dieresistonos;
% LIGKERN tonos tonos =: quoteright;
% LIGKERN tonos uni1FFE =: uni1FDE;
% LIGKERN tonos uni1FBF =: uni1FCE;
% LIGKERN tonos alpha =: alphatonos;
% LIGKERN tonos eta =: etatonos;
% LIGKERN tonos omega =: omegatonos;
% LIGKERN tonos epsilon =: epsilontonos;
% LIGKERN tonos omicron =: omicrontonos;
% LIGKERN tonos iota =: iotatonos;
% LIGKERN tonos upsilon =: upsilontonos;
% LIGKERNX tonos {-200} Alpha;
% LIGKERNX tonos {-150} Omicron;
% LIGKERNX tonos {-150} uni03A9;
% LIGKERN uni1FDE alpha =: uni1F05;
% LIGKERN uni1FDE eta =: uni1F25;
% LIGKERN uni1FDE omega =: uni1F65;
% LIGKERN uni1FDE epsilon =: uni1F15;
% LIGKERN uni1FDE omicron =: uni1F45;
% LIGKERN uni1FDE iota =: uni1F35;
% LIGKERN uni1FDE upsilon =: uni1F55;
% LIGKERNX uni1FDE {-250} Alpha;
% LIGKERNX uni1FDE {-100} Omicron;
% LIGKERN uni1FCE alpha =: uni1F04;
% LIGKERN uni1FCE eta =: uni1F24;
% LIGKERN uni1FCE omega =: uni1F64;
% LIGKERN uni1FCE epsilon =: uni1F14;
% LIGKERN uni1FCE omicron =: uni1F44;
% LIGKERN uni1FCE iota =: uni1F34;
% LIGKERN uni1FCE upsilon =: uni1F54;
% LIGKERNX uni1FCE {-250} Alpha;
% LIGKERNX uni1FCE {-100} Omicron;
% LIGKERNX uni1FCE {-100} uni03A9;
% LIGKERN uni1FCD alpha =: uni1F02;
% LIGKERN uni1FCD eta =: uni1F22;
% LIGKERN uni1FCD omega =: uni1F62;
% LIGKERN uni1FCD epsilon =: uni1F12;
% LIGKERN uni1FCD omicron =: uni1F42;
% LIGKERN uni1FCD iota =: uni1F32;
% LIGKERN uni1FCD upsilon =: uni1F52;
% LIGKERNX uni1FCD {-250} Alpha;
% LIGKERNX uni1FCD {-100} Omicron;
% LIGKERN uni1FDD alpha =: uni1F03;
% LIGKERN uni1FDD eta =: uni1F23;
% LIGKERN uni1FDD omega =: uni1F63;
% LIGKERN uni1FDD epsilon =: uni1F13;
% LIGKERN uni1FDD omicron =: uni1F43;
% LIGKERN uni1FDD iota =: uni1F33;
% LIGKERN uni1FDD upsilon =: uni1F53;
% LIGKERNX uni1FDD {-200} Alpha;
% LIGKERNX uni1FDD {-100} Omicron;
% LIGKERN uni1FC0 dieresis =: uni1FC1;
% LIGKERN uni1FC0 uni1FFE =: uni1FDF;
% LIGKERN uni1FC0 uni1FBF =: uni1FCF;
% LIGKERN uni1FC0 alpha =: uni1FB6;
% LIGKERN uni1FC0 eta =: uni1FC6;
% LIGKERN uni1FC0 omega =: uni1FF6;
% LIGKERN uni1FC0 iota =: uni1FD6;
% LIGKERN uni1FC0 upsilon =: uni1FE6;
% LIGKERNX uni1FC0 {-250} Alpha;
% LIGKERN uni1FDF alpha =: uni1F07;
% LIGKERN uni1FDF eta =: uni1F27;
% LIGKERN uni1FDF omega =: uni1F67;
% LIGKERN uni1FDF iota =: uni1F37;
% LIGKERN uni1FDF upsilon =: uni1F57;
% LIGKERNX uni1FDF {-250} Alpha;
% LIGKERN uni1FCF alpha =: uni1F06;
% LIGKERN uni1FCF eta =: uni1F26;
% LIGKERN uni1FCF omega =: uni1F66;
% LIGKERN uni1FCF iota =: uni1F36;
% LIGKERN uni1FCF upsilon =: uni1F56;
% LIGKERNX uni1FCF {-250} Alpha;
% LIGKERN alpha uni037A =: uni1FB3;
% LIGKERN eta uni037A =: uni1FC3;
% LIGKERN omega uni037A =: uni1FF3;
% LIGKERN uni1F70 uni037A =: uni1FB2;
% LIGKERN uni1F01 uni037A =: uni1F81;
% LIGKERN uni1F00 uni037A =: uni1F80;
% LIGKERN uni1F03 uni037A =: uni1F83;
% LIGKERN alphatonos uni037A =: uni1FB4;
% LIGKERN uni1F05 uni037A =: uni1F85;
% LIGKERN uni1F04 uni037A =: uni1F84;
% LIGKERN uni1F02 uni037A =: uni1F82;
% LIGKERN uni1FB6 uni037A =: uni1FB7;
% LIGKERN uni1F07 uni037A =: uni1F87;
% LIGKERN uni1F06 uni037A =: uni1F86;
% LIGKERN uni1F74 uni037A =: uni1FC2;
% LIGKERN uni1F21 uni037A =: uni1F91;
% LIGKERN uni1F20 uni037A =: uni1F90;
% LIGKERN etatonos uni037A =: uni1FC4;
% LIGKERN uni1F25 uni037A =: uni1F95;
% LIGKERN uni1F24 uni037A =: uni1F94;
% LIGKERN uni1F23 uni037A =: uni1F93;
% LIGKERN uni1FC6 uni037A =: uni1FC7;
% LIGKERN uni1F27 uni037A =: uni1F97;
% LIGKERN uni1F26 uni037A =: uni1F96;
% LIGKERN uni1F22 uni037A =: uni1F92;
% LIGKERN uni1F7C uni037A =: uni1FF2;
% LIGKERN uni1F61 uni037A =: uni1FA1;
% LIGKERN uni1F60 uni037A =: uni1FA0;
% LIGKERN uni1F63 uni037A =: uni1FA3;
% LIGKERN omegatonos uni037A =: uni1FF4;
% LIGKERN uni1F65 uni037A =: uni1FA5;
% LIGKERN uni1F64 uni037A =: uni1FA4;
% LIGKERN uni1F62 uni037A =: uni1FA2;
% LIGKERN uni1FF6 uni037A =: uni1FF7;
% LIGKERN uni1F67 uni037A =: uni1FA7;
% LIGKERN uni1F66 uni037A =: uni1FA6;
% LIGKERN sigma period =:| sigma1;
% LIGKERN sigma comma =:| sigma1;
% LIGKERN sigma anoteleia =:| sigma1;
% LIGKERN sigma colon =:| sigma1;
% LIGKERN sigma semicolon =:| sigma1;
% LIGKERN sigma exclam =:| sigma1;
% LIGKERN sigma parenright =:| sigma1;
% LIGKERN sigma slash =:| sigma1;
% LIGKERN sigma bracketright =:| sigma1;
% LIGKERN sigma || =:| sigma1;
% LIGKERN sigma asterisk =:| sigma1;
% LIGKERN sigma quoteright =:| sigma1;
% LIGKERN internalchar2 internalchar2 =: internalchar2;
% LIGKERN internalchar2 Alpha =: Alpha;
% LIGKERN internalchar2 Beta =: Beta;
% LIGKERN internalchar2 Gamma =: Gamma;
% LIGKERN internalchar2 uni0394 =: uni0394;
% LIGKERN internalchar2 Epsilon =: Epsilon;
% LIGKERN internalchar2 Zeta =: Zeta;
% LIGKERN internalchar2 Eta =: Eta;
% LIGKERN internalchar2 Theta =: Theta;
% LIGKERN internalchar2 Iota =: Iota;
% LIGKERN internalchar2 Kappa =: Kappa;
% LIGKERN internalchar2 Lambda =: Lambda;
% LIGKERN internalchar2 Mu =: Mu;
% LIGKERN internalchar2 Nu =: Nu;
% LIGKERN internalchar2 Xi =: Xi;
% LIGKERN internalchar2 Omicron =: Omicron;
% LIGKERN internalchar2 Pi =: Pi;
% LIGKERN internalchar2 Rho =: Rho;
% LIGKERN internalchar2 Sigma =: Sigma;
% LIGKERN internalchar2 Tau =: Tau;
% LIGKERN internalchar2 Upsilon =: Upsilon;
% LIGKERN internalchar2 Phi =: Phi;
% LIGKERN internalchar2 Chi =: Chi;
% LIGKERN internalchar2 Psi =: Psi;
% LIGKERN internalchar2 uni03A9 =: uni03A9;
% LIGKERN Alpha uni037A =: uni1FBC;
% LIGKERN Eta uni037A =: uni1FCC;
% LIGKERN uni03A9 uni037A =: uni1FFC;
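
With a list this long it is easy to mistype a code point, so a quick check of what each ligature rule resolves to may help. The following is only a minimal sketch and not part of otftotfm: it assumes glyph names of the form uniXXXX stand for U+XXXX, and the NAMED table is my own small illustrative subset of glyph names (names missing from it, such as alphatonos, are simply left unresolved). It recognizes only the two operators used above, "=:" and "=:|", and skips the kern rules.

```python
import re
import unicodedata

# Illustrative subset of glyph names (an assumption of this sketch; extend as needed).
NAMED = {
    "alpha": 0x03B1, "epsilon": 0x03B5, "eta": 0x03B7, "iota": 0x03B9,
    "omicron": 0x03BF, "rho": 0x03C1, "upsilon": 0x03C5, "omega": 0x03C9,
    "tonos": 0x0384, "dieresis": 0x00A8,
}

# Matches only the two ligature operators used in the post: "=:" and "=:|".
LIG_RE = re.compile(r"%\s*LIGKERNX?\s+(\S+)\s+(\S+)\s+=:\|?\s+(\S+?)\s*;")


def glyph_to_char(name):
    """Return the character for a glyph name, or None if it is not known here."""
    m = re.fullmatch(r"uni([0-9A-F]{4})", name)
    if m:
        return chr(int(m.group(1), 16))
    if name in NAMED:
        return chr(NAMED[name])
    return None


def parse_ligatures(text):
    """Extract (first, second, result) glyph-name triples; kern rules are skipped."""
    return [m.groups() for m in LIG_RE.finditer(text)]


def describe(rule):
    """Render one rule, adding Unicode names where the glyph name resolves."""
    parts = []
    for glyph in rule:
        char = glyph_to_char(glyph)
        parts.append(f"{glyph} ({unicodedata.name(char, '?')})" if char else glyph)
    return f"{parts[0]} + {parts[1]} -> {parts[2]}"


# A few rules copied from the list above, including one kern rule.
sample = (
    "% LIGKERN uni1FFE alpha =: uni1F01; "
    "% LIGKERN uni1FBF alpha =: uni1F00; "
    "% LIGKERN uni1FFE {-150} Alpha; "
    "% LIGKERN tonos alpha =: alphatonos;"
)

if __name__ == "__main__":
    for rule in parse_ligatures(sample):
        print(describe(rule))
```

To check the whole file, read CBlig.enc into a string and pass it to parse_ligatures in place of sample; any result glyph whose Unicode name looks wrong (for example a psili where a dasia was intended) points to a rule worth rechecking.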
