| 1 | Summary: Text Categorization Library |
---|
| 2 | Summary(ja): テキスト分類ライブラリ |
---|
| 3 | Name: libtextcat |
---|
| 4 | Version: 2.2 |
---|
| 5 | Release: 1%{?_dist_release} |
---|
| 6 | Group: System Environment/Libraries |
---|
| 7 | License: BSD |
---|
| 8 | Source0: http://software.wise-guys.nl/download/%{name}-%{version}.tar.gz |
---|
| 9 | Source1: http://external.openoffice.org/source/browse/*checkout*/external/libtextcat/data/new_fingerprints/fpdb.conf |
---|
| 10 | Source2: http://external.openoffice.org/source/browse/*checkout*/external/libtextcat/data/new_fingerprints/lm/chinese_simplified.lm |
---|
| 11 | Source3: http://external.openoffice.org/source/browse/*checkout*/external/libtextcat/data/new_fingerprints/lm/chinese_traditional.lm |
---|
| 12 | Source4: http://external.openoffice.org/source/browse/*checkout*/external/libtextcat/data/new_fingerprints/lm/japanese.lm |
---|
| 13 | URL: http://software.wise-guys.nl/libtextcat/ |
---|
| 14 | Buildroot: %{_tmppath}/%{name}-%{version}-%{release}-root-%(%{__id_u} -n) |
---|
| 15 | BuildRequires: libtool |
---|
| 16 | Patch0: libtextcat-2.2-exportapi.patch |
---|
| 17 | Patch1: libtextcat-2.2-OOo.patch |
---|
| 18 | |
---|
| 19 | %description |
---|
| 20 | Libtextcat is a library with functions that implement the classification |
---|
| 21 | technique described in Cavnar & Trenkle, "N-Gram-Based Text Categorization". |
---|
| 22 | It was primarily developed for language guessing, a task on which it is known |
---|
| 23 | to perform with near-perfect accuracy. |
---|
| 24 | |
---|
| 25 | %package devel |
---|
| 26 | Summary: Support files necessary to compile applications with libtextcat |
---|
| 27 | Summary(ja): Support files necessary to compile applications with libtextcat |
---|
| 28 | Group: Development/Libraries |
---|
| 29 | Requires: libtextcat = %{version}-%{release} |
---|
| 30 | |
---|
| 31 | %description devel |
---|
| 32 | Libraries, headers, and support files necessary to compile applications |
---|
| 33 | using libtextcat. |
---|
| 34 | |
---|
| 35 | %prep |
---|
| 36 | %setup -q |
---|
| 37 | %patch0 -p1 -b .exportapi.patch |
---|
| 38 | %patch1 -p1 -b .ooo.patch |
---|
| 39 | |
---|
| 40 | %build |
---|
| 41 | autoreconf -f -i |
---|
| 42 | %configure --disable-static |
---|
| 43 | make %{?_smp_mflags} |
---|
| 44 | |
---|
| 45 | %install |
---|
| 46 | rm -rf $RPM_BUILD_ROOT |
---|
| 47 | make DESTDIR=$RPM_BUILD_ROOT install |
---|
| 48 | rm -f $RPM_BUILD_ROOT/%{_libdir}/*.la |
---|
| 49 | mkdir -p $RPM_BUILD_ROOT/%{_datadir}/libtextcat |
---|
| 50 | cp -p %{SOURCE1} $RPM_BUILD_ROOT/%{_datadir}/libtextcat |
---|
| 51 | cd langclass/LM |
---|
| 52 | cp -p amharic-utf.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/amharic_utf.lm |
---|
| 53 | cp -p yiddish-utf.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/yiddish_utf.lm |
---|
| 54 | cp -p afrikaans.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/afrikaans.lm |
---|
| 55 | cp -p basque.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/basque.lm |
---|
| 56 | cp -p bosnian.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/bosnian.lm |
---|
| 57 | cp -p croatian-ascii.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/croatian.lm |
---|
| 58 | cp -p drents.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/drents.lm |
---|
| 59 | cp -p dutch.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/dutch.lm |
---|
| 60 | cp -p english.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/english.lm |
---|
| 61 | cp -p icelandic.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/icelandic.lm |
---|
| 62 | cp -p indonesian.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/indonesian.lm |
---|
| 63 | cp -p latin.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/latin.lm |
---|
| 64 | cp -p malay.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/malay.lm |
---|
| 65 | cp -p manx.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/manx_gaelic.lm |
---|
| 66 | cp -p marathi.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/marathi.lm |
---|
| 67 | cp -p nepali.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/nepali.lm |
---|
| 68 | cp -p romanian.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/romanian.lm |
---|
| 69 | cp -p sanskrit.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/sanskrit.lm |
---|
| 70 | cp -p scots.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/scots.lm |
---|
| 71 | cp -p serbian-ascii.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/serbian_ascii.lm |
---|
| 72 | cp -p slovak-ascii.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/slovak_ascii.lm |
---|
| 73 | cp -p swahili.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/swahili.lm |
---|
| 74 | cp -p tagalog.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/tagalog.lm |
---|
| 75 | cp -p welsh.lm $RPM_BUILD_ROOT/%{_datadir}/libtextcat/welsh.lm |
---|
| 76 | iconv -f WINDOWS-1256 -t UTF-8 arabic-windows1256.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/arabic.lm |
---|
| 77 | iconv -f ISO-8859-1 -t UTF-8 albanian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/albanian.lm |
---|
| 78 | iconv -f WINDOWS-1251 -t UTF-8 belarus-windows1251.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/belarus.lm |
---|
| 79 | iconv -f ISO-8859-1 -t UTF-8 breton.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/breton.lm |
---|
| 80 | iconv -f ISO-8859-1 -t UTF-8 catalan.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/catalan.lm |
---|
| 81 | iconv -f ISO-8859-2 -t UTF-8 czech-iso8859_2.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/czech.lm |
---|
| 82 | iconv -f ISO-8859-1 -t UTF-8 danish.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/danish.lm |
---|
| 83 | iconv -f ISO-8859-3 -t UTF-8 esperanto.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/esperanto.lm |
---|
| 84 | iconv -f ISO-8859-15 -t UTF-8 estonian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/estonian.lm |
---|
| 85 | iconv -f ISO-8859-1 -t UTF-8 finnish.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/finnish.lm |
---|
| 86 | iconv -f ISO-8859-1 -t UTF-8 french.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/french.lm |
---|
| 87 | iconv -f ISO-8859-1 -t UTF-8 frisian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/frisian.lm |
---|
| 88 | iconv -f ISO-8859-1 -t UTF-8 georgian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/georgian.lm |
---|
| 89 | iconv -f ISO-8859-1 -t UTF-8 german.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/german.lm |
---|
| 90 | iconv -f ISO-8859-7 -t UTF-8 greek-iso8859-7.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/greek.lm |
---|
| 91 | iconv -f ISO-8859-8 -t UTF-8 hebrew-iso8859_8.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/hebrew.lm |
---|
| 92 | iconv -f ISO-8859-2 -t UTF-8 hungarian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/hungarian.lm |
---|
| 93 | iconv -f ISO-8859-1 -t UTF-8 irish.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/irish_gaelic.lm |
---|
| 94 | iconv -f ISO-8859-1 -t UTF-8 italian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/italian.lm |
---|
| 95 | iconv -f ISO-8859-13 -t UTF-8 latvian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/latvian.lm |
---|
| 96 | iconv -f ISO-8859-13 -t UTF-8 lithuanian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/lithuanian.lm |
---|
| 97 | iconv -f ISO-8859-1 -t UTF-8 malay.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/malay.lm |
---|
| 98 | iconv -f ISO-8859-1 -t UTF-8 middle_frisian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/middle_frisian.lm |
---|
| 99 | iconv -f ISO-8859-1 -t UTF-8 mingo.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/mingo.lm |
---|
| 100 | iconv -f ISO-8859-1 -t UTF-8 norwegian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/norwegian.lm |
---|
| 101 | iconv -f ISO-8859-2 -t UTF-8 polish.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/polish.lm |
---|
| 102 | iconv -f ISO-8859-1 -t UTF-8 portuguese.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/portuguese.lm |
---|
| 103 | iconv -f ISO-8859-1 -t UTF-8 quechua.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/quechua.lm |
---|
| 104 | iconv -f ISO-8859-1 -t UTF-8 rumantsch.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/romansh.lm |
---|
| 105 | iconv -f ISO-8859-5 -t UTF-8 russian-iso8859_5.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/russian.lm |
---|
| 106 | iconv -f ISO-8859-1 -t UTF-8 scots_gaelic.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/scots_gaelic.lm |
---|
| 107 | iconv -f ISO-8859-2 -t UTF-8 slovenian-iso8859_2.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/slovenian.lm |
---|
| 108 | iconv -f ISO-8859-1 -t UTF-8 spanish.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/spanish.lm |
---|
| 109 | iconv -f ISO-8859-1 -t UTF-8 swedish.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/swedish.lm |
---|
| 110 | iconv -f ISO-8859-9 -t UTF-8 turkish.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/turkish.lm |
---|
| 111 | iconv -f KOI8-R -t UTF-8 ukrainian-koi8_r.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/ukrainian.lm |
---|
| 112 | # These source encodings look wrong to me, but they match what upstream OOo does; raise this upstream. |
---|
| 113 | iconv -f ISO-8859-1 -t UTF-8 hindi.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/hindi.lm |
---|
| 114 | iconv -f ISO-8859-1 -t UTF-8 persian.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/persian.lm |
---|
| 115 | iconv -f ISO-8859-1 -t UTF-8 korean.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/korean.lm |
---|
| 116 | iconv -f ISO-8859-1 -t UTF-8 tamil.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/tamil.lm |
---|
| 117 | iconv -f ISO-8859-1 -t UTF-8 thai.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/thai.lm |
---|
| 118 | iconv -f ISO-8859-1 -t UTF-8 vietnamese.lm > $RPM_BUILD_ROOT/%{_datadir}/libtextcat/vietnamese.lm |
---|
| 119 | # I have no idea how upstream fixed the encoding of these, so ship upstream's copies unchanged. |
---|
| 120 | cp -p %{SOURCE2} $RPM_BUILD_ROOT/%{_datadir}/libtextcat/chinese_simplified.lm |
---|
| 121 | cp -p %{SOURCE3} $RPM_BUILD_ROOT/%{_datadir}/libtextcat/chinese_traditional.lm |
---|
| 122 | cp -p %{SOURCE4} $RPM_BUILD_ROOT/%{_datadir}/libtextcat/japanese.lm |
---|
| 123 | |
---|
| 124 | %post -p /sbin/ldconfig |
---|
| 125 | |
---|
| 126 | %postun -p /sbin/ldconfig |
---|
| 127 | |
---|
| 128 | %files |
---|
| 129 | %defattr(-,root,root,-) |
---|
| 130 | %doc ChangeLog README LICENSE TODO |
---|
| 131 | %{_libdir}/lib*.so.* |
---|
| 132 | %{_datadir}/libtextcat |
---|
| 133 | |
---|
| 134 | %files devel |
---|
| 135 | %defattr(-,root,root,-) |
---|
| 136 | %{_bindir}/createfp |
---|
| 137 | %{_libdir}/*.so |
---|
| 138 | %{_includedir}/%{name}/ |
---|
| 139 | |
---|
| 140 | %clean |
---|
| 141 | rm -rf $RPM_BUILD_ROOT |
---|
| 142 | |
---|
| 143 | %changelog |
---|
| 144 | * Sun Jul 27 2008 Daisuke SUZUKI <daisuke@linux.or.jp> 2.2-1 |
---|
| 145 | - initial build for Vine Linux |
---|
| 146 | |
---|
| 147 | * Mon Feb 04 2008 Caolan McNamara <caolanm@redhat.com> 2.2-5 |
---|
| 148 | - rebuild for stricter headers |
---|
| 149 | |
---|
| 150 | * Wed Aug 29 2007 Caolan McNamara <caolanm@redhat.com> 2.2-4 |
---|
| 151 | - rebuild |
---|
| 152 | |
---|
| 153 | * Wed Aug 08 2007 Caolan McNamara <caolanm@redhat.com> 2.2-3 |
---|
| 154 | - package fingerprints |
---|
| 155 | |
---|
| 156 | * Wed Jul 25 2007 Caolan McNamara <caolanm@redhat.com> 2.2-2 |
---|
| 157 | - fiddle with api for OOo 2.3 |
---|
| 158 | |
---|
| 159 | * Wed Jun 27 2007 Caolan McNamara <caolanm@redhat.com> 2.2-1 |
---|
| 160 | - Initial version |
---|
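
The `iconv` lines in `%install` all follow one pattern: a source encoding, a source file under `langclass/LM`, and an installed file name. As a minimal sketch, the same step could be written as a table-driven loop. The function name `convert_lms`, the temporary demo directory, and the sample file contents are hypothetical and are not part of the packaged spec; only `iconv -f ... -t UTF-8` is taken from the spec itself.

```shell
#!/bin/sh
# Sketch only: a table-driven form of the repeated iconv lines in %install.
# convert_lms SRC_DIR DEST_DIR reads "encoding source-file installed-name"
# triples from stdin and writes each file into DEST_DIR as UTF-8.
convert_lms() {
    src_dir=$1
    dest_dir=$2
    mkdir -p "$dest_dir" || return 1
    while read -r enc src dst; do
        # Skip blank lines and comment lines in the table.
        case $enc in ''|'#'*) continue ;; esac
        # Stop at the first failed conversion so a bad encoding is not missed.
        iconv -f "$enc" -t UTF-8 "$src_dir/$src" > "$dest_dir/$dst" || return 1
    done
}

# Demo on throwaway data (made-up sample lines, not real fingerprints).
demo_dir=$(mktemp -d)
mkdir -p "$demo_dir/LM"
# "caf<e-acute>" encoded as ISO-8859-1 (octal 351 is byte 0xE9).
printf 'caf\351\t10\n' > "$demo_dir/LM/french.lm"
# "<c-caron>esk<y-acute>" encoded as ISO-8859-2 (bytes 0xE8 and 0xFD).
printf '\350esk\375\t7\n' > "$demo_dir/LM/czech-iso8859_2.lm"

convert_lms "$demo_dir/LM" "$demo_dir/out" <<'EOF'
# encoding    source file          installed name
ISO-8859-1    french.lm            french.lm
ISO-8859-2    czech-iso8859_2.lm   czech.lm
EOF
```

In a spec file the table would list the same encoding and file-name pairs as the individual `iconv` lines, with `$RPM_BUILD_ROOT/%{_datadir}/libtextcat` as the destination. Keeping the pairs in one table makes it easier to spot a suspicious encoding, such as the ISO-8859-1 entries that the spec comment already questions. The demo leaves its temporary directory in place so the output can be inspected.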