Project's presentation |
Oyez !
Are you using Mozilla under Windows 98 ? Vote for Bug 180112 !
AraMorph is a Java port of the homonym product developed in Perl by Tim Buckwalter on behalf of the Linguistic Data Consortium (LDC) which can be downloaded from http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2002L49.
The product includes Java classes for the morphological analysis of arabic text files, whatever their encoding. Three test files are included in the principal encodings used for arabic : UTF-8, ISO-8859-6 and CP1256.
This project also includes some classes which are compatible with Lucene architecture, in order to allow analyzing, indexing and querying documents in arabic.