--- trunk/TODO 2006/03/23 12:28:26 424 +++ trunk/TODO 2006/09/05 21:02:24 628 @@ -11,9 +11,28 @@ + fix nasty bug with repeatable subfields [2.10] + write pure perl Search::HyperEstraier [Search::Estraier is on CPAN] + apply regex on records from input to fix-up character encodings [2.11] -- support arrays for normalize/path and lookup -- add Excel input format ++ add support for KinoSearch search library [2.12] ++ added new set-based normalizer which is pure perl code [2.13] ++ added --stats to report field and subfield usage [2.14] ++ add validator for input data [2.15] ++ add Excel input format [2.16] ++ remove WebPAC::Normalize::XML and promote WebPAC::Normalize::Set to WebPAC::Normalize [2.20] ++ support arrays for normalize/path [2.21] ++ add marc to normalize and create export MARC file [2.22] ++ implement indicators and repetable subfield in marc export [2.23] ++ add WebPAC::Output::MARC [2.24] ++ add config() and id() to WebPAC::Normalize ++ support local (by hostname) config files ++ implement marc_original_order to remap source records to marc [2.25] ++ fix statistics to use original data instead of data after modify_records ++ fix encoding and recoding issues (use UTF-8 as WebPAC native encoding) [2.26] +- modify_records should preserve order (YAML format modification) +- modify_records should match all occurances instead of just last +- fix WebPAC::Output::MARC encoding troubles +- support splitting of config yml to multiple files +- rewrite lookup support to use WebPAC::Normalize - add dBase input format - remove delimiters characters from index and query entered - delete unused files in database directories - scoring for various fields in input/*.xml +- marclint - validate 035$9 as valid