--- trunk/TODO 2006/09/06 19:25:22 636 +++ trunk/TODO 2006/10/08 12:45:50 748 @@ -27,12 +27,18 @@ + fix statistics to use original data instead of data after modify_records + fix encoding and recoding issues (use UTF-8 as WebPAC native encoding) [2.26] + modify_file preserve order of translations in file [2.27] -- modify_records regexpes should match all occurances instead of just last -- fix WebPAC::Output::MARC encoding troubles ++ modify_records regexpes now match just first occurence (repeat to get second...) ++ fix WebPAC::Output::MARC encoding troubles ++ generate reports (validation and stats) for each input ++ rewrite lookup support to use WebPAC::Normalize [2.28] ++ marc_leader shouldn't really be included in hash returned by data_structure +- fix-length fields (<100) support +- add option to specify output marc path in config.yml +- add checks for search directive in normalization to parser - support splitting of config yml to multiple files -- rewrite lookup support to use WebPAC::Normalize - add dBase input format - remove delimiters characters from index and query entered - delete unused files in database directories - scoring for various fields in input/*.xml - marclint - validate 035$9 as valid +- lookup to another input file