use save_row and load_row to share data between lookups and input->fetch, added some timing for loading of lookups which revealed a big performance impact of one debug(dump())