by Joel Reicher <joel@[EMAIL PROTECTED]
>
Mar 23, 2008 at 04:41 AM
subject.fun@[EMAIL PROTECTED]
writes:
> I am working on CSV files and i need to do a search on lagre amount
> csv data files. Currently i am using GAWK for doing the same. But
> still i am not satisfied with the performance.
> Could any body suggest what would be better option for my scenario.
Is the loss of performance in the data processing or in the file
accessing? If the files are large it may take a long time for the
sequential access to get to the desired points, in which case
optimising the data processing won't change a thing; you would instead
need to make yourself an index/lookup method of accessing, either by
splitting the files up into smaller files or by seeking to the right
spot in the file (you can preprocess with dd to do this if you want to
keep using awk).
Cheers,
- Joel