

All columns are always filled (at least in the visible sample set).From looking at the data you can make these observations: The Structured one is much better as the data structure resembles the data from the original document. The Plain one is not useful at all as the data is all over the place, without any clear pattern. You can see that you can export to Plain or Structured. Mainly standard functionality, nothing too "dirty".įirst you need to realise how do the exported data look like. This one will try to use purely text extraction and Regular expressions. You will also be facing challanges with Automation Anywhere, since it does not really provide the right tools to do such a thing and you may need to resort to scripting (VBScripts) or Metabots. You can still achieve what you need, and with good performance, but the code itself will not be pretty. and the main reason for that are the values that contains multiple lines. makes it easy to install a subset of excellent KDE apps in windows.I am afraid that your case will be quite challenging. in a neat and fast enough application with a small footprint. I am really impressed with what Okular can do. and their slogan More Than a Reader certainly fits. So, it depends on how great your needs are, this may be a viable option for even windows. Would this work for windows? Yes, KDE can be installed in windows, but KDE applications come with some decent overhead of other needless software. So, I will try first with tabula (which looks very promising too), and if that fails, then okular it would have to be. I have this need (to extract a few tables from a PDF doc) now on a Centos desktop running gnome/xfce, and installing okular would mean installing a whole bunch of other KDE graphics tools. When you paste it, you get a consistent tab-delimited output that any serious tool can be coaxed into handling as a CSV file. With Okular, you use a Table Selection Tool and define an area, and then click in column borders to mark fields. Okular is awesome in the features it packs, and can certainly address the needs of the guy asking the question.

I have it installed in my ubuntu desktop, and never took it seriously.
