Datex - The Data Extractor
Datex - The Data Extractor
January 12, 2008
The first Marble to be published is what I call the Data Extractor, or Datex for short. Datex is powerful data extraction tool that really puts the power of regular expressions in your hands. Datex supports Perl-style regular expressions and is my tool of choice for extracting data form text files and web pages.
Downloads
Datex for Windows 2000, XP and Vista
Datex for Macintosh (Universal Binary)
Datex for Linux with GTK+ 2.x
All downloads are self-contained zipped executables that can be copied anywhere you want on your PC or that can be run from a USB-stick or external drive without worrying about an installation procedure.
By downloading Datex you agree with the Datex License Agreement.
For questions or support requests, please visit the support forum.
The Data Extractor is a handy tool that makes it easy to extract data from text. It uses regular expressions to match items in your text that you want to use as structured data. Data Extractor can export this data in a format you define, which can then be imported into an Excel spread sheet or an Access database, or can be processed further with other tools.
Regular expressions are a powerful mechanism to grab exactly those text elements you want to export as data. The Data Extractor supports Perl-style regular expressions, which are both extremely powerful as well as easy to use.
Output data is generated in Datex by referring to the matched items in the search pattern using the $1, $2, … back reference parameters. You can also use \t for tab and \n for newline in your output pattern.
Datex supports multi-line as well as case-sensitive matching, and provides for a ‘Greediness’ toggle to globally influence the greediness settings of the regular expression-matching engine.
The regular expression syntax supported by Datex can be found in the ‘Regular Expressions’ help entry.
If you like this application, please consider a donation to keep its development going.