Thursday 27 April 2006 by ClarK
The Flex code is the most functional. Made in C with Flex as a syntaxic analyser, it is really fast.
The necessary packages are (used on my Ubuntu Breezy Badger):
The code can be found on the project hosting savannah, in Download section for tarballs or in CVS section.
— > HERE <—
The command ./tokenize -h will display the help and option message.
It is then possible to write scripts in order to retrieve the internet addresses of the web pages logged thanks to a proxy (or whatever) in order to use them as input of the program. Thanks to the result, it is then possible to add the necessary one into a blacklist.
If you develop such scripts, feel free to send them to me so that I can add them on the website.