Text Tools

ascii2pdf
ascii2pdf is a simple text to PDF converter. It has options for font, font size, and portrait vs. landscape.

bp, a Perl Bibliography Package
A Perl library designed to:

Dan Melamed
Dan has 150 general text processing tools (mostly in Perl 5), and 50 text statistics tools (mostly in Perl 5).

disjoin
disjoin is the opposite of a database "join". It is useful for performing set operations on text database files, and especially for administering passwd files (for instance, when you want to remove some entries from the passwd file without deleting them, storing them in a different file for backup instead). It makes set complement and set difference on this kind of file extremely easy.

epsmerge
epsmerge is a Perl program for merging EPS (Encapsulated PostScript) files into one bigger EPS file. It does this by reading the files and arranging them in rows (or columns), squeezing them together so that they all fit. It can squeeze in several ways: uniformly, preserving aspect ratios, preserving the relative sizes of the images, etc. It can also be used to lay out a single EPS file on a page that can then be sent to a printer. There are options to put labels over or under the images.

findgrep
Findgrep is a utility that greps recursively through the current directory and its subdirectories. It uses regular expressions to specify both the search expression and the file name. It now supports approximate matches. It is written in Perl and runs on Unix and DOS systems (the author hasn't received any information about Macs, but it should run OK on these too).

formatit
Formats ASCII text for the PalmPilot. Tries to make ASCII drawings look right under the Pilot's proportional font.

html2ps
An HTML-to-PostScript converter.

HTML::FromText
HTML::FromText converts plain text to HTML.

Lingua::En::Nameparse
This module takes as input a person's or persons' name in free-format text such as,

Lingua::EN::Nums2Words
This code can potentially generate US English verbiage representing every real value from negative infinity to positive infinity, provided the module's private variables @Classifications and @Categories are filled appropriately. The module generates verbiage based on the thousands system. See http://www.quinion.demon.co.uk/words/numbers.htm for details of the thousands system versus the millions system of linguistically representing large numbers.

List::Permutor
Processes all of the possible permutations of a list of items.

Locations
Locations are magical spots in your output files. "Locations" free you from the need to write output files in rigidly sequential order. They allow you to define insertion points in output files (while you are still writing to them!) which you can fill in later. Think of "Locations" as folders (or drawers) containing papers in a sequential order, most of which contain printable text or data, while some may contain the name of another folder (or drawer). When a location is dumped to a file, the papers contained in it are printed in their stored order. When a paper containing the name of another location is encountered, the contents of that location are processed before the remaining papers of the current location are printed. And so forth, in a recursive descent.

Number::Format
A library for formatting numbers. Functions are provided for converting numbers to strings in a variety of ways, and for converting strings that contain numbers back into numeric form. The output formats may include thousands separators: characters inserted between each group of three digits, counting right to left from the decimal point. The characters used for the decimal point and the thousands separator come from the locale information or can be specified by the user.

PDF
The PDF library provides classes and functions to read and manipulate PDF files with Perl.
PDF stands for Portable Document Format, a format proposed by Adobe. For more details about PDF, refer to: http://www.adobe.com.

pgrep
Perl grep -- Did you ever want a "grep" combined with the power of Perl's regular expressions? Here it is!

QDMerge
QDMerge is a Quick and Dirty Perl script that Merges text from a source document with a template to generate a unique output file that follows a specific design. (Fully Buzzword Compliant)

SDF - Simple Document Format
A freely available documentation system designed and developed by Ian Clatworthy, with help from many others. Based on a simple, readable markup language, SDF generates high quality output in multiple formats, all derived from a single document source. Supported output formats include HTML, PostScript, PDF, man pages, POD, LaTeX, MIF, RTF, Windows help and plain text.

Squeeze.pm
Squeezes English text into the most compact format possible, so that it is barely readable. You should convert all text to lowercase for maximum compression, because the optimisations have been designed mostly for uncapitalised letters. You can use this module, for example, to preprocess text before it is sent to an electronic medium that has a maximum text size limit. Pagers, for instance, have some arbitrary text size limit, say 200 characters, which you want to fill as much as possible. Alternatively, you may have a GSM cellular phone which is capable of receiving Short Messages (SMS), whose text limit is 160 characters.

String::Approx
This Perl module implements approximate matching (aka fuzzy matching). Approximate is defined as n-differences. One difference is an insertion, a deletion, or a substitution of one character, and n is the maximum number of differences allowed. For example, 1-difference means that a match is found if there is one character too many (insertion), one character missing (deletion), or one character changed (substitution).
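
The n-differences idea can be illustrated with a short sketch. This is not String::Approx's implementation or its API; it is a generic dynamic-programming approach (often attributed to Sellers) written in Python, and the function name `approx_match` and its parameters are made up for this illustration. It reports whether any substring of a text lies within a given number of insertions, deletions, or substitutions of a pattern.

```python
def approx_match(pattern: str, text: str, max_diffs: int) -> bool:
    """Return True if some substring of text is within max_diffs edits of pattern.

    Hypothetical helper for illustration only; not part of String::Approx.
    """
    # prev[i] = fewest differences needed to match pattern[:i] against
    # some substring of the text ending at the previous position.
    prev = list(range(len(pattern) + 1))
    if prev[-1] <= max_diffs:
        return True  # the whole pattern may be deleted within the budget
    for ch in text:
        cur = [0]  # a match may start at any position in the text
        for i, pc in enumerate(pattern, start=1):
            cur.append(min(
                prev[i - 1] + (pc != ch),  # substitution (free if characters agree)
                prev[i] + 1,               # one character too many in the text
                cur[i - 1] + 1,            # one character missing from the text
            ))
        if cur[-1] <= max_diffs:
            return True
        prev = cur
    return False
```

With this sketch, `approx_match("perl", "I like pearl jam", 1)` is true because "pearl" has one character too many, while the same call with `max_diffs` of 0 is false because "perl" never occurs exactly.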

Text-Tabs+Wrap
Text::Tabs performs the same job that the Unix expand(1) and unexpand(1) commands do: adding or removing tabs from a document. Text::Wrap::wrap() will reformat lines into paragraphs. All it does is break up long lines; it will not join short lines together. Text::Wrap::fill() will do simple paragraph reformatting.

Text::BibTeX
Text::BibTeX is a Perl library for parsing and processing BibTeX data files; it is paired with the C library btparse, which does most of the low-level work. Together, these two libraries provide a comprehensive interface to BibTeX files, and ultimately aim to reimplement BibTeX in C and Perl without the need for a special-purpose, dedicated programming language.

Text::CSV
Text::CSV provides facilities for the composition and decomposition of comma-separated values. An instance of the Text::CSV class can combine fields into a CSV string and parse a CSV string into fields.

Text::FillIn
This module provides a class for doing fill-in templates. These templates may be used as web pages with dynamic content, e-mail messages with fill-in fields, or whatever other uses you might think of. Text::FillIn provides handy methods for fetching files from the disk, printing a template while interpreting it (also called streaming), and nested fill-in sections (i.e. expressions like [[ $th[[$thing2]]ing1 ]] are legal).

Text::Format
Various subroutines to manipulate text. The wrap routine will wrap under all circumstances, even if the width isn't enough to contain the longest words. Text::Wrap will die under these circumstances, which isn't quite desirable in my opinion. If columns is set to a small number, and words are longer than that plus the leading 'whitespace', then there will be a single word on each line. This lets you make a simple word list which could be indented or right aligned. There is a chance of croaking if you try to subvert the module.

Text::GenderFromName
Guess the gender of a "Christian" first name.
This module provides a lone function: gender(), which returns one of three values: "m" for male, "f" for female, or UNDEF if it doesn't know. For instance, gender("Chris") is UNDEF.

Text::MetaText
Perl extension implementing a meta-language for processing "template" text files. Like a glorified pre-processor, MetaText can: include files, define and substitute variable values, execute conditional actions based on variables, call other Perl functions or object methods and capture the resulting output back into the document, and more. It can format the resulting output of any of these operations in a number of ways. The objects and, inherently, the format and semantics of the MetaText language itself are highly configurable.

Text::Query::Advanced
This module provides an object that matches a string or list of strings against a Boolean query expression similar to an AltaVista "advanced query". Elements of the query expression may be regular expressions or literal text.

Text::Query::Simple
This module provides an object that tests a string or list of strings against a query expression similar to an AltaVista "simple query" and returns a "relevance value." Elements of the query expression may be regular expressions or literal text.

Text::Template
Allows arbitrary Perl code to be embedded into an HTML or text file. Supports safe evaluation. Very fast. Well-developed and stable since 1995.

TextDB
An add-on for Perl 5 that uses the mechanism for tying associative arrays (%-variables) to packages. It lets you access data in a text file whose format you specify through class variables, giving a great deal of flexibility about how the data looks. Modes exist for line data (e.g. /etc/aliases) and column data (e.g. /etc/passwd or /etc/hosts), but the major use is for text files containing records with key/value pairs.
Numerous variables exist for adapting to the format of the data accessed; you can access /etc/termcap or .netrc, for example.

Treesed
Treesed, a Perl program, is a search/replace tool for lists of files. It can search for patterns in a list of files, or even a tree of directories with files. Usage: Treesed searches for pattern1. If pattern2 is supplied, pattern1 is replaced by pattern2. If pattern2 is not supplied, treesed just searches. A list of files can be supplied with the -files parameter. Treesed can also search and replace in files in subdirectories if you supply the -tree parameter, in which case all files in the current directory and its subdirectories are processed. A backup of the original file is always made, with a random numeric suffix. Fairly detailed documentation can be found at http://www.uwo.ca/its/doc/hdi/web/treesed.html.
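
The search-then-optionally-replace behaviour described for Treesed can be sketched roughly as follows. This is an illustration in Python, not Treesed's code: the function names `search_replace` and `tree_files` are hypothetical, the patterns are assumed to be regular expressions, the files are assumed to be plain text, and the five-digit backup suffix is an arbitrary choice standing in for the "random numeric suffix".

```python
import random
import re
import shutil
from pathlib import Path


def search_replace(files, pattern, replacement=None):
    """Search files for pattern; if replacement is given, rewrite matching files.

    Returns the list of paths whose contents matched. Hypothetical helper,
    not Treesed's actual interface.
    """
    regex = re.compile(pattern)
    matched = []
    for path in map(Path, files):
        text = path.read_text()
        if not regex.search(text):
            continue
        matched.append(path)
        if replacement is not None:
            # Keep the original under a random numeric suffix before rewriting.
            backup = path.with_name(f"{path.name}.{random.randint(10000, 99999)}")
            shutil.copy2(path, backup)
            path.write_text(regex.sub(replacement, text))
    return matched


def tree_files(root="."):
    """All regular files under root, including those in subdirectories."""
    return [p for p in Path(root).rglob("*") if p.is_file()]
```

Calling `search_replace(tree_files("."), "pattern1")` only reports matching files, while adding a third argument rewrites them after saving a backup copy next to each original.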