AlchemyCmd
Command line text analysis tool for Linux/Unix
AlchemyCmd? What’s that?
AlchemyCmd is a command-line tool for performing natural language processing and text analysis on Linux/Unix systems.
What Does It Do?
AlchemyCmd enables you to perform named entity extraction, concept tagging, keyword extraction, language detection, and structured content extraction directly from the command line.
This tool can process HTML or text content on your local filesystem, crawl Internet-hosted web pages, and process content from standard input (stdin). This makes it easy to construct shell scripts and UNIX command pipes that leverage natural language processing.
OK, How Do I Use It?
AlchemyCmd provides a wide variety of NLP and content retrieval options. You may see the available command-line options by issuing the following command:
- alchemycmd —help
To perform concept tagging on an Internet-hosted web page, you may issue the following command:
- alchemycmd —mode concept -S web -U “http://www.cnn.com/2010/US/07/13/steinbrenner.obit/index.html?hpt=T1”
The above command would return concept tagging results in a simplified, comma separated value (CSV) format:
New York Yankees,0.956409Major League Baseball,0.62449
George Steinbrenner,0.548511
Billy Martin,0.473525
Yankee Stadium,0.457922
Reggie Jackson,0.423929
Derek Jeter,0.421396
YES Network,0.413723
Output in XML format may also be retrieved using the “—output-mode xml” command-line option.
1 year ago