Feb 17 2011
 

Among the commands available from the Linux terminal is cut, which is very useful for processing strings and text in general.

The cut command is one of the oldest Unix commands, which means it is more than 40 years old, and it shows. It is important to understand that this is a Unix command and behaves in the “Unix way”. For example, it splits each line into fields on a delimiter character (TAB by default, or whatever you pass with -d), much as the shell splits words on IFS (the Internal Field Separator). You can check the shell's IFS with set | grep IFS. You can also set it, for example, to:

IFS=$' \t\n'

The cut command takes a vertical slice of a file, printing only the specified columns or fields. Fields are split on the default delimiter (TAB) or on one specified with -d.
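As a small illustration of that "vertical slice", here is a minimal sketch using cut on a couple of made-up, colon-separated records. The sample data is an assumption for the example only; -d, -f and -c are standard cut options.

```shell
# Made-up sample records, one per line, fields separated by ':'.
sample='alice:x:1000:/home/alice
bob:x:1001:/home/bob'

# -d sets the field delimiter; -f selects fields (numbered from 1).
# Here: keep field 1 (name) and field 4 (home directory).
printf '%s\n' "$sample" | cut -d: -f1,4

# -c selects character positions instead of fields.
# Here: keep the first three characters of every line.
printf '%s\n' "$sample" | cut -c1-3
```

Without -d, cut splits fields on TAB, so the -d: above is what makes the colon-separated input work.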


Feb 17 2011
 

Today I present an excellent and comprehensive article on Nutch, an open source search engine. You can find the original article with the code examples here.

After reading this article, readers should be somewhat familiar with the basic crawling concepts and the core MapReduce jobs in Nutch.

What is a web crawler?

A web crawler is a computer program that discovers and downloads content from the web, usually over the HTTP protocol. The discovery process of a crawler is usually simple and straightforward. A crawler is first given a set of URLs, often called seeds. Next, the crawler downloads the content from those URLs and then extracts hyperlinks or URLs from the downloaded content. This is exactly what happens in the real world when a human uses a web browser and clicks on links from a homepage, and on the pages that follow, one after another.
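The seed, download and extract loop described above can be sketched in a few lines of shell. This is only a toy illustration under stated assumptions, not how Nutch works: the function names fetch, extract_links and crawl are hypothetical, fetch simply wraps curl, and links are found with a naive pattern that only sees absolute href="http(s)://..." attributes.

```shell
# fetch: hypothetical helper that prints the body of a URL (here via curl).
fetch() {
    curl -fsSL --max-time 10 "$1"
}

# extract_links: read HTML on stdin, print absolute http(s) links, one per line.
# Naive on purpose: it only matches href="http://..." or href="https://...".
extract_links() {
    grep -oE 'href="https?://[^"]+"' | cut -d'"' -f2
}

# crawl MAX_PAGES SEED...: breadth-first crawl starting from the seeds.
# Prints each visited URL once and stops after MAX_PAGES pages.
crawl() {
    max_pages=$1
    shift
    queue=$(mktemp)   # URLs waiting to be fetched, one per line
    seen=$(mktemp)    # URLs already visited
    printf '%s\n' "$@" > "$queue"
    fetched=0
    while [ "$fetched" -lt "$max_pages" ] && [ -s "$queue" ]; do
        # Pop the first URL from the queue.
        url=$(head -n 1 "$queue")
        tail -n +2 "$queue" > "$queue.tmp" && mv "$queue.tmp" "$queue"
        [ -n "$url" ] || continue
        # Skip URLs that were already visited.
        grep -qxF -- "$url" "$seen" && continue
        printf '%s\n' "$url" >> "$seen"
        printf '%s\n' "$url"
        fetched=$((fetched + 1))
        # Download the page and append every link it contains to the queue.
        fetch "$url" 2>/dev/null | extract_links >> "$queue"
    done
    rm -f "$queue" "$seen"
}
```

A call such as crawl 20 https://example.org/ would visit at most 20 pages starting from that seed. A real crawler also has to handle relative links, robots.txt, politeness delays and parallel fetching, all of which this sketch leaves out.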


Feb 16 2011
 

Tired of the “usual” screensavers shipped with the major window managers?

Today we will see some alternative programs that can be used on Linux to get new and original effects.

Matrixgl

Matrixgl is a free, open source 3D screensaver based on The Matrix Reloaded. It supports widescreen setups, and can be run on Windows, Mac OSX, Linux, BSD, and many other Unix based operating systems.



Feb 14 2011
 

In a previous article I talked about some programs to take screenshots in Linux, and since then I must say that I have always used Shutter.

But the open source world is beautiful because there are so many different ways to do the same thing, so today I also present Collabshot and Jshot, two programs that were created with the idea of sharing your work with someone else.



Feb 12 2011
 

If you want, you can use only programs that run inside a terminal for the most ordinary things you do every day.

Comfortable?

Probably not, but on some occasions a program that runs from the terminal might help you … and some of them are simply better than their graphical counterparts.
