Tag Content Extractor

It is a nice sunny day, at least when I was having breakfast earlier this morning, in the Twin Cities of Minneapolis and St. Paul. Better yet, it is Friday!!!

Spoke with one of my sons. He and his family had scheduled a holiday and were on the road. When they moved, they build a home. Some years went by and last fall they decided to buy a new one that they liked. Shortly after they moved and put their first home on the market. A few months went by and finally they closed on it yesterday. I am very glad for them. Having two mortgages is not convenient at all. Continue reading “Tag Content Extractor”

More than a List of Words

When indexing text based word frequency / relevance which may be applicable for web searches, one of the procedures used is to create a term frequency (tf) array followed by an inverse document frequency (idf) one. You can read more about this here.

In a previous post I experimented with some text in order to build hashmaps with the words of sentences (to keep things in perspective for a blog post). In that post I used a string that I copied from a course I took some years ago. The sting was already preprocessed. The text had already been stripped off punctuation marks. Continue reading “More than a List of Words”

Regular Expressions

A regular expression is a sequence of characters that define a search pattern. Usually this pattern is then used by string searching algorithms for “find” / “match” or “find and replace” operations on strings. Continue reading “Regular Expressions”