Thread Pool

It is possible to receive a request, create a process or thread, service the request, and return the results of the operation to the caller. Many years ago, creating a process was the default approach. The issue is that creating a process and then destroying it when done are both quite expensive operations. Continue reading “Thread Pool”
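As a rough illustration of the alternative the title points to, here is a minimal sketch using Python's standard `concurrent.futures.ThreadPoolExecutor`. The handler `service_request`, the helper `serve`, and the pool size are hypothetical names and values chosen for this example; they are not taken from the post itself.

```python
from concurrent.futures import ThreadPoolExecutor


def service_request(request_id: int) -> str:
    """Hypothetical request handler; stands in for the real work."""
    return f"request {request_id} done"


def serve(request_ids, max_workers=4):
    # The pool starts at most max_workers threads and reuses them,
    # so no thread is created and destroyed for each request.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map() yields results in the same order as the inputs.
        return list(pool.map(service_request, request_ids))


results = serve(range(5))
```

The caller still gets one result per request, but the cost of thread creation is paid only a few times instead of once per request.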

More than a List of Words

When indexing text based on word frequency / relevance, which may be applicable to web searches, one of the procedures used is to create a term frequency (tf) array followed by an inverse document frequency (idf) one. You can read more about this here.
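The tf and idf steps mentioned above can be sketched roughly as follows. This assumes one common textbook variant (tf as a word's share of the document, idf as the natural log of total documents over documents containing the word); the function names and the three sample sentences are made up for illustration, and dictionaries are used here instead of arrays for readability.

```python
import math
from collections import Counter


def term_frequency(document: str) -> dict:
    """Fraction of the document's words accounted for by each word."""
    words = document.lower().split()
    counts = Counter(words)
    return {word: count / len(words) for word, count in counts.items()}


def inverse_document_frequency(documents: list) -> dict:
    """log(N / n_w), where n_w is the number of documents containing w."""
    total = len(documents)
    containing = Counter()
    for document in documents:
        # set() so a word counts once per document, not once per occurrence.
        containing.update(set(document.lower().split()))
    return {word: math.log(total / n) for word, n in containing.items()}


# Hypothetical sample data, already stripped of punctuation.
docs = ["the cat sat", "the dog barked", "the cat purred"]
idf = inverse_document_frequency(docs)
tf = term_frequency(docs[0])
tfidf = {word: tf[word] * idf[word] for word in tf}
```

With this variant, a word that appears in every document (such as "the" in the sample) gets an idf of zero, so it contributes nothing to relevance, while a word unique to one document scores highest.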

In a previous post I experimented with some text in order to build hashmaps with the words of sentences (to keep things in perspective for a blog post). In that post I used a string that I copied from a course I took some years ago. The string was already preprocessed; the text had already been stripped of punctuation marks. Continue reading “More than a List of Words”

Simple Problems in Python

Last week I was reading a post on Medium “First Steps in Data Science with Python NumPy” by Kshitij Bajracharya.

What caught my attention is his opening statement “I’ve read that the best way to learn something is to blog about it”. I believe Kshitij hit it right on. The reason I agree is that I have long been a believer in “If you can’t explain it simply, you don’t understand it well enough”. This quote is attributed to Albert Einstein. Continue reading “Simple Problems in Python”

Vector Model and Similarity Search

Have you ever wondered how computers search for text and similar images?

For example, if you use Windows, open a File Explorer window. From top to bottom the window has the title bar, the menu bar, and the toolbar. Under the toolbar there are two text fields. The one on the left displays the full path to the current folder / directory. The one on the right displays “Search <current_folder>” e.g., “Algorithms”. I have enabled “Index Properties and File Contents” on my computer. By default when you search, Windows will only search the file names and properties; not the contents of the file. Depending on your usage, you might need to index some or all the files in all folders on your computer. In my case, I perform searches in all types of documents. If you mostly use the Office Suite, you might enable search only on folders holding your *.docx files. The reason for this is that the mechanism uses additional disk and memory to operate. Continue reading “Vector Model and Similarity Search”

Issues while Installing Hadoop on Windows

I am always trying to read about and practice with topics / subjects that I feel like learning or that catch my attention. I am currently taking some specialization courses on Coursera. You can never give up and stop learning. In this post I will describe how I installed Hadoop and the issues I encountered. Continue reading “Issues while Installing Hadoop on Windows”

Crash Course in Python – Part II

It is Friday. Many stressful things have been happening on different fronts over the past few months. Most of them are out of my control. It seems like I need to find a way to relax. Hopefully things will turn out well. I fully understand that stress is an internal thing and I am quite good at controlling it. I used to say that stress is the salt of life; but too much salt is not good for you. Continue reading “Crash Course in Python – Part II”

Crash Course in Python – Part I

As I have mentioned in previous posts, I like to purchase and read computer related technical books. When I receive a book I write my name and the year on the first page. I then locate the date of the last revision and circle it. In 2017 I purchased “Data Science from Scratch” by Joel Grus. I read the first five chapters, which I was interested in at the time, and moved on to the next book. Continue reading “Crash Course in Python – Part I”

Closed my Linkedin Account

This morning I finally had time to close my Linkedin account. I had been thinking about closing the account for a couple of years. In that time I have logged in a few times a year. I have seldom contacted people in my ‘network’. For the people I care to keep in touch with, I use email, my cell phone and text messages. That said, in the past few years I have been thinking about two reasons to close the account. Continue reading “Closed my Linkedin Account”

Castor Framework

My wife and I are going to attend a wedding on July 5th. I am going to wear black dress shoes. A week or so ago while at Nordstrom’s at the MOA I was going to get a pair of shoes for the occasion. My wife mentioned that I have a pair of never worn black shoes still in their box. This morning, my wife gave me the shoes. I decided to wear them at home for a few days in order to make them mold to my feet. It seems like most of the time I wear new shoes I get blisters on my feet. I am the type of person that wears thick socks even when wearing sandals. We will see how things go after the wedding. Continue reading “Castor Framework”

Deep Work

I typically get a couple dozen technical books for each non-technical one. I purchased “Deep Work” by Cal Newport via Amazon. I saw it mentioned while reading an article on Medium. It took me less than a week to read the book. The subject is very well known to me. I will explain why that is the case in the following paragraphs.

I was born in Lima, Peru. My parents left Europe in the 1930s in search of a better life. I had three siblings. One of the twins graduated from college as an international attorney and passed away months after. My other two sisters are physicians. One continued her studies in the USA and Europe. Today she is a professor at an Ivy League school, spending time in a lab in China at a well-known university. I came to this country over 40 years ago with a scholarship to attend an Ivy League school in New York.

Continue reading “Deep Work”