So after my last trip to Lisbon, Portugal, work resumed and tests continued on the virtual machines system. Different tests were done testing mainly Weka-Grid. It was frustrating seeing how this tools would process the requested task. In part it truly parallelize the model building (at least it claims it does based on documentation), plus it also parallelizes the Cross-validation tasks. However, its performance in regards to time taken to complete the task was really not impressive at all. It has a huge overhead before it presents the results, the actual times of building the models are ok but from the moment once sends the command to execute the task to the moment it presents the result it can take from 1 to 3 minutes. This compared to the same task executed locally with regular Weka is very disappointing. In any case even when executing a task locally with Weka-grid it would take a long time to present the results and in many cases it just won;t presetn a result and freeze (maybe was the network system between the virtual machines but could not find any particual cause for this behavior). Weka-grid actually evaluates if the servers have enough memory available then test the connections and rank each server, the first three process the data mining task, and the Cross-validation is executed on how many servers/host one indicates. Besides this fact, another annoying occurrence with Weka-grid was that when performing the RandomForest algorithm tasks, Weka-grid would never complete its initial testing and ranking of servers. No matter how many times I tried it could never display results either on parallel or localhost mode. In any case many insightful results were obtained and the results still have to be evaluated properly. In another area, a little meeting with the collaborators at UPC was scheduled to present some of the insights and results of the experiments. To this meeting Dr. Ricard Gavalda, Dr. Jordi Guitart, Ferran Julia, and Íñigo Goiri were present. I thank them endlessly for their help especially to Ferran and Íñigo since they both are in charge of the virtual machines I use for the experiments ( which continusly ran into issues and questions). We had a picture session but I can't yet post them (but will in the next day or so).


After that meeting more work was done this time on Weka-parallel. I could only implement one of the three algorithms planned due to the fact that Weka-parallel was implemented over an old version of Weka that did not had RandomForest or MultilayerPerceptron implementations. Regardless, Weka-parallel only parallelized the cross-validation portion of the data mining task. I ran with some issues trying to execute it from the command line which made me use the GUI explorer interface of it. but overall the results were gathered. Doing these tests I have learned a lot not just about data mining but also about the virtual systems and UNIX. Also about the many requirements that tools like Weka4ws need to execute properly.
In any case the International PIRE experience is coming to an end but at the same time is the opening door to my ongoing desire to learn and give to the research community new tools and developments on the parallelization of data mining tools. This task I feel is way bigger than what I am capable of doing at the moment but it has open my eyes to appreciate each and every one of the experiences and knowledge I have and knowledge I need to acquire to accomplish my final goal of creating a plug-in parallelization tool for Weka and the data mining task. For the moment the PIRE experience is the start of a long journey into my research tasks. I just hope now that I am going back to Florida, I can manage my working life with my research life.
Dr. Gavalda and myself, last day in Barcelona:

Moving on into details about living and leaving Barcelona, Spain, last two days I was trying to cancel he HSI service I contracted, and well there was a problem with the last payment. They charge more than what I was expecting and the bank didn’t have enough funds (I opened a bank account here in Barcelona so I could order the HSI service). In any case I deposited enough to cover and I saw today they charged it so this is good since I won’t be leaving debts, at least not a big one. I say at least because they will cancel the service in 10 days so the service will be charged, but no matter how much I fought on the phone with the company they won’t cancel sooner…so I guess there is going to be a balance to pay and they do not accept over the phone or via internet payments. So I I will do a direct deposit from the U.S. as I see possible. But then maybe I don’t need too…I have to wait. In another area of the living experiences in Barcelona, I went souvenir hunting and I found out that one can ask to buy something for less than what is advertised…Yes, one has to kind of fight with the seller and try to convince him to deduct 1 to 5 Euros of an item but is doable, so a tip on saving money on souvenirs is to haggle. Now is a matter of catching some sleep before we go to the airport to take the flight back home.
But before that I just want to thank each and every one of the people that have made this research experience possible including the NSF itself and all the people at FIU and FAU and also the people at UPC and the BSC that in one or another way have collaborated to the PIRE program. I enjoyed knowing people in the field I am interested to develop my study and work, as well as acquiring knowledge and experience in new fields before unknown to me. I am glad I took this opportunity to get to know what I am capable of doing by myself with some help. The living abroad experience was the best I have had but after all I miss Florida and all the big advantages we can enjoy in the United States of America.
































































































































































