Deep Learning Specialization and 42 Silicon Valley

10 Mar 2018


Update 0705/2020: Updated the comparison of style transfer using BeerSlider.


Welcome again!

I am (slowly) growing accustomed to the idea of having a blog now, but for now I can share my latest accomplishment: I have finally obtained the Deep Learning Specialization from deeplearning.ai and Coursera! I do not regret one bit obtaining this specialization, as many of the algorithms in Deep Learning that are used in the A3C algorithm used in my internship at the Computer Vision Center were eluding me, specifically CNNs and LSTMs. I will expand more what I accomplished in this internship (and henceforth) in a future blog post.

The DL Specialization also made my mind wander off into applications of new and unexplored algorithms I hadn’t yet tried before, especially as some programming exercised involved us toying with generating new Jazz music with LSTMs, recognizing objects with CNNs, and neural style transfer again with CNNs. With the latter, I chose to generate the following ‘classical’ example of using a known landmark or real-world image, and applying the artistic style of a renowned piece of art. For this, I chose the Louvre as the real-world photograph (Content) and a sample of a huipil from the area of Chichicastenango in Guatemala as art (Style). The result (Generated Image) is shown here:

Louvre
Louvre wth the style of huipil

This opens up another area of applications for CNNs, as not only recognizing from which area in Guatemala (or even Mesoamerica) does a huipil come from is an interesting problem to solve (as hidden patterns may emerge), but also one of deep importance to the Mayan community. This is due to the movement for intellectual property of the designs is gaining momentum and rightly so, as checking whether or not some products being sold online have paid a fair price, or even paid at all for the design is a problem worth solving.

The Jazz improvisation using LSTMs coding exercise on the Deep Learning Specialization was extremely fun, and I wish to use them to not only generate dialogue of my favorite books (though this will require more learning), but also on basically doing the same idea to generate marimba music, as it is the national instrument from my country and I feel it to be a very expressive instrument, with many adaptations outside of the classical marimba music that I am used to.

Lastly, the specialization showed me that chatbots don’t always have to belong to a corporation website/social media site, but they can also be extremely useful in either improving mental healthcare of its users or solving conflicts between two parties. Indeed, Greg Brockman’s blog on how his path to OpenAI has cemented on me exploring not only chatbots, but NLP as well. This will be another project I wish to explore, and soon if possible.

42 Silicon Valley and beyond

I won’t be updating this as much as I would like as I have briefly mentioned it in the first blog post, starting on Monday March 12th, I will be attending 42’s Piscine! The Piscine is a 1-month intensive programming bootcamp where 7 days a week, roughly 20 hours a day, we will be working towards being accepted to 42’s full program. My understanding of the Piscine is that we will be learning both the programming language C and how to program, which is great since my background in Physics and Mathematics simply involved using well-known packages or very high-level programming languages like Python, never fully understanding the background.

I hope to come back with more news, but I will be focusing more on the Piscine, and will of course write in another blog post my experiences there.

See you soon!