All posts
#codex
#obsidian
#jekyll
#writing
I like writing. More precisely, I feel like I need to write regularly. I usually organize my thoughts in a diary, and when I learn something techni...
#figma
#design-system
If someone asked me what feels most difficult when building websites or apps, the first thing that comes to mind is design. In the past, I might ha...
#aws
#text2sql
#ontology
#cdk
Through several company examples and sessions, I was able to see what kinds of synergy can emerge when development with AI is combined with AWS. In...
#spark
#scheduling
#data-locality
#streaming
This is the third post in my notes on Spark in Action by Petar Zečević and Marko Bonaći. In this post, I will summarize the runtime components and ...
#spark
#partitioning
#shuffle
#rdd
This is the second post in my notes on Spark in Action by Petar Zečević and Marko Bonaći. In the first post, I looked at Spark’s basic execution fl...
#spark
#hadoop
#mapreduce
#rdd
I am going to record my notes from reading Spark in Action by Petar Zečević and Marko Bonaći in three parts. In this first post, I will start with ...
#search-algorithm
#learning-to-rank
#information-retrieval
There are countless documents on the web, and we can now search for almost any information that exists in the world. That makes a different questio...
#tf-idf
#bm25
#word2vec
#rnn
#transformer
Natural language processing (NLP) is a field that represents text as numbers, learns relationships among those numbers, and produces meaningful res...
#paper-review
#self-attention
#speaker-recognition
This post is a review of the paper "Self-attention encoding and pooling for speaker recognition". Overview: Not every frame in utterance data is...