Stealing pages from the server...

I train machine to train model.
03
16
15
Visualise Crypto and Twitter with SQL and FastAPI Visualise Crypto and Twitter with SQL and FastAPI
In this article, first, I'll populate crypto database using Python and SQL. I retrieve the list of crypto coin assets, verify the data, and tackle any errors I encounter along the way. Second, I'll talk about how to keep the database up to date with the latest prices, and retrieve daily data from the yahoo finance API. Third, a database for twitter data will be built as well. Finally, I'll set up to build a web UI using FastAPI.
2021-03-15
15
Triple Barrier Method for ML Triple Barrier Method for ML
Time series prediction has been widely applied to the finance industry in applications such as stock market price and commodity price forecasting. Machine learning methods have been widely used in financial time series prediction in recent years. How to label financial time series data to determine the prediction accuracy of machine learning models and subsequently determine final investment returns is a hot topic.
2021-03-15
15
Eigenvectors from Eigenvalues Eigenvectors from Eigenvalues
This article is about implementing "Eigenvectors from eigenvalues" of Terence Tao's paper using Python and R. It's a amazing work and mathematics contribution from Terence Tao. It is an elegant non-evident result, which makes me so excited about it!
2021-03-15
14
13
Principal Component Analysis Derivation Principal Component Analysis Derivation
Principal Component Analysis (PCA) is an important technique to understand in the fields of statistics and data science. It is a process of computing the principal components and utilising then to perform a change of basis on the data. For the purpose of visualisation, it is very hard to visulaise and understand the data in high dimensions, this is where PCA comes to the rescue.
2021-03-13
12
P-Value Easy Explanation P-Value Easy Explanation
In Data Science interviews, one of the frequently asked questions is 'What is P-Value?'. It's hard to grasp the concept behind p-value. To understand p-value, you need to understand some background and context behind it.
2021-03-12
02
Train Word Embedding Vectors on Custom Corpus Train Word Embedding Vectors on Custom Corpus
When I was doing my dissertation project, I found out that the performance of model wasn't quite well. I believe it's because the domain of pre-trained GoogleNews-vectors-negative300 is different from the the dataset of mine. Hence, I decide to pre-train a word2vec model by myself.
2021-03-02
02
07
Twitter Hate Speech Detection Twitter Hate Speech Detection
The objective of this task is to detect hate speech in tweets. For the sake of simplicity, let's say a tweet contains hate speech if it has a racist or sexist sentiment associated with it. So, the task is to classify racist or sexist tweets from other tweets.
2021-02-07
06
Prevent Colab from Disconnecting Prevent Colab from Disconnecting
Google Colab notebooks have an idle timeout of 90 minutes and absolute timeout of 12 hours. This means, if user does not interact with his Google Colab notebook for more than 90 minutes, its instance is automatically terminated. Also, maximum lifetime of a Colab instance is 12 hours.
2021-02-06
06
SSH into Google Colab SSH into Google Colab
Sometimes the code I write may not work in my local machine, however, it works in Google Colab. So, I wanna connect to Google Colab terminal using SSH.
2021-02-06
05
01
29
28
Laplace Expansion and Chiò Condensation Laplace Expansion and Chiò Condensation
Determinants are mathematical objects which have applications in engineering mathematics. For example, they can be used in the solution of simultaneous equations, and to evaluate vector products. Determinants can also be used to see if a system of $n$ linear equations in $n$ variables has a unique solution. There are several ways to calculate determinant, however, today I'm going to introduce another way of computing determinants Chio Identity.
2021-01-28
27
26
25
GitHub Pages from Zero to Hero GitHub Pages from Zero to Hero
The ability to classify music in an automated manner has become increasingly more important with the advent of musical streaming services allowing greater access to music. Spotify alone hit 100 million users in 2016, with other services provided by companies such as Apple, Soundcloud and YouTube. In addition, there are huge numbers of professional musicians, approximately 53,000 in the USA alone, as well as amateurs who are producing music which needs to be classified. With this quantity of music, it is unfeasible to classify genres without an automated method.
2021-01-25
25
Weighted Word Embedding Weighted Word Embedding
Today I'm going to summarise some important point about weighted word embedding for some specific NLP tasks. Frankly speaking, this is the topic I wish to write about a few months ago, however, I was so busy during my MSc.
2021-01-25
4 / 5