Stealing pages from the server...

I train machine to train model.
03
31
Dissertation Paraphraser Dissertation Paraphraser
Paraphrasing and summarizing are vital so that you essay doesn't become one long quote of other academics' work. To paraphrase a piece of text is to write it in your own words. In this article, I will show you how I make an app that will help me rephrase the sentence I need.
2021-03-31
26
Interview Preparation for IBM Interview Preparation for IBM
Come up with a list of topics that I should research and understand to prepare for the first IBM interview. This interview will likely be more behavioral as well as testing to see how much I know about IBM. 
2021-03-26
24
23
22
Text Representation for Unstructured Data Text Representation for Unstructured Data
Text is a very important unstructured data, and how to represent text data has been an important research direction in the field of machine learning. In this article, I will only discuss the very basic methods, such as Bag of Words, TF-IDF (Term Frequency Inverse Document Frequency), Topic Model, and Word Embedding.
2021-03-22
22
Notes on Feature Engineering Notes on Feature Engineering
Without sufficient data and suitable features, the most powerful model structure cannot get satisfactory output. As a classic saying goes, "Garbage in, garbage out." For a machine learning problem, the data and features often determine the upper limit of the results, while the selection of models, algorithms and optimization are gradually approaching this upper limit.
2021-03-22
20
What is Fundamental Analysis and Technical Analysis What is Fundamental Analysis and Technical Analysis
The dream of early retirement. Most people are not able to do so. Therefore, we should do a good job of financial planning while we are young. It is not impossible to make a lot of profit from investments. In essence, there are two methods to do investment, fundamental analysis and technical analysis. Fundamental analysis considers all the factors that affect a company's financial and profitability factors. Technical analysis side is to look at the line chart, indicators, etc.
2021-03-20
20
How to Apply for Coursera Financial Aid How to Apply for Coursera Financial Aid
Coursera is the global online learning platform that offers anyone, anywhere access to online courses and degrees from world-class universities and companies. If you can’t afford to pay for a Certificate, you can apply for Financial Aid or a Scholarship through the link on the course home page. Learners with Financial Aid or Scholarships in a course will be able to access all of the course content and complete all work required to earn a Course Certificate. Financial Aid and Scholarships only apply to the course that the application was approved for.
2021-03-20
19
Python Sudoku Solver Python Sudoku Solver
Sudoku is a logic-based, combinatorial number-placement puzzle. In classic sudoku, the objective is to fill a 9×9 grid with digits so that each column, each row, and each of the nine 3×3 subgrids that compose the grid (also called "boxes", "blocks", or "regions") contain all of the digits from 1 to 9. The puzzle setter provides a partially completed grid, which for a well-posed puzzle has a single solution.
2021-03-19
17
A Proof that e is Irrational A Proof that e is Irrational
In this article, I'll try and show that e, sometimes called Euler's number, is an irrational number 2.718281828459045.... Euler's number is a fantastic number, and it plays a role in just about every aspect of physics, maths, and statistics. There are many ways of calculating the value of e, but none of them ever give a totally exact answer, because e is irrational and its digits go on forever without repeating.
2021-03-17
16
15
Visualise Crypto and Twitter with SQL and FastAPI Visualise Crypto and Twitter with SQL and FastAPI
In this article, first, I'll populate crypto database using Python and SQL. I retrieve the list of crypto coin assets, verify the data, and tackle any errors I encounter along the way. Second, I'll talk about how to keep the database up to date with the latest prices, and retrieve daily data from the yahoo finance API. Third, a database for twitter data will be built as well. Finally, I'll set up to build a web UI using FastAPI.
2021-03-15
15
Triple Barrier Method for ML Triple Barrier Method for ML
Time series prediction has been widely applied to the finance industry in applications such as stock market price and commodity price forecasting. Machine learning methods have been widely used in financial time series prediction in recent years. How to label financial time series data to determine the prediction accuracy of machine learning models and subsequently determine final investment returns is a hot topic.
2021-03-15
15
Eigenvectors from Eigenvalues Eigenvectors from Eigenvalues
This article is about implementing "Eigenvectors from eigenvalues" of Terence Tao's paper using Python and R. It's a amazing work and mathematics contribution from Terence Tao. It is an elegant non-evident result, which makes me so excited about it!
2021-03-15
14
13
Principal Component Analysis Derivation Principal Component Analysis Derivation
Principal Component Analysis (PCA) is an important technique to understand in the fields of statistics and data science. It is a process of computing the principal components and utilising then to perform a change of basis on the data. For the purpose of visualisation, it is very hard to visulaise and understand the data in high dimensions, this is where PCA comes to the rescue.
2021-03-13
12
P-Value Easy Explanation P-Value Easy Explanation
In Data Science interviews, one of the frequently asked questions is 'What is P-Value?'. It's hard to grasp the concept behind p-value. To understand p-value, you need to understand some background and context behind it.
2021-03-12
02
Train Word Embedding Vectors on Custom Corpus Train Word Embedding Vectors on Custom Corpus
When I was doing my dissertation project, I found out that the performance of model wasn't quite well. I believe it's because the domain of pre-trained GoogleNews-vectors-negative300 is different from the the dataset of mine. Hence, I decide to pre-train a word2vec model by myself.
2021-03-02