1. Alpha design

Price action is a response to some event in the world, and that event is reflected in the data. If the data never changes, there is no alpha; it is the changes in the data that carry information. A change in information should therefore produce a change in the alpha.

1.1 QUALITY OF AN ALPHA

An alpha is considered to be of good quality when:

The idea and expression are simple.
The expression/code is elegant.
It has a good in-sample Sharpe ratio.
It is not sensitive to small changes in data and parameters.
It works in multiple universes.
It works in different regions.
Its profit has recently hit a new high.

1.2 ALGORITHM FOR FINDING ALPHAS

Repeat the steps below forever:

1. Look at the variables in the data.
2. Get an idea of the change you want to model.
3. Come up with a mathematical expression that translates this change into stock positions.
4. Test the expression.
5. If the result is favorable, submit the alpha.
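As a minimal sketch of steps 2 to 4, assume a hypothetical reversal idea: a stock that rose over the last few days is expected to fall back, and vice versa. The expression below turns the recent price change into a raw alpha value. The function name, the tickers, the prices, and the 5-day lookback are all illustrative assumptions, not something specified in these notes.

```python
def reversion_alpha(closes, lookback=5):
    """Hypothetical alpha expression: bet against the recent price change.

    closes maps ticker -> list of daily closing prices, oldest first.
    Returns ticker -> raw alpha (positive means long, negative means short).
    """
    alpha = {}
    for ticker, series in closes.items():
        if len(series) <= lookback:
            continue  # not enough history to measure the change
        change = series[-1] / series[-1 - lookback] - 1.0
        alpha[ticker] = -change  # a rise in price produces a negative alpha
    return alpha


# Made-up prices for two made-up tickers.
closes = {
    "AAA": [100, 101, 102, 103, 104, 110],  # rose over the last 5 days
    "BBB": [50, 50, 49, 48, 47, 45],        # fell over the last 5 days
}
alpha = reversion_alpha(closes)
```

Here the alpha only changes when the price data changes, which matches the idea that changes in the data carry the information.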

1.3 Categories:

Alphas can be grouped by when they use the information and by how often they generate predictions:

Intraday alphas: rebalanced during the trading hours of the day. They can be further grouped as follows:
a. Rebalanced at fixed intervals, e.g. every 1, 5, or 15 minutes.
b. Rebalanced when triggered by events such as ticks, orders, fills, or other predefined events.

Daily alphas: rebalanced every day. These alphas can be broken into further subgroups by the time at which the information is used:
a. Delay N: use data from N days ago.
b. Delay 0 snapshot: use data available before a certain snapshot time.
c. MOO/MOC: trade in the market-on-open or market-on-close auction session.

Weekly/monthly alphas: rebalanced every week or month.
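The difference between delay settings and rebalance frequencies can be sketched on a plain list of daily signal values. This is an illustrative sketch only: the helper names are hypothetical, and a 5-day period is assumed as a stand-in for "weekly".

```python
def delayed_signal(signal_by_day, delay):
    """Shift a daily signal so that day t only sees the value from day t - delay.

    The first `delay` days have no usable value and are filled with None.
    """
    if delay < 0:
        raise ValueError("delay must be non-negative")
    n = len(signal_by_day)
    return [None] * min(delay, n) + signal_by_day[: max(n - delay, 0)]


def periodic_rebalance(signal_by_day, period=5):
    """Update the held value only every `period` days (e.g. 5 for weekly)."""
    held, current = [], None
    for i, value in enumerate(signal_by_day):
        if i % period == 0:
            current = value  # rebalance day: pick up the new signal
        held.append(current)  # otherwise keep holding the old one
    return held


signal = [1, 2, 3, 4, 5, 6, 7]
delay_1 = delayed_signal(signal, 1)      # [None, 1, 2, 3, 4, 5, 6]
weekly = periodic_rebalance(signal, 5)   # [1, 1, 1, 1, 1, 6, 6]
```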

1.4 DEVELOPMENT OF AN ALPHA

An alpha is developed from public information. The more efficiently that information is processed, the better the alpha can perform. One can find alphas either by sourcing public information or by building specific models to process it. Alphas can be generated by searching for signals and patterns in these information spaces. Typical sources are as follows:

Price/volume. We can use technical analysis or prediction/regression models based on the price/volume.
Fundamentals. By analyzing the fundamentals of each company automatically, one can build fundamental alphas. Such alphas typically have very low turnover.

Macro data, such as gross domestic product numbers and employment rates. Such numbers have big impacts on the financial markets.

Text, such as Federal Open Market Committee minutes, company filings, papers, journals, news, or even information in publicly available social media. It's necessary to quantify the text into numbers (eventually number of shares to buy/sell). Text data includes both current and future events.

Multimedia such as videos/audios can also be used as information sources. The techniques to process video/audio are pretty mature. For example, one can use speech-to-text techniques to extract text from the video/audio and then build models on that text.

Sometimes alphas are not derived directly from models of the information. This information may instead be used to improve the performance of alphas or to help generate them. Some examples are listed below:

Risk factor models: by controlling or eliminating risk exposure to some factors, one can improve the alpha's performance.

Relationship models: instruments are typically correlated with each other to some extent, and some lead or lag others, which creates opportunities for arbitrage.

Microstructure models to improve the execution performance of real trading.

1.5 FUTURE PERFORMANCE

All of the measurements in the preceding section are intended to compare two alphas where we have no additional information other than their actual predictions. However, additional information, such as how the alpha was constructed, can yield useful information in determining whether the alpha will make good predictions going forward. Ultimately, what is important is whether the alpha makes reliable future predictions, not historical predictions.

Comparison of in-sample to out-of-sample performance is useful not only on the alpha level but also in aggregate across all alphas of a given designer, or on groups of alphas from a given designer.
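One way to make this comparison concrete is to compute the same performance measure on both halves of the history. The sketch below uses an information ratio defined as mean daily PnL divided by its standard deviation, optionally annualized. That definition, the annualization factor, the function names, and the PnL numbers are assumptions for illustration; some texts leave the IR unannualized.

```python
import math
import statistics


def information_ratio(daily_pnl, periods_per_year=252):
    """Mean / sample stdev of daily PnL, scaled by sqrt(periods_per_year).

    Pass periods_per_year=1 for the unannualized ratio.
    Needs at least two observations.
    """
    sd = statistics.stdev(daily_pnl)
    if sd == 0:
        return float("nan")  # ratio is undefined for a constant series
    return statistics.mean(daily_pnl) / sd * math.sqrt(periods_per_year)


def in_and_out_of_sample_ir(daily_pnl, split_index, periods_per_year=252):
    """IR before and after split_index (the end of the in-sample period)."""
    return (
        information_ratio(daily_pnl[:split_index], periods_per_year),
        information_ratio(daily_pnl[split_index:], periods_per_year),
    )


# Made-up PnL: profitable in-sample, flat out-of-sample.
pnl = [2.0, 2.0, -1.0, 1.0, 1.0, -1.0, 1.0, -1.0]
is_ir, oos_ir = in_and_out_of_sample_ir(pnl, split_index=4)
```

Averaging the gap between the two numbers over all alphas of one designer gives the aggregate view described above.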

1.6 How to develop an alpha

STEP 5 → FINAL ROBUST ALPHA
The final alpha is a combination derived from both pieces of information. This changes the old values of +2.5 and +7.5 to –5.0 and +5.0 for Google and Apple, respectively.

STEP 6 → TRANSLATE INTO POSITIONS IN A FINANCIAL INSTRUMENT
To get the final positions, we can apply the following formula:

Final_alpha_stock = (alpha_stock / sum_of_absolute_alphas_of_all_stocks) * booksize

The denominator has to be the sum of the absolute alpha values, because the signed alphas here sum to zero. So, if we have $10M, we'll go long $5M on Apple and short $5M on Google.

STEP 7 → CHECK FOR ROBUSTNESS
These are indicators for robustness:

High in-sample information ratio (IR)
Good out-of-sample IR
Works well across the trading universe
Less fitting
Intuitive/interesting/simple idea
Works in multiple regions
Small drawdowns
Short drawdown periods
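The position formula in Step 6 can be sketched as follows. One assumption is made explicit here: alphas are normalized by the sum of their absolute values, since the signed values –5.0 and +5.0 sum to zero and could not be used as a divisor. The function name is hypothetical.

```python
def scale_to_book(alphas, booksize):
    """Convert raw alpha values into dollar positions.

    Each alpha is divided by the sum of absolute alphas (an assumption,
    see the text above) and multiplied by the book size, so the absolute
    positions add up to the book size.
    """
    gross = sum(abs(a) for a in alphas.values())
    if gross == 0:
        return {name: 0.0 for name in alphas}  # no signal, no positions
    return {name: a / gross * booksize for name, a in alphas.items()}


positions = scale_to_book({"Google": -5.0, "Apple": 5.0}, 10_000_000)
# Google: -5,000,000 (short), Apple: +5,000,000 (long)
```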

1.7 How to develop an alpha II

load data
predict stock return by translating data into trading signals
1. mean reversion
2. lead-lag effect
3. momentum
4. analyst rating information
5. news sentiment
aggregate the different predictions: adjust the alphas, make them industry neutral and market neutral, and build a long-short basket.
in sample test
out of sample test
rank the factors
optimize the portfolio to get diversified weights and adjust risk exposure.
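The neutralization part of the aggregation step can be sketched as subtracting a group mean from each alpha value. This is a simple demeaning approach chosen for illustration; the function name, tickers, and industry labels are hypothetical, and other neutralization schemes (for example, regression against risk factors) are possible.

```python
from collections import defaultdict


def neutralize(alphas, groups=None):
    """Subtract the group mean from each alpha value.

    With groups=None every stock is in one group, which gives a
    market-neutral alpha. With a ticker -> industry mapping the result
    is industry neutral: values sum to zero within each industry.
    """
    buckets = defaultdict(list)
    for name in alphas:
        key = groups[name] if groups is not None else "market"
        buckets[key].append(name)
    neutral = {}
    for members in buckets.values():
        mean = sum(alphas[m] for m in members) / len(members)
        for m in members:
            neutral[m] = alphas[m] - mean
    return neutral


alphas = {"A": 3.0, "B": 1.0, "C": -1.0, "D": -5.0}
industries = {"A": "tech", "B": "tech", "C": "energy", "D": "energy"}

market_neutral = neutralize(alphas)
industry_neutral = neutralize(alphas, industries)
```

After neutralization, positive values form the long side of the basket and negative values the short side.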

1.8 Fundamental analysis

financial statement and empirical accounting research

1.8.1 fundamental statements

balance sheet
income statement
cash flow

1.9 Improving the robustness of alphas

properties of robust alpha:

invariance under modification of the traded universe, i.e. performance that does not depend on the choice of trading universe.
robustness to extreme market conditions.

1.9.1 ordering methods (scoring methods)

ranking
quantiles approximation
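A minimal sketch of both ordering methods, assuming that "ranking" means mapping values to evenly spaced ranks in [0, 1] and that "quantiles approximation" means replacing each value by the index of its quantile bucket. Both readings, and the function names, are assumptions.

```python
def rank_scores(values):
    """Map values to ranks in [0, 1]; tied values share the average rank."""
    n = len(values)
    if n == 1:
        return [0.5]
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # extend j over the run of values tied with position i
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_position = (i + j) / 2.0
        for k in range(i, j + 1):
            ranks[order[k]] = average_position / (n - 1)
        i = j + 1
    return ranks


def quantile_buckets(values, n_buckets=5):
    """Replace each value by its bucket index, 0 .. n_buckets - 1."""
    return [min(int(r * n_buckets), n_buckets - 1) for r in rank_scores(values)]


raw = [10, 1000, -3, 10]            # 1000 is an outlier
ranked = rank_scores(raw)           # [0.5, 1.0, 0.0, 0.5]
halves = quantile_buckets(raw, 2)   # [1, 1, 0, 1]
```

Only the order of the values survives, so the outlier ends up no further from its neighbor than any other adjacent pair.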

1.9.2 approximation to normal distribution

Z-scoring
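Z-scoring can be sketched as subtracting the cross-sectional mean and dividing by the standard deviation. Using the population standard deviation here is an assumption; the sample version works equally well for this purpose.

```python
import statistics


def zscore(values):
    """Center values on zero and scale them to unit standard deviation."""
    mean = statistics.mean(values)
    sd = statistics.pstdev(values)  # population stdev (an assumption)
    if sd == 0:
        return [0.0 for _ in values]  # all values equal: no signal
    return [(v - mean) / sd for v in values]


scores = zscore([1.0, 2.0, 3.0])
```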

1.9.3 limiting methods

truncation
winsorizing
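Both limiting methods can be sketched as clipping. The definitions below are assumptions, since the notes do not define the terms: winsorizing is taken to mean clipping values to fixed lower and upper limits, and truncation is taken to mean capping each weight at a maximum share of the gross book. Some texts instead use "truncation" for dropping extreme observations entirely.

```python
def winsorize(values, lower, upper):
    """Clip every value into the range [lower, upper]."""
    return [min(max(v, lower), upper) for v in values]


def truncate_weights(weights, max_share):
    """Cap each weight's magnitude at max_share of the gross book.

    The gross book is the sum of absolute weights before capping.
    """
    gross = sum(abs(w) for w in weights)
    cap = max_share * gross
    return [max(-cap, min(cap, w)) for w in weights]


clipped = winsorize([-10, -1, 0, 1, 50], -3, 3)    # [-3, -1, 0, 1, 3]
capped = truncate_weights([6.0, 3.0, -1.0], 0.5)   # [5.0, 3.0, -1.0]
```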