
系統識別號 
U00021502201714571000 
中文論文名稱

具群組偵測之有效率的群組股票組合最佳化演算法 
英文論文名稱

Efficient Group Stock Portfolio Optimization Algorithms with Natural Group Detection 
校院名稱 
淡江大學 
系所名稱(中) 
資訊工程學系全英語碩士班 
系所名稱(英) 
Master’s Program, Department of Computer Science and Information Engineering (Englishtaught program 
學年度 
105 
學期 
1 
出版年 
106 
研究生中文姓名 
喬納森 
研究生英文姓名 
Jonathan Coupe 
學號 
603780106 
學位類別 
碩士 
語文別 
英文 
口試日期 
20170112 
論文頁數 
109頁 
口試委員 
指導教授陳俊豪 委員陳朝鈞 委員洪智傑

中文關鍵字 
分群效度指標
多樣群組股票投資組合
群組遺傳演算法
分組問題
組合最佳化

英文關鍵字 
Cluster validation indices
diverse group stock portfolio
grouping genetic algorithm
grouping problem
portfolio optimization

學科別分類 
學科別＞應用科學＞資訊工程

中文摘要 
由於金融市場的多變性，投資組合最佳化至今仍是相當吸引人的研究主題。過去幾十年間，許多不同的演化式演算法針對不同的投資組合亦不斷的被提出，其中一種就是多樣群組投資組合。然而，本研究發現現存的多樣群組投資組合最佳化技術仍有三個問題待解決，分別是：如何設定合適的群組數、演化過程耗時、投資組合風險差異過高。故為解決這些問題，本論文使用群組遺傳演算法提出兩個多樣群組股票投資組合最佳化方法。
第一個方法主要用於解決前兩個問題。針對設定合適的群組數問題，所提的方法首先透過股東權益報酬率(ROE)與本益比(P/E)兩屬性將股票分群後，之後設計分群效度因子(cluster validation factor)並將之當成適合度函數的一部份，使演算法可以自動搜尋較佳之分群結果。為解決演化過程耗時問題，在方法一則設計暫存染色體(Temporary chromosome)透過降低需要評估的組合數使演化過程得以加速。
第二個方法則著手解決投資組合風險差異過高問題。首先，方法中設計風險比例因子(Risk ratio factor)計算多樣群組股票投資組合可產生之最大組合風險。接著，所提的演算法結合自我調適之交配(Adaptive crossover)、突變(Adaptive mutation)運算與排序為基礎之輪盤選擇法(Rankbased roulette wheel selection)以達更好的搜尋能力。
最後，實驗透過31與50家公司之真實股市資料驗證所提方法的效率與效能確實優於現存的最佳化技術。與現存方法比較顯示，方法一不但在獲利上可達到相似結果，在執行時間上亦可減少約原來的百分之八十五。而方法二所找出之多樣群組投資組合其風險上也有明顯降低。

英文摘要 
Due to the variety of financial markets, stock portfolio optimization is an attractive research topic. In the past decades, many evolutionarybased algorithms have been proposed to optimize different types of stock portfolios, and one of them is named diverse group stock portfolio (DGSP). However, this study found that three problems remain to be solving in the existing DGSP approaches. They are how to set an appropriate group size, evolution process is timeconsuming, and difference between risks of portfolios is too high. To solve these problems, two approaches by grouping genetic algorithms (GGA) are proposed for optimizing DGSPs in this thesis.
The first approach is used to deal with the first two problems. For setting an appropriate group size, the two attributes, Return on Equity (RoE) and Price Earnings Ratio (P/E), are utilized to group stocks. Then, cluster validation factor, which is used as a part of fitness function, is designed to derive better stock groups. To solve timeconsuming problem, a temporary chromosome is designed to reduce number of stock portfolios should be evaluated to speed up the evolution process.
The second approach is then proposed to handle the third problem. It first designs risk ratio factor to calculate the maximum risk of a given DGSP. Then, by combining adaptive crossover, adaptive mutation, and rankbased roulette wheel selection, the second approach has higher searching ability to find better solution.
At last, experimental results on the two real datasets that contain 31 and 50 stocks were made to verify the two proposed approaches are effective and efficient. Comparing with the existing approach, the results show that the first approach can not only reach similar return but also reduce execution time up to 85%. The risk of optimized DGSP by second approach is significantly lower than that by the existing approaches.

論文目次 
Contents
CHAPTER 1 INTRODUCTION 1
1.1 Problem Definition and Motivation 1
1.2 Contributions 4
1.3 Reader’s Guide 6
CHAPTER 2 RELATED WORK 7
2.1 The MV model 7
2.2 Metaheuristics for portfolio optimization 8
2.3 Metaheuristics for Clustering 10
2.4 Cluster Validation Indices 11
CHAPTER 3 DIVERSE GROUP STOCK PORTFOLIO OPTIMIZATION APPROACH WITHOUT SETTING A GROUP NUMBER 14
3.1 Motivation 14
3.2 Components of Proposed Approach 16
3.2.1 Clustering Attributes 16
3.2.2 Encoding Scheme 17
3.2.3 Temporary Chromosome 19
3.2.4 Initial Population 22
3.2.5 Fitness Evaluation 23
3.2.6 Genetic Operations 25
3.3 Algorithm for First Approach 27
3.4 An Example 30
CHAPTER 4 A ENHANCED ALGORITHM FOR OPTIMIZING DIVERSE GROUP STOCK PORTFOLIO 42
4.1 Motivation 42
4.2 Elements of the proposed algorithm 44
4.2.1 Encoding Scheme 45
4.2.2 Initial Population 47
4.2.3 Fitness Evaluation 47
4.2.4 Genetic Operations 53
4.3 Algorithm for Second Approach 59
4.4 An Example 62
CHAPTER 5 EXPERIMENTAL RESULTS 76
5.1 Experimental Results for Approach (I) 76
5.1.1 Experimental Datasets 76
5.1.2 Effectiveness of The Proposed Approach 79
5.1.2.1 Experimental Results for Different Desired Stock 80
5.1.2.2 Analysis of the Derived GSP 85
5.1.3 Efficiency of Proposed Approach 88
5.2 Experimental Results for Approach (II) 90
5.2.1 Experimental Datasets 91
5.2.2 Effectiveness Of The Proposed Approach 91
5.2.3 Efficiency of Proposed Approach 100
CHAPTER 6 CONCLUSION AND FUTURE WORKS 103
REFERENCES 106
List of Figures
Figure 1. Encoding scheme of chromosome Cq 17
Figure 2. An example of the encoding scheme with three parts. 18
Figure 3. An example of a chromosome with four parts 19
Figure 4. An example of a chromosome from the previous approach 20
Figure 5. An example of a chromosome with relevant information 20
Figure 6. An example of a temporary chromosome 22
Figure 7. Encoding scheme of chromosome Cq. 45
Figure 8. An example of the encoding scheme. 46
Figure 9. The pie chart of the five chromosome in Table 10 58
Figure 10. The stock price series for 31 companies from the beginning of 2010 to the end of 2014 77
Figure 11. The stock price series for 50 companies from the beginning of 2012 to the end of 2014 78
Figure 12. The average fitness over 100 generations using elitist selection 80
Figure 13. The average fitness of the second approach with rankbased roulette wheel selection over 10 runs 92
Figure 14. The average risk of both approaches for 200 generations 96
List of Tables
Table 1. The relationship between group size and number of combinations. 21
Table 2. The 10 stocks used in the example 31
Table 3. The portfolio satisfaction for each chromosome 35
Table 4. The group balance for each chromosome 35
Table 5. The dissimilarity matrix for the proposed approach 35
Table 6. The diversity values for each chromosome 36
Table 7. The DaviesBouldin Index for every chromosome 37
Table 8. The fitness for each chromosome 38
Table 9. Two generated DGSPs with similar ROI but different risk values 43
Table 10. Survival probabilities of five chromosomes 58
Table 11. The 10 stocks used in the example 62
Table 12. The portfolio satisfaction for each chromosome 66
Table 13. The group balance for each chromosome 66
Table 14. The dissimilarity matrix for the proposed approach 67
Table 15. The diversity values for each chromosome 67
Table 16. The values of DaviesBouldin index for every chromosome 69
Table 17. The fitness for each chromosome 70
Table 18. The total fitness values for all the chromosomes 70
Table 19. The fitness f'(Cq) for all chromosomes after conversion for rankbased selection 71
Table 20. The probabilities of selection based on ranking 71
Table 21. The parameter settings for 31 and 50 company data sets 79
Table 22. The experimental results for the DavisBouldin Index approach on 31 stocks 80
Table 23. The experimental results for the C Index approach on 31 stocks 81
Table 24. The experimental results for the Dunn Index approach on 31 stocks 81
Table 25. The experimental results for the Silhouette approach on 31 stocks 82
Table 26. Approach using DavisBouldin Index with 50 stocks 84
Table 27. Approach using Silhouette index with 50 stocks 84
Table 28. Approach using Dunn index with 50 stocks 84
Table 29. Approach using C index with 50 stocks 85
Table 30. The initial and derived GSP for 31 stocks 86
Table 31. The initial and derived DGSP for 50 stocks 87
Table 32. The efficiency of previous and proposed approach 1 for 31 stocks 89
Table 33. The efficiency of Approach 1 using 50 stocks 90
Table 34. The experimental results for approach 2 using 31 stocks 93
Table 35. The experimental results for approach 2 using 50 companies 93
Table 36. The risk for 10 runs using the original approach 94
Table 37. The risk for approach 2 using 31 stocks 95
Table 38. The risk for the original approach using 50 stocks dataset 96
Table 39. The risk for proposed approach 2 using 50 stocks 97
Table 40. The initial and derived DGSP for 31 companies 98
Table 41. The initial and derived DGSP for approach 2 using 50 stocks 99
Table 42. The efficiency of approach 2 using 31 stocks 101
Table 43. The efficiency of approach 2 using 50 stocks 102

參考文獻 
[1] L.E. Agustı´nBlasa, S. SalcedoSanza, S. JiménezFernándeza, L. CarroCalvoa, J. Del Serb, and J.A. PortillaFiguerasa, “A new grouping genetic algorithm for clustering problems” Expert Systems with Applications, Vol. 39, No. 10, pp. 9695–9703, 2012.
[2] M. H. Babaei, M. Hamidi, E. Jahani and H. Pasha, "A new approach to solve an extended portfolio selection problem," International Conference on Industrial Engineering and Operations Management, pp. 19541960, 2012.
[3] S. Barak, M. Abessi, and M. Modarres, "Fuzzy turnover rate chance constraints portfolio model," European Journal of Operational Research, Vol. 228, No. 1, pp. 141147, 2013
[4] J. Bermúdez, J. Segura and E. Vercher, "A multiobjective genetic algorithm for cardinality constrained fuzzy portfolio selection," Fuzzy Sets and Systems, Vol. 188, pp. 1626, 2012.
[5] V. Bevilacqua, V. Pacelli and S. Saladino, "A novel multi objective genetic algorithm for the portfolio optimization," Advanced Intelligent Computing, pp. 186193, 2012.
[6] T. J. Chang, S. C. Yang and K. J. Chang, "Portfolio optimization problems in different risk measures using genetic algorithm," Expert Systems with Applications, Vol. 36, pp. 1052910537, 2009.
[7] C. H. Chen and C. Y. Hsieh, "Mining actionable stock portfolio by genetic algorithms," accepted as to appear in Journal of Information Science and Engineering, 2016.
[8] C. H. Chen, C. Y. Lu, T. P. Hong and J. H. Su, "Using grouping genetic algorithm to mine diverse group stock portfolios," The IEEE Congress on Evolutionary Computation, 2016.
[9] C. H. Chen, C. B. Lin and C. C. Chen, "Mining group stock portfolio by using grouping genetic algorithms," The IEEE Congress on Evolutionary Computation, pp. 738  743, 2015.
[10] L.H. Chen and L. Huang, “Portfolio optimization of equity mutual funds with fuzzy return rates and risks”, Expert Systems with Applications, Vol. 36, Issue 2, Part 2, pp. 3720–3727, 2009.
[11] W. Chen, “Artificial bee colony algorithm for constrained possibilistic portfolio optimization problem”, Physica A: Statistical Mechanics and its Applications, Vol. 429, pp 125–139, 2015
[12] Z. G. M. Elhachlouﬁ, F. Hamza, "Stocks portfolio optimization using classification and genetic algorithms," Applied Mathematical Sciences, Vol. 6, pp. 46734684, 2012.
[13] L. R. Z. Hoklie, "Resolving multi objective stock portfolio optimization problem using genetic algorithm," International Conference on Computer and Automation Engineering, pp. 4044, 2010.
[14] H. Jalota, M. Thakur and G. Mittal, “Modelling and constructing membership function for uncertain portfolio parameters: a credibilistic framework”, Expert Systems with Applications, Volume 71, pp 4056, 2017
[15] H. Konno and A. Wijayanayake, "Portfolio optimization problem under concave transaction costs and minimal transaction unit constraints," Mathematical Programming, Vol. 89, pp. 233250, 2001.
[16] H. Kellerer, R. Mansini and S. M. Grazia, "Selecting portfolios with fixed costs and minimum transaction lots," Annals of Operations Research, Vol. 99, pp. 287304, 2000.
[17] J. C. Li, C. Long, and X.D. Chen, “The returns and risks of investment portfolio in stock market crashes”, Physica A: Statistical Mechanics and its Applications, Vol. 427, pp. 282–288, 2015
[18] C. C. Lin and Y. T. Liu, "Genetic algorithms for portfolio selection problems with minimum transaction lots," European Journal of Operational Research, Vol. 185, pp. 393404, 2008.
[19] P. C. Lin, "Portfolio optimization and risk measurement based on nondominated sorting genetic algorithm," Journal of Industrial and Management Optimization, Vol. 8, pp. 549564, 2012.
[20] Y. J. Liu and W. G. Zhang, "Fuzzy portfolio optimization model under real constraints," Insurance: Mathematics and Economics, Vol. 53, No. 3, pp. 704711, 2013.
[21] K. Lwin, , R. Qu, and G. Kendall, “A learningguided multiobjective evolutionary algorithm for constrained portfolio optimization”, Applied Soft Computing, Vol 24, pp 757–772, 2014
[22] R. Mansini and M. G. Speranza, "Heuristic algorithms for the portfolio selection problem with minimum transaction lots," European Journal of Operational Research, Vol. 114, pp. 219233, 1999
[23] H. M. Markowitz, "Harry Markowitz: Selected Works," World Scientific Publishing Company, 2009.
[24] G. A. V. Pai, and Thierry Michel, "Evolutionary optimization of constrained kmeans clustered assets for diversification in small portfolios," IEEE Transactions On Evolutionary Computation, Vol. 13, No. 5, pp. 1030  1053, 2009.
[25] A. R. Pouya, A. Rezaei, M. Solimanpur and M. J. Rezaee, “Solving multiobjective portfolio optimization problem using invasive weed optimization.”, Swarm and Evolutionary Computation, Vol 28, pp 4257. 2016
[26] S. H. Razavi, E. O. Ebadati, S. Asadi, and H. Kaur, “An efficient grouping genetic algorithm for data clustering and big data analysis”, Computational Intelligence for Big Data Analysis, Frontier Advances and Applications, Vol 19, pp 119142, 2015
[27] F. J. Rodriguez, M. Lozano, C. GarcíAMartíNez, and J. D. GonzáLezBarrera. “An artificial bee colony algorithm for the maximally diverse grouping problem”, Information Sciences. Vol 230, pp 183196, 2013
[28] A. D. Roy, "Safety first and the holding of assets," Econometrica, Vol. 20, pp. 431449, Jul. 1952.
[29] S. SalcedoSanz, J. Del Ser and Z. W. Geem, "An island grouping genetic algorithm for fuzzy partitioning problems," The Scientiﬁc World Journal, Vol. 2014, pp. 115, 2014.
[30] A. Silva, R. Neves, and N. Horta, “A hybrid approach to portfolio composition based on fundamental and technical indicators”, Expert Systems with Applications, Vol. 42, Issue 4, pp 20362048, 2015
[31] R. R. Weitz and S. Lakshminarayanan, "An empirical comparison of heuristic methods for creating maximally diverse groups," Journal of the operational Research Society, pp. 635646, 1998.

論文使用權限 
同意紙本無償授權給館內讀者為學術之目的重製使用，於20220223公開。同意授權瀏覽/列印電子全文服務，於20220223起公開。 


