
Manuscript Title: Application of Decision Tree in Classifying Secondary School Students’ Tendencies to Choose TVET in Malaysia

Chia Ming Hong^1, Chee Keong Ch’ng^2*, Teh Raihana Nazirah Roslan^3

^1,2 Department of Decision Science, School of Quantitative Sciences, Universiti Utara Malaysia, 06010 Sintok, Kedah, Malaysia

^3 Othman Yeop Abdullah Graduate School of Business, Universiti Utara Malaysia, 50300 Kuala Lumpur, Malaysia


Article History: Received: 10 November 2020; Revised: 12 January 2021; Accepted: 27 January 2021; Published online: 05 April 2021

Abstract

The wave of the Industrial Revolution 4.0 (IR 4.0) highlights the importance of technology in our lives. The demand for technologists and skilled workers trained through Technical and Vocational Education and Training (TVET) is increasing day by day because of their expertise. TVET provides a platform for formal and non-formal learning that equips young people to contribute to the development of a prosperous and inclusive nation. Moreover, TVET promises bright job prospects, especially in fulfilling the manpower demand of IR 4.0. However, students in Malaysia are currently not fully aware of TVET, as the number of students who join TVET is still below expectation. Therefore, the main objective of this study is to develop the best TVET model to classify students’ tendency to choose TVET after completing secondary school. From the literature, five main factors that affect students’ interest in joining TVET are identified, namely students’ interest, parents, society, TVET instructors and employers. In this study, 428 secondary school students from Kedah, Malaysia, are involved as respondents. Different decision tree models are developed based on the algorithms and the splitting criteria. Altogether, 15 variables derived from the five factors above are used to determine the tendency to join TVET. The best TVET classifier, with a misclassification rate of 0.2919, is selected to predict which students will tend to join TVET in the future. Our findings reveal that the variable “Stream” plays the primary role in the classification. This classifier can assist the government in achieving its aim of upholding TVET in Malaysia.

Keywords: TVET, Data mining, Classification, Decision tree

1. Introduction

To keep abreast of the era of Industrial Revolution (IR) 4.0, it is undeniable that technology plays an essential role in daily life. Today, mobile phones and artificial intelligence-based appliances are no longer regarded as luxuries but as necessities. Thus, the demand for technologists who can handle various machines is high, because they are the ones who possess the expertise in that field.

In Malaysia, Technical and Vocational Education and Training (TVET) is a channel to prepare students for IR 4.0 by providing them with useful workplace skills [1]. TVET is a dual teaching process that involves general education, technological knowledge and training that are closely linked with industry. It is a tool to produce skilled workers for the future [2]. TVET graduates can work in many fields such as electrical work, construction and maintenance. The Malaysian government has been focusing on TVET through many strategies, such as allocating RM5.9 billion in Budget 2020, developing an online database to concentrate on important sectors, organizing various exhibition tours, forming a committee known as the TVET Empowerment Cabinet (JKKPTVET), and implementing various policies [3], [4], [5]. Despite these efforts, students today are still not very concerned about TVET. This can be observed from the number of students registered in TVET, which was still below expectation [6]. In addition, only 9% of students chose polytechnics when selecting a tertiary education channel [7].

Overall, many factors affect students’ tendency to choose TVET. The first factor is students’ personality and interest [8]. Interest encourages students to join TVET if they are passionate about vocational activities such as repairing broken things or cutting hair. Next, social perception is another factor that causes students to reject TVET. Society usually holds a pessimistic perception of TVET students because they are labelled as academically weak [8], [9]. This negativity also affects parents’ mindset. Parents’ perception is crucial in determining their children’s future [10]. Parents are usually reluctant to let their children join TVET because they perceive that TVET is not good for their children’s future. In addition, the undervaluing of TVET certification is an issue that increases unemployment among TVET graduates. Cheong and Lee [11] discussed that employers had negative perceptions of TVET graduates’ ability, assuming that TVET graduates were not academically capable. Other than that, the salaries of TVET graduates were lower than those of graduates from non-vocational and non-technical courses [12]. Finally, the last factor is the instructors who teach in TVET institutions. Because of low salaries, TVET institutions face a shortage of teaching staff [13]. The quality of teachers is also crucial to an institution. A recent study revealed that TVET instructors were lacking in ICT skills, knowledge, and English skills [14].

The factors mentioned above affect a student’s tendency to enter TVET. To address this problem, the decision tree is chosen among the classification models in this research. The decision tree is a popular approach to data classification because it can handle missing values. Also, because of its transparency, a decision tree can be easily interpreted by decision makers. It also allows pruning.

The decision tree was developed by J. Ross Quinlan [15]. It is a tree-based diagram that contains a root node, internal nodes, and leaf nodes. The root node is at the top of the tree, represents the whole population, and is divided into two or more sets. Internal nodes are in the middle of the tree. A leaf node is a node at the bottom of the tree that does not split further [16]. The structure of a decision tree is displayed in Figure 1.

Figure 1. Structure of decision tree

A decision tree splits on the variable that yields the highest homogeneity. To deal with the overfitting problem that arises when the data are partitioned into ever smaller subsets, pruning is a crucial step. It removes the parts of the tree that do not contribute to the classification, which reduces the size of the tree. Four algorithms are commonly used to build decision trees, namely ID3, C4.5, CART and CHAID [16].

ID3 was developed by Quinlan [15]. It can create a short tree quickly but is unable to handle missing values and continuous data [17]. The splitting criteria in ID3 are entropy and information gain [18]. Entropy measures the unpredictability in the dataset. The entropy is zero if the sample is completely homogeneous. The formula of entropy is,

$$\mathrm{Entropy},\ H(X) = -\sum_{x \in X} p(x)\,\log_2 p(x) \qquad (1)$$

where

$X$ = random variable, and

$p(x)$ = the probability of outcome $x$ of variable $X$.

The difference between the entropy before and after splitting is known as information gain. The formula of information gain (IG) is given by,

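$$IG = \mathrm{Entropy}(S) - \sum_{i=1}^{n} \frac{|S_i|}{|S|} \cdot \mathrm{Entropy}(S_i) \qquad (2)$$

where

$n$ = number of values of attribute $A$ (one partition $S_i$ per value),

$|S_i|$ = number of cases in the partition $S_i$, and

$|S|$ = total number of cases in $S$.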
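As a minimal sketch of Equations (1) and (2), the following Python functions compute the entropy of a set of class labels and the information gain of one candidate attribute. The function names and the toy data are illustrative assumptions and are not taken from the study's dataset or from SAS E-miner.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (base 2) of a sequence of class labels, as in Equation (1)."""
    total = len(labels)
    counts = Counter(labels)
    # p * log2(1 / p) is the same as -p * log2(p) and avoids returning -0.0
    return sum((c / total) * math.log2(total / c) for c in counts.values())


def information_gain(labels, attribute_values):
    """Parent entropy minus the size-weighted entropy of each partition, as in Equation (2)."""
    total = len(labels)
    partitions = {}
    for label, value in zip(labels, attribute_values):
        partitions.setdefault(value, []).append(label)
    weighted = sum(len(part) / total * entropy(part) for part in partitions.values())
    return entropy(labels) - weighted


# Hypothetical toy data (not from the survey): the attribute separates the classes perfectly.
toy_labels = ["Yes", "Yes", "No", "No"]
toy_stream = ["VOCATIONAL", "VOCATIONAL", "SCIENCE", "SCIENCE"]
print(entropy(toy_labels))                       # 1.0
print(information_gain(toy_labels, toy_stream))  # 1.0
```

In this toy case the parent set is evenly mixed (entropy 1.0) and each partition is pure (entropy 0), so the split recovers the full 1.0 bit of information.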

On the other hand, C4.5 was developed by Quinlan [15]. It is an extension of the earlier ID3 algorithm and can handle missing values and continuous data [17]. The splitting criterion of C4.5 is the information gain ratio. If the data fall under one branch, the gain ratio value will be smaller compared to data that are evenly distributed [19]. The formula of gain ratio is presented as,

$$GainRatio = \frac{GAIN(X, T)}{INFO(X, T)} \qquad (3)$$

where

$GAIN(X, T) = Info(T) - Info(X, T)$,

$INFO(X, T) = -\sum_{i=1}^{n} \frac{|T_i|}{|T|}\, Info(T_i)$, and

$|T_i|$ = number of records in partition $i$.

Next, the Classification and Regression Tree (CART) algorithm was developed by Breiman et al. [20]. Besides handling outliers, it can handle both categorical and numerical variables. CART uses the Gini index as the splitting criterion to develop a binary tree classifier [17]. The Gini index is a measurement of the impurity in the dataset [18]. The formula of the Gini index is displayed as,

$$Gini(S) = 1 - \sum_{j=1}^{c} p_j^2 \qquad (4)$$

where

$p_j$ = the proportion of the sample in class $j$, and

$c$ = the number of classes.
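Equation (4) can be illustrated with a short Python sketch. The function name `gini_index` and the example labels are assumptions made for illustration; they do not reproduce the SAS E-miner implementation.

```python
from collections import Counter


def gini_index(labels):
    """Gini impurity of a sequence of class labels: 1 minus the sum of squared class proportions."""
    total = len(labels)
    return 1.0 - sum((count / total) ** 2 for count in Counter(labels).values())


# Hypothetical labels (not from the survey).
print(gini_index(["Yes", "Yes", "Yes"]))  # 0.0, a pure node
print(gini_index(["Yes", "No"]))          # 0.5, an evenly mixed two-class node
```

A pure node has an impurity of 0, and the impurity grows as the classes become more evenly mixed.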

Finally, the Chi-square Automatic Interaction Detector (CHAID) algorithm was developed by Kass [21]. It is used to develop non-binary trees and deals with large datasets [22]. Furthermore, Díaz-Pérez and Bethencourt-Cejas [23] claimed that CHAID has many advantages since it is a non-parametric method and can handle continuous and categorical attributes. For continuous attributes, CHAID uses the F-statistic to split the dataset. The formula of the F-statistic is written as,

$$F_{(\alpha,\, k-1,\, N-k)} = \frac{BSS/(k-1)}{WSS/(N-k)} \qquad (5)$$

where

$\alpha$ = critical value,

$BSS$ = sum of squares between groups,

$WSS$ = sum of squares within groups,

$k$ = number of groups, and

$N$ = sum of sample size for the groups.
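The ratio in Equation (5) can be sketched in Python as follows. This is an illustrative computation of the statistic only, under the assumption that each group is a list of numeric values; it does not perform the significance test or the category merging that CHAID carries out, and the function name is hypothetical.

```python
def f_statistic(groups):
    """F-statistic (BSS / (k - 1)) / (WSS / (N - k)) for a list of numeric groups."""
    k = len(groups)
    n_total = sum(len(group) for group in groups)
    grand_mean = sum(sum(group) for group in groups) / n_total
    group_means = [sum(group) / len(group) for group in groups]
    # Between-group sum of squares: group size times squared distance of group mean from grand mean
    bss = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means))
    # Within-group sum of squares: squared distance of each value from its own group mean
    wss = sum((x - m) ** 2 for g, m in zip(groups, group_means) for x in g)
    return (bss / (k - 1)) / (wss / (n_total - k))


# Hypothetical values (not from the survey): two groups of three observations each.
print(f_statistic([[1, 2, 3], [4, 5, 6]]))  # 13.5
```

Here BSS = 13.5 and WSS = 4 with k = 2 and N = 6, so the statistic is (13.5 / 1) / (4 / 4) = 13.5.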

For categorical attributes, CHAID uses the Chi-square statistic as the splitting criterion. The formula of chi-square is presented as,

$$\chi^2 = \sum_{i=1}^{k} \frac{(\mathrm{Observed}_i - \mathrm{Expected}_i)^2}{\mathrm{Expected}_i} \qquad (6)$$

where

Observed = observed frequency for each category,

Expected = expected frequency for each category, and

$k$ = number of categories.
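A minimal Python sketch of Equation (6) is given below. The helper that derives expected frequencies from a contingency table under independence is an added assumption for illustration, as are all function names and the example counts; the sketch does not reproduce CHAID's category merging or p-value adjustment.

```python
def chi_square(observed, expected):
    """Chi-square statistic: sum over categories of (observed - expected)^2 / expected."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def expected_frequencies(table):
    """Expected cell counts under independence for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


def chi_square_table(table):
    """Chi-square statistic of a contingency table (predictor categories by target classes)."""
    expected = expected_frequencies(table)
    flat_observed = [cell for row in table for cell in row]
    flat_expected = [cell for row in expected for cell in row]
    return chi_square(flat_observed, flat_expected)


# Hypothetical counts (not from the survey): rows are predictor categories, columns are classes.
toy_table = [[10, 20], [20, 10]]
print(expected_frequencies(toy_table))  # [[15.0, 15.0], [15.0, 15.0]]
print(chi_square_table(toy_table))
```

A larger statistic indicates a stronger association between the predictor categories and the target classes, which is what makes the predictor attractive as a split.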

The splitting rules will affect the performance of Decision Tree [24]. Therefore, all the models created will be compared by using misclassification rate. A lower misclassification rate means it is a better model. The formula of misclassification rate is displayed as,

$$\text{Misclassification rate} = \frac{FP + FN}{FP + FN + TP + TN} \qquad (7)$$

where

$FN$ = false negative, the actual class is positive but wrongly predicted as negative,

$FP$ = false positive, the actual class is negative but wrongly predicted as positive,

$TN$ = true negative, the actual class is negative and correctly predicted as negative, and

$TP$ = true positive, the actual class is positive and correctly predicted as positive.
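Equation (7) translates directly into code. The following Python sketch uses hypothetical confusion-matrix counts chosen only to make the arithmetic easy to follow; they are not results from this study.

```python
def misclassification_rate(tp, tn, fp, fn):
    """Share of wrongly classified cases among all cases, as in Equation (7)."""
    return (fp + fn) / (fp + fn + tp + tn)


# Hypothetical counts: 20 of 100 cases are misclassified.
print(misclassification_rate(tp=50, tn=30, fp=10, fn=10))  # 0.2
```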


Previous researchers have often applied decision trees in the education field. Maer Matei, Mocanu and Zamfir [25] applied a decision tree to determine students’ choice between general education and vocational education in Romania. They investigated four factors, namely age, education, gender, and income, using the CART algorithm, and found that age had the highest importance among all factors. Next, Gonzales, Lagman and De Angel [26] used a decision tree algorithm, specifically the J48 method, to build a model that predicts TVET students’ performance so that the students’ future success could be determined. Furthermore, Hassan and Awan [27] used data mining techniques to classify the behavior of students enrolled in TVET courses. They applied association, clustering, and classification methods to discover the enrollment behavior needed to produce skilled workers. For the classification model, they applied Decision Tree, Random Forest, Neural Network and Naïve Bayes. The results showed that all the models had the same accuracy rate.

2. Materials and Methods

In this study, 428 Form 4 and Form 5 students were selected as respondents in Kedah, a northern state of Malaysia. To reduce sampling bias, stratified random sampling and simple random sampling were used to determine the sample. Stratified random sampling was used to organize Kedah based on District Education Offices (DEOs) and regions, so that 36 regions were classified into 9 DEOs. After that, simple random sampling was applied within each DEO to select the respective regions. Next, the questionnaires were distributed to the students in the listed schools. The questionnaire focused on the five main factors discussed in the previous section, namely students, TVET instructors, social perception, parents, and employers.

The objective of this study is to develop a TVET classifier that classifies secondary school students’ tendency to choose TVET. The target has three output classes: “Yes”, “No”, and “Maybe”. “Yes” stands for “students will join TVET”, “No” stands for “students will not join TVET”, and “Maybe” stands for “students might join TVET”. Figure 2 displays the process of tree development in SAS E-miner.

Figure 2. Tree development using SAS E-miner

a) File import

In this study, 15 variables were imported from Microsoft Excel. The details of each variable are given in Table 1.

Table 1. Overview of dataset

Name | Role | Level | Explanation
FamilyInvolved | Input | Nominal | Whether family members of the student have joined TVET.
FatherEducation | Input | Nominal | The highest education of the student’s father.
FatherJob | Input | Nominal | The occupation of the student’s father.
HeardTVET | Input | Binary | Whether the student has heard information about TVET before.
InterestedField | Input | Nominal | The field that the student is interested in joining.
InterestedTVET | Target | Nominal | Whether the student feels interested in joining TVET after Form 5.
MotherEducation | Input | Nominal | The highest education of the student’s mother.
MotherJob | Input | Nominal | The occupation of the student’s mother.
Stream | Input | Nominal | The stream of the student in secondary school.
TVETBright | Input | Ordinal | The level of agreement that people think students who obtain a TVET certification have a bright future.
TVETStudHigherJobChance | Input | Ordinal | The level of agreement that TVET students have a higher chance of getting a job than non-TVET students.
TVETStudHighSalary | Input | Ordinal | The level of agreement that TVET students have a high salary.
VocationalCourseStatus | Input | Ordinal | The level of agreement that vocational courses have a high status.
VocationalSubjInteresting | Input | Ordinal | The level of agreement that vocational subjects are interesting.

b) Impute

A decision tree requires less data cleaning than some other methods because it is largely unaffected by missing values and outliers. Data removal is preferable when the number of missing values is small, whereas data replacement is more appropriate when the number of missing values is large [28]. Moreover, the best surrogate predictor can be used when the value of the optimal split predictor is missing for an observation. A decision tree is normally robust to outliers. Outliers will most likely have a negligible effect because the nodes are determined based on the sample proportions in each split region [29]. Nevertheless, researchers have proposed approaches to handle anomalies, such as the Robust Decision Tree [30] and the Winsorized tree [31]. In short, the decision tree is insensitive to the presence of missing values and outliers. Hence, it produces a comprehensive analysis of the consequences along each branch.

c) Data Partition

In this node, the dataset was split into a 70% training set and a 30% testing set. Of the 428 samples, 299 were used as training data and 129 as testing data.
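The 70/30 allocation can be sketched in Python as below. This is only an illustration of a simple random split: the function name, the seed, and the rounding-down of the training size are assumptions, and the partitioning method actually used by the Data Partition node in SAS E-miner may differ (for example, it may stratify by the target).

```python
import math
import random


def partition_indices(n_samples, train_fraction=0.7, seed=0):
    """Shuffle row indices and split them into training and testing index lists."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # seeded so that the split is reproducible
    n_train = math.floor(n_samples * train_fraction)
    return indices[:n_train], indices[n_train:]


train_idx, test_idx = partition_indices(428)
print(len(train_idx), len(test_idx))  # 299 129
```

With 428 samples, rounding 428 × 0.7 down gives 299 training cases, leaving 129 testing cases, which matches the sizes reported above.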

d) Decision tree models

Four decision trees were created based on the splitting criteria and the maximum number of branches. Since the variables are nominal and ordinal, the splitting criteria were based on the Gini index and entropy. Decision Tree 1 used Gini for the nominal attributes and entropy for the ordinal attributes, whereas Decision Tree 2 used Gini for both nominal and ordinal attributes. Decision Tree 3 used entropy for the nominal attributes and Gini for the ordinal attributes, and Decision Tree 4 used entropy for both nominal and ordinal attributes. As discussed before, entropy is used for multiway splits while Gini is used for binary splits. Hence, Decision Tree 1 and Decision Tree 4 had a maximum of 3 branches, whereas Decision Tree 2 and Decision Tree 3 had a maximum of 2 branches. Table 2 shows the details of the decision tree models.

Table 2. Decision Tree models

Decision tree models | Splitting criteria | Maximum branches
Decision Tree 1 | Nominal target: Gini, Ordinal target: Entropy | 3
Decision Tree 2 | Nominal target: Gini, Ordinal target: Gini | 2
Decision Tree 3 | Nominal target: Entropy, Ordinal target: Gini | 2
Decision Tree 4 | Nominal target: Entropy, Ordinal target: Entropy | 3


In the model comparison node, the decision tree models were compared using the misclassification rate. The model with the lowest misclassification rate is the best model to predict the tendency of students to select TVET in the future.

3. Results and Discussion

The model comparison node shows the misclassification rate for each model. From the results obtained, the best model is Decision Tree 3, which has the lowest misclassification rate (0.2919) with a simpler classifier. Table 3 shows the misclassification rates of the decision tree models.

Table 3. Misclassification rate for decision tree

Decision tree models | Misclassification rate
Decision Tree 1 | 0.3154

Decision Tree 3 has binary branches with a depth of 7. Variables with higher importance are assigned to higher levels of the tree. Figure 3 shows the complete Decision Tree 3 model.

Figure 3. Decision Tree 3 model

Since Figure 3 is too large to be seen clearly, it is divided into a left side and a right side in Figure 3 (a) and Figure 3 (b).

Figure 3 (b). Decision Tree 3 model (right side)

During the splitting process, the variables that yield the highest homogeneity are selected. Thus, a feature might not be chosen if it does not contribute to the decrease of impurity. Based on the results, the variable “Stream” had the highest importance (1.0000), followed by IMP_MotherEducation (0.8684), VocationalCourseInteresting (0.8679), TVETStudHigherJobChance (0.8166), Gender (0.7108), IMP_MotherJob (0.6269), IMP_FatherJob (0.5621), TVETBright (0.4853), VocationalSubjStatus (0.3872), IMP_FatherEducation (0.3647), FamilyInvolved (0.3339), and InterestedField (0.3177). Two variables, namely TVETStudHighSalary and HeardTVET, had an importance of 0.0000. The summary of importance for each variable is displayed in Table 4.

Table 4. Importance of variable

Variable name | Importance
Stream | 1.0000
IMP_MotherEducation | 0.8684
VocationalCourseInteresting | 0.8679
TVETStudHigherJobChance | 0.8166
Gender | 0.7108
IMP_MotherJob | 0.6269
IMP_FatherJob | 0.5621
TVETBright | 0.4853
VocationalSubjStatus | 0.3872
IMP_FatherEducation | 0.3647
FamilyInvolved | 0.3339
InterestedField | 0.3177
TVETStudHighSalary | 0.0000
HeardTVET | 0.0000

Other than that, a decision tree can be linearized into a set of decision rules that represent the whole system, as shown in Table 5.

Table 5. Details for each node

Node | Details
Node = 12 | IF VocationalCourseInteresting <= 3 AND Stream IS ONE OF: SCIENCE or MISSING THEN “Maybe” (67%), “No” (33%).
Node = 13 | IF VocationalCourseInteresting <= 3 AND Stream IS ONE OF: ACCOUNT THEN “Maybe” (12%), “No” (88%).
Node = 20 | IF VocationalCourseInteresting >= 5 AND TVETStudHigherJobChance <= 3 AND Stream IS ONE OF: ARTS, BUSINESS, ICT, LITERATURE, VOCATIONAL or MISSING AND InterestedField IS ONE OF: MANUFACTURING, CONSTRUCTION or MISSING THEN “Maybe” (100%).
Node = 21 | IF VocationalCourseInteresting >= 5 AND TVETStudHigherJobChance >= 4 or MISSING AND Stream IS ONE OF: ARTS, BUSINESS, ICT, LITERATURE, VOCATIONAL or MISSING AND InterestedField IS ONE OF: MANUFACTURING, CONSTRUCTION or MISSING THEN “Yes” (88%), “Maybe” (12%).
Node = 25 | IF VocationalCourseInteresting >= 5 or MISSING AND Stream IS ONE OF: TECHNICAL, ACCOUNT, SCIENCE AND FamilyInvolved IS ONE OF: YES THEN “Yes” (17%), “Maybe” (83%).
Node = 27 | IF VocationalCourseInteresting <= 4 or MISSING AND TVETBright >= 3 or MISSING AND Stream IS ONE OF: ARTS, BUSINESS, ICT, LITERATURE, VOCATIONAL or MISSING AND Imputed: MotherEducation IS ONE OF: SPM, NO COMPLETE SECONDARY SCHOOL, DEGREE or MISSING AND Imputed: FatherJob IS ONE OF: CLERICAL/ CIVIL SERVICE, UNEMPLOYED, AGRICULTURE THEN “Maybe” (72%), “Yes” (9%), “No” (19%).
Node = 29 | IF VocationalCourseInteresting <= 4 or MISSING AND TVETBright >= 3 or MISSING AND Stream IS ONE OF: ARTS, BUSINESS, ICT, LITERATURE, VOCATIONAL or MISSING AND Imputed: MotherJob equals Missing AND Imputed: MotherEducation IS ONE OF: DIPLOMA, STPM THEN “Yes” (100%).
Node = 32 | IF VocationalCourseInteresting >= 5 AND Stream IS ONE OF: ARTS, BUSINESS, ICT, LITERATURE, VOCATIONAL or MISSING AND InterestedField IS ONE OF: CLERICAL/ CIVIL SERVICE, SELF-EMPLOYED, PROFESSIONAL AND Imputed: FatherEducation IS ONE OF: NO COMPLETE SECONDARY SCHOOL or MISSING AND Gender IS ONE OF: MALE or MISSING THEN “Yes” (100%).
Node = 37 | IF VocationalCourseInteresting >= 4 or MISSING AND TVETStudHigherJobChance >= 4 AND Stream IS ONE OF: TECHNICAL, ACCOUNT, SCIENCE AND Imputed: MotherEducation IS ONE OF: SPM, DIPLOMA, DEGREE or MISSING AND FamilyInvolved IS ONE OF: NO or MISSING THEN “Maybe” (72%), “Yes” (16%), “No” (12%).
Node = 38 | IF VocationalCourseInteresting >= 4 or MISSING AND TVETStudHigherJobChance <= 3 AND Stream IS ONE OF: TECHNICAL, ACCOUNT, SCIENCE AND Imputed: MotherEducation IS ONE OF: NO COMPLETE SECONDARY SCHOOL, PHD AND FamilyInvolved IS ONE OF: NO or MISSING THEN “Maybe” (80%), “No” (20%).
Node = 39 | IF VocationalCourseInteresting >= 4 or MISSING AND TVETStudHigherJobChance >= 4 or MISSING AND Stream IS ONE OF: TECHNICAL, ACCOUNT, SCIENCE AND Imputed: MotherEducation IS ONE OF: NO COMPLETE SECONDARY SCHOOL, PHD AND FamilyInvolved IS ONE OF: NO or MISSING THEN “Maybe” (14%), “No” (86%).
Node = 48 | IF VocationalCourseInteresting >= 5 AND Stream IS ONE OF: ARTS, BUSINESS, ICT, LITERATURE, VOCATIONAL or MISSING AND InterestedField IS ONE OF: CLERICAL/ CIVIL SERVICE, SELF-EMPLOYED, PROFESSIONAL AND Imputed: MotherJob IS ONE OF: SELF-EMPLOYED or MISSING AND Imputed: FatherEducation IS ONE OF: SPM THEN “Yes” (38%), “No” (63%).
Node = 49 | IF VocationalCourseInteresting >= 5 AND Stream IS ONE OF: ARTS, BUSINESS, ICT, LITERATURE, VOCATIONAL or MISSING AND InterestedField IS ONE OF: CLERICAL/ CIVIL SERVICE, SELF-EMPLOYED, PROFESSIONAL AND Imputed: MotherJob IS ONE OF: OTHERS AND Imputed: FatherEducation IS ONE OF: SPM THEN “Yes” (80%), “No” (20%).
Node = 50 | IF VocationalSubjStatus <= 3 or MISSING AND VocationalCourseInteresting >= 4 or MISSING AND TVETStudHigherJobChance <= 3 or MISSING AND Stream IS ONE OF: TECHNICAL, ACCOUNT, SCIENCE AND Imputed: MotherEducation IS ONE OF: SPM, DIPLOMA, DEGREE or MISSING AND FamilyInvolved IS ONE OF: NO or MISSING THEN “Maybe” (93%), “No” (7%).
Node = 55 | IF VocationalCourseInteresting <= 4 or MISSING AND TVETBright >= 3 or MISSING AND Stream IS ONE OF: BUSINESS, LITERATURE or MISSING AND Imputed: MotherEducation IS ONE OF: NO COMPLETE SECONDARY SCHOOL AND Imputed: FatherJob IS ONE OF: OTHERS, SELF-EMPLOYED, MANUFACTURING or MISSING THEN “Maybe” (75%), “No” (25%).
Node = 57 | IF VocationalCourseInteresting <= 4 or MISSING AND TVETBright >= 3 or MISSING AND Stream IS ONE OF: ARTS AND Imputed: MotherEducation IS ONE OF: SPM, NO COMPLETE SECONDARY SCHOOL, DEGREE or MISSING AND Imputed: FatherJob IS ONE OF: OTHERS, SELF-EMPLOYED, MANUFACTURING or MISSING AND Gender IS ONE OF: MALE THEN “Maybe” (8%), “Yes” (17%), “No” (75%).
Node = 64 | IF VocationalSubjStatus >= 4 AND VocationalCourseInteresting >= 4 or MISSING AND TVETStudHigherJobChance <= 3 or MISSING AND Stream IS ONE OF: ACCOUNT AND Imputed: MotherEducation IS ONE OF: SPM, DIPLOMA, DEGREE or MISSING AND FamilyInvolved IS ONE OF: NO or MISSING THEN “Maybe” (32%), “No” (63%).
Node = 65 | IF VocationalSubjStatus >= 4 AND VocationalCourseInteresting >= 4 or MISSING AND TVETStudHigherJobChance <= 3 or MISSING AND Stream equals Missing AND Imputed: MotherEducation IS ONE OF: SPM, DIPLOMA, DEGREE or MISSING AND FamilyInvolved IS ONE OF: NO or MISSING THEN “Maybe” (80%), “No” (20%).
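To illustrate how such rules can be applied to a new respondent, the sketch below encodes only Node 12 and Node 13 of Table 5 as a Python function. The function name, the use of `None` to represent MISSING, and the dictionary output are assumptions made for illustration; this is not the scoring code generated by SAS E-miner.

```python
def classify_low_interest(vocational_course_interesting, stream):
    """Return the class proportions of Node 12 or Node 13, or None if neither rule applies.

    `stream` uses None to stand for a MISSING value (an assumption of this sketch).
    """
    if vocational_course_interesting is not None and vocational_course_interesting <= 3:
        if stream in ("SCIENCE", None):
            return {"Maybe": 0.67, "No": 0.33}  # Node 12
        if stream == "ACCOUNT":
            return {"Maybe": 0.12, "No": 0.88}  # Node 13
    return None  # the remaining nodes of Table 5 are not encoded here


# Hypothetical respondent: low interest in vocational courses, Account stream.
proportions = classify_low_interest(2, "ACCOUNT")
print(max(proportions, key=proportions.get))  # No
```

The predicted class is the one with the largest proportion in the matching leaf, so this hypothetical respondent would be classified as “No”.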

4. Conclusions

In this paper, the best TVET classifier is developed for classifying and predicting the tendency of students to choose TVET after secondary school. The decision tree model can be used to increase the number of TVET students in the future by targeting potential students based on our findings. Since our results reveal that the variable “Stream” has the highest importance, school teachers should play a vital role in raising awareness of students’ capabilities. Other parties should share this responsibility as well. For example, teachers should pay attention to students who are interested in joining TVET and provide them with the information they need. The government must organize more programs or training for students in schools to increase their awareness of TVET’s attractiveness. In addition, non-governmental organizations (NGOs) should provide scholarships or incentives for TVET courses to attract more students to join TVET.

5. Funding Statement

The authors received funding from the Ministry of Education (MOE) through the Fundamental Research Grant Scheme (FRGS) with S/O code: 14868.

6. Acknowledgments

This research was supported by Ministry of Education (MOE) through Fundamental Research Grant Scheme S/O Code 14868 (FRGS/1/2020/SSI0/UUM/02/18). The authors also want to extend their gratitude to the editor of the journal and reviewers for their valuable suggestions on the paper.

References

1. Sulaiman, N. L., & Mohd Salleh, K. (2016). The development of technical and vocational education and training (tvet) profiling for workforce management in Malaysia: Ensuring the validity and reliability of

onal_education_and_training_tvet_profiling_for_workforce_management_in_Malaysia_Ensuring_the_ validity_and_reliability_of_tvet_data

2. UNESCO. (2003). Technical and vocational education and training for the twenty-first century: UNESCO recommendations. France: UNESCO.

3. Sivanandam, H., Rahim, R., Carvalho, M., & Tan, T. (2019, October 11). Budget 2020: Every single sen for education will be used properly, says Maszlee. The Star Online. https://www.thestar.com.my/news/nation/2019/10/11/budget-2020-every-single-sen-for-education-will-be-used-properly-says-maszlee

4. Povera, A. (2019, November 28). Education Ministry to set up database to empower TVET. New Straits Times. https://www.nst.com.my/news/nation/2019/11/542967/education-ministry-set-database-empower-tvet

5. Rajaendram, R. (2019, November 30). TVET committee on the right track. The Star Online. https://www.thestar.com.my/news/nation/2019/11/30/tvet-committee-on-the-right-track

6. Devi, V. (2019, August 28). Demand for TVET grads to increase. The Star Online. https://www.thestar.com.my/metro/metro-news/2019/08/28/demand-for-tvet-grads-to-increase

7. Lim, L. L., Mansor, J., Rahim, M. A., Sazali, N. T., & Izhar, M. K. (2018). The school-to-work transition of young Malaysians. Malaysia: Khazanah Research Institute.

8. Affero, I., & Hassan, R. (2013). Issues and challenges of technical and vocational education & training in Malaysia for knowledge worker driven. National Conference on Engineering Technology. https://doi.org/10.13140/2.1.4555.2961

9. Amedorme, S., & Fiagbe, Y. (2013). Challenges facing technical and vocational education in Ghana. International Journal of Scientific & Technology Research, 2(6), 253-255.

10. Hussin, A., Mohamad, M., Hassan, R., & Omar, A. (2017). Technical vocational education training branding from perspective of stakeholder (parent) in Malaysia. Advanced Science Letters, 23(2), 1216-1219. https://doi.org/10.1166/asl.2017.7543

11. Cheong, K., & Lee, K. (2016). Malaysia's education crisis: can tvet help? Malaysian Journal of Economic Studies, 53(1), 115-134. https://mjes.um.edu.my/article/view/2782

12. Omar, A. (2019). Tvet Malaysia: the way forward. Paper presented at TVET Symposium 2019. https://tvet2019.ppks.edu.my/pdf/TVET%20Malaysia%20%20The%20Way%20Forward.pdf

13. Mohamad, M. M., Saud, M. S., & Ahmad, A. (2009). The need in training and retraining for tvet teachers in Malaysia. Journal of Technical Education and Training, 1(1), 51-57.

14. Ismail, K., Nopiah, Z., Rasul, M., & Leong, P. (2017). Malaysian teachers' competency in technical vocational education and training: A review. Regionalization and Harmonization in TVET. https://doi.org/10.1201/9781315166568-15

15. Quinlan, J. (1986). Induction of Decision Trees. Dordrecht, The Netherlands: Kluwer Academic Publishers.

16. Rokach, L., & Maimon, O. (2015). Data Mining with Decision Trees Theory and Applications. Singapore: World Scientific Publishing Co. Pte. Ltd.

17. Hssina, B., Merbouha, A., Ezzikouri, H., & Erritali, M. (2014). A comparative study of decision tree ID3 and C4.5. International Journal of Advanced Computer Science and Applications, 13-19. https://doi.org/10.14569/SpecialIssue.2014.040203

18. Gulati, P., Sharma, A., & Gupta, M. (2016). Theoretical study of decision tree algorithm to identify pivotal factors for performance improvement: A review. International Journal of Computer Applications, 141(14), 19-25. https://doi.org/10.5120/ijca2016909926

19. Karegowda, A. G., Manjunath, A. S., & Jayaran, M. A. (2010). Comparative study of attribute selection using gain ratio and correlation based feature selection. International Journal of Information Technology and Knowledge Management, 2(2), 271-277. https://www.researchgate.net/publication/228919572_Comparative_study_of_attribute_selection_using_gain_ratio_and_correlation_based_feature_selection

20. Breiman, L., Friedman, J., Olshen, R., & Stone, C. (1984). Classification and regression trees. United States: Wadsworth Books.

21. Kass, G. V. (1980). An exploratory technique for investigating large quantities of categorical data. Journal of the Royal Statistical Society, 29(2), 119-127. https://doi.org/10.2307/2986296

22. Ritschard, G. (2013). CHAID and earlier supervised tree methods. Contemporary Issues in Exploratory Data Mining in Behavioral Sciences, 48-74.

23. Díaz-Pérez, F., & Bethencourt-Cejas, M. (2016). CHAID algorithm as an appropriate analytical method for tourism market segmentation. Journal of Destination Marketing & Management, 5, 275-282. https://doi.org/10.1016/j.jdmm.2016.01.006


24. Galdi, P., & Tagliaferri, R. (2017). Data mining: Accuracy and error measures for classification and prediction. Elsevier: Reference Module in Life Sciences, 1-14.

25. Maer Matei, M., Mocanu, C., & Zamfir, A. (2018). Educational paths in Romania: Choosing general or vocational education. HOLISTICA, 9(2), 127-136.

26. Gonzales, J. G., Lagman, A. C., & De Angel, R. M. (2020). Predictive analytics assessment tool for information communication technology (Ict) courses of technical and vocational education and training (TVET) school in the Philippines. International Journal of Scientific and Technology Research, 9(4), 1282-1287. https://www.ijstr.org/paper-references.php?ref=IJSTR-0420-34089

27. Hassan, R. H., & Awan, S. M. (2019). Identification of trainees enrollment behavior and course selection variables in technical and vocational education training (TVET) program using education data mining. International Journal Modern Education and Computer Science, 10, 14-24. https://doi.org/10.5815/ijmecs.2019.10.02

28. Patidar, P., & Tiwari, A. (2013). Handling missing value in decision tree algorithm. International Journal of Computer Application, 70(13), 31-36.

29. Ghosh, A., Manwani, N., & Sastry, P. S. (2016). On the robustness of decision tree learning under label noise. JMLP: Workshop and Conference Proceedings.

30. John, G. H. (1995). Robust decision trees: Removing outliers from databases. KDD-95 Proceedings, 174-179.

31. Ch’ng, C. K., & Mahat, N. I. (2020). Winsorize tree algorithm for handling outlier in classification problem. International Journal of Operational Research, 38(2), 278-293.