Applied Intelligence (2018) 48:1083–1085 https://doi.org/10.1007/s10489-018-1143-0 Editorial for the special issue: Knowledge-based systems and data science Hamido Fujita1 · Ali Selamat2 Published online: 7 February 2018 © Springer Science+Business Media, LLC, part of Springer Nature 2018 Intelligence displayed by machines has evolved from the- oretical research to vital process re-improvements this very decade, where more and more industrial progression relied on the capability of machines perceiving and learn- ing from surrounding environments, assisting in succession of given tasks. The advanced progressing of computa- tional intelligence has promoted design and implementation of multifaceted solutions for complex industrial problems that eventually benefits changeling applications in multi- disciplinarily fields. This special issue highlights the pre- sentation of recent theoretical, operational and real-world applications on multiple industries benefitted through the implementation of artificial intelligence techniques in a vast array of domains including network security, hardware and software, data structures, ranking predictions, pattern recog- nition, medical tools, ubiquitous computing, education and else The papers in this special issue have been selected from the extended versions of the 29th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA/AIE 2016, Morioka, Japan) and the submitted papers through a public call. All papers have been peer-reviewed according to the journal standards. The accepted papers in this special issue are classified into three groups. The first group focuses on the area of security, regres- sive methods and algorithm design. The paper titled “Vector Based Genetic Algorithm (VBGA) to optimize predictive analysis in network security” proposes a new Intrusion Detection System (IDS) for network security, making use of a Vector-Based Genetic Algorithm (VBGA) inspired by � Hamido Fujita HFujita-799@acm.org Ali Selamat aselamat@utm.my 1 Iwate Prefecture University, Takizawa, Iwate, Japan 2 Universiti Teknologi Malaysia, Iskandar Puteri, Johor, Malaysia evolutionary approaches. The novelty in the algorithm is to represent chromosomes as vectors and training data as matrices. This approach allows multiple pathways to cal- culate fitness function out of which one particular method- ology is used and tested. The paper titled “Anomaly Detection using Piecewise Aggregate Approximation in the Amplitude” proposes a representation method called the Piecewise Aggregate Approximation in the Amplitude Domain (AD-PAA). For discovering anomalies, a sequence is partitioned into subsequences by a sliding window firstly. Then in the AD-PAA method, a subsequence is divided into equal-sized subsections according to the amplitude domain. With mean values of subsections computed, the amplitude oscillation of a subsequence is embodied effectively. The paper titled “Supervised Ranking Framework for Rela- tionship Prediction in Heterogeneous Information Net- works” proposes a novel frame work named Supervised Ranking frame- work (S-Rank) is proposed to tackle exist- ing approaches. To avoid the class imbalance problem, the S-Rank framework treated the relationship prediction problem as a ranking task and divide it into three phases: Supervised PageRank strategy (SPR), utilization of Meta Path-based Ranking method (MPR) and integration of two ranking scores combining all the Attr-info, Global-info and Local-info together. Experiments on DBLP data demon- strate that the proposed S-Rank framework can effectively take advantage of all the three kinds of information for rela- tionship prediction over HINs and outperforms other well- known baseline approaches. The paper titled “Challenging State-of-the-Art Move Ordering with Adaptive Data Structures” presents a comparison to two well-known, acclaimed strategies, which operate on a similar philoso- phy to the History-ADS, namely, the History Heuristic, and the Killer Moves technique. In this work, a mechanism by which these established move ordering strategies can be approximated, or directly implemented, in terms of ADSs. In a wide range of two-player and multi-player games, at various points in the game’s progression, the History-ADS performs at least as well as presented strategies (in the arti- cle), and, in fact, outperforms them in the majority of cases. http://crossmark.crossref.org/dialog/?doi=10.1007/s10489-018-1143-0&domain=pdf mailto:HFujita-799@acm.org mailto:aselamat@utm.my 1084 H. Fujita, A. Selamat The paper titled “ETARM: An Efficient Top-K Associa- tion Rule Mining Algorithm” presents a novel algorithm named ETARM (Efficient Top-k Association Rule Miner) to efficiently find the complete set of top-k association rules. The proposed algorithm integrates two novel candi- date pruning properties to more effectively reduce the search space. These properties are applied during the candidate selection process to identify items that should not be used to expand a rule based on its confidence, to reduce the num- ber of candidates. An extensive experimental evaluation on six standard benchmark datasets show that the proposed approach outperforms the state- of-the-art TopKRules algo- rithm both in terms of runtime and memory usage. The paper titled “SpringBoard: game-agnostic tool for sce- nario editing with meta-programming support” presents the development of SpringBoard, an open source scenario editor for games using the SpringRTS engine. Extending it to support game and level requirements is achieved with multi-level meta-programming, while still providing a sys- tem that is integrated with the GUI editor and therefore intuitive to use. The second group of articles in this special issue demon- strates machine learning and analytical solutions. The paper titled “Successes and challenges in developing a hybrid approach to sentiment analysis” covers some success and learning experiences attained during the devel- oping of a hybrid approach to Sentiment Analysis (SA) based on a Sentiment Lexicon, Semantic Rules, Negation Handling, Ambiguity Management and Linguistic Vari- ables. The proposed hybrid method is presented and applied to two selected datasets: Movie Review and Sentiment Twit- ter datasets. The achieved results are compared against those obtained when Naı̈ve Bayes (NB) and Maximum Entropy (ME) supervised machine learning classification methods are used for the same datasets. The proposed hybrid system attained higher accuracy and precision scores than NB and ME, which shows its superiority when applied to the SA problem at the sentence level. The paper titled “Online Sig- nature Verification by Spectrogram Analysis” proposes a novel online signature analysis methodology for touch- screens that starts with signing an interface consisting of a signature silhouette. The frequency spectrum along the signing process is stealthily extracted and spectrograms are created by short-time Fourier transforms. Since the spec- trograms are found as RGB images, providing valuable information about frequency vs time, grid histograms are formed by quantization for the real signature sample. Given the discrimination purposes, a fuzzified surface is designed for computing closeness of grid histograms. The paper titled “A View-invariant Gait Recognition Algorithm based on a Joint-Direct Linear Discriminant Analysis” pro- poses a view-invariant gait recognition algorithm, which builds a unique view invariant model taking advantages of the dimensionality reduction provided by the Direct Linear Discriminant Analysis (DLDA). Proposed scheme is able to reduce the under-sampling problem (USP) that appears usu- ally when the number of training samples is much smaller than the dimension of the feature space. In the paper titled “Hybrid sentiment classification on twitter aspect-based sentiment analysis”, a new hybrid approach for twitter aspect-based sentiment classification is proposed by embed- ding the feature selection method. The comparison of classi- fication accuracy using the feature selection method Princi- pal Component Analysis (PCA), Latent Semantic Analysis (LSA), and Random Projection (RP) have been presented in this paper. Furthermore, the evaluation with different clas- sification algorithms also demonstrates that our new hybrid approach produced meaningful results with our own Hate Crime Twitter Sentiment (HCTS) dataset and also Stanford Twitter Sentiment (STS) and Sanders Twitter Corpus (STC) benchmark dataset which represent different domain. The paper titled “A Fuzzy Classification Model for Myocar- dial Infarction Risk Assessment” presented the develop- ment of a new clinical decision support system (CDSS) to diagnose Coronary Artery Diseases (CAD). In this paper, an automatic approach to design CDSS for CAD assess- ment is proposed. The proposed diagnosis model is based on Random Forest algorithm, C5.0 decision tree algorithm and Fuzzy modeling. The experimental results underscore very promising classification accuracy of 90.50% while optimizing training time using UCI (the University of Cal- ifornia at Irvine) heart diseases datasets compared to the previously reported results. The paper titled “Agent Sys- tems Verification and Validation: Systematic Literature Review” presented a study that surveys existing techniques for checking agent properties and detecting faults during design, development, and runtime phases of agent system life-cycle are presented with the aim to identify implementa- tion gap and to derive research direction. The paper “Some Bibliometric Procedures for Analyzing and Evaluating Research Fields” has presented new results on bibliometric analytics that can provide a good tool to scientific research predictions. The third group in this special issue presented works done in industrial applications. The paper titled “Convolutional Neural Network Acceleration with Hardware/Software Co-Design” presents a design methodology for accelerating CNNs using Hardware/Software Co-design techniques, in order to balance performance and exibility, particularly for resource-constrained systems. The methodology is applied to a gender recognition case study, using an ARM processor and FPGA fabric to create an embedded system that can process facial images in real-time. The paper titled “Design and implementation of a web-based fuzzy expert system for diagnosing depressive disorder” proposes a design a web-based expert system for diagnosing depression using Editorial for the special issue: Knowledge-based systems and data science 1085 the fuzzy Delphi method by estimating the weights and im- portance of depression symptoms. Fuzzy Logic is adopted to calculate the level of depression. The proposed system appears helpful for everyone, from ordinary persons to specialists in medical environments. It can also be useful to train psychology students in the area of diagnostic reasoning. In the paper titled “Overlapping community detection based on discrete biogeography optimization”, an overlapping community detection algorithm based on biogeography optimization is proposed. Experiments on benchmark test data, including two synthetic networks and four real-world networks, show that the proposed algorithm can achieve results with better accuracy and stability than the compared evolutionary algorithms. The paper titled “Mining Constrained Inter-Sequence Patterns: A Novel Approach to Cope with Item Constraints” proposes two novel algorithms for it. First is the ISP- IC (Inter-Sequence Pattern with Item Constraint mining) algorithm based on a theorem that quickly determines whether an inter-sequence pattern satisfies the constraints. Then, we propose a way to improve the strategy of ISP- IC, which is then applied to the iISP-IC algorithm to enhance the performance of the process. Finally, piISP-IC, a parallel version of iISP-IC, will be presented. Experimental results show that piISP-IC algorithm outperforms the post-processing of the-state-of-the-art method for mining inter-sequence patterns (EISP-Miner), ISP-IC, and iISP- IC algorithms in most of the cases. The paper titled “A multi-level approach to ubiquitous modeling and solving constraints in combinatorial optimization problems in production and distribution” presents a multi-level approach to the modeling and solving of combinatorial optimization problems. It is versatile and effective owing to the use of multi-level pre-solving and multiple paradigms, such as constraint programming, logic programming, mathematical programming, and fuzzy logic, for their complementary strengths. The capability of this framework and its advantage over mathematical programming alone or over hybrid frameworks is shown in the illustrative example, in which combinatorial optimization is used as a benchmark to prove the effectiveness of the proposed approach. The paper titled “A novel three-band orthogonal wavelet filter bank method for an automated identification of alcoholic EEG signals” presented an automated system for the classification of alcoholic and normal EEG signals using a recently designed duration-bandwidth product (DBP) optimized three-band orthogonal wavelet filter bank (TBOWFB) and log-energy (LE). The results have achieved the classification accuracy (CA) of 97.08%, with ten-fold cross validation strategy. The proposed model presents a promising performance, and therefore it can be used in a practical setup to assist the medical professional in the diagnosis of alcoholism using EEG signals automatically. The paper titled “Automated detection of retinal health using PHOG and SURF features extracted from fundus images” proposed a novel approach to develop an automated retinal health screening system in this work. This paper discusses a retinal screening system to automatically differentiate normal image from abnormal (AMD, DR, and glaucoma) fundus images. The presented results achieved 96.21% accuracy, 95.00% sensitivity, and 97.42% specificity with ten-fold cross-validation strategy using k- nearest neighbor (kNN) classifier. This novel algorithm has high potential in the diagnosis of normal eye during the mass eye screening session or in polyclinics quickly and reliably. The paper “The Promotion Strategy of Supply Chain Flexibility Based on Deep Belief Network” has provided an intelligent flexible model by considering the supply side and demand side in uncertain environment using belief network. The editors hope that this issue will provide valuable knowledge, ideas, and state-of-the-art research propagation in applied intelligence specifically in reshaping industrial advancement through constant improvements of informative intelligence. With much appreciation to the contributors of this special issue, the editors hope that the hard work and efforts will be made useful for a conclusive betterment of computational industrial solutions. Editorial for the special issue: Knowledge-based systems and data science