TY - JOUR
T1 - An experimental evaluation of extreme learning machines on several hardware devices
AU - Li, Liang
AU - Wang, Guoren
AU - Wu, Gang
AU - Zhang, Qi
N1 - Publisher Copyright:
© 2019, Springer-Verlag London Ltd., part of Springer Nature.
PY - 2020/9/1
Y1 - 2020/9/1
N2 - As an important learning algorithm, the extreme learning machine (ELM) is known for its excellent learning speed. As ELM's applications in classification and regression expand, the demand for its real-time performance is increasing. Although hardware acceleration is an obvious solution, how to select the appropriate acceleration hardware for ELM-based applications deserves further discussion. For this purpose, we designed and evaluated optimized ELM algorithms on three kinds of state-of-the-art acceleration hardware, i.e., the multi-core CPU, the Graphics Processing Unit (GPU), and the Field-Programmable Gate Array (FPGA), all of which are well suited to matrix multiplication optimization. The experimental results show that these optimized algorithms achieve speedup ratios of 10–800 on the acceleration hardware. We therefore suggest (1) using the GPU to accelerate ELM algorithms for large datasets and (2) using the FPGA for small datasets because of its lower power consumption, especially in embedded applications. We have also released our source code.
KW - Extreme learning machine
KW - FPGA
KW - GPU
KW - Hardware
KW - Multi-core
UR - http://www.scopus.com/inward/record.url?scp=85073833462&partnerID=8YFLogxK
DO - 10.1007/s00521-019-04481-6
M3 - Article
AN - SCOPUS:85073833462
SN - 0941-0643
VL - 32
SP - 14385
EP - 14397
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 18
ER -