Application of learning to rank in bioinformatics tasks

2.

Wang

JY

,

Sun

Y

,

Gao

X

.

Sparse structure regularized ranking

.

Multimed Tools Appl

2015

;

74

(

2

):

635

–

54

.

3.

He

C

,

Wang

C

,

Zhong

Y

, et al. A survey on learning to rank. In:

International Conference on Machine Learning and Cybernetics, 2008

,

1734

–

9

. IEEE. Kunming, PEOPLES R CHINA.

4.

Li

H

.

A short introduction to learning to rank

.

IEICE T Inf Syst

2011

;

94

(

10

):

1854

–

62

.

5.

Xu

B

,

Lin

HF

,

Lin

Y

, et al. Learning to rank for biomedical information retrieval. In:

Proceedings of 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015

,

464

–

9

. IEEE. Washington, DC.

6.

Jarvelin

K

,

Kekalainen

J

.

Cumulated gain-based evaluation of IR techniques

.

ACM Trans Inf Syst

2002

;

20

(

4

):

422

–

46

. MIT Press. Vancouver, Canada.

7.

Crammer

K

,

Singer

Y

. Pranking with ranking. In:

Advances in Neural Information Processing Systems

,

2001

,

641

–

7

.

8.

Caruana

R

,

Baluja

S

,

Mitchell

TM

, et al.

Using the future to ‘sort out’ the present: Rankprop and multitask learning for medical risk evaluation

.

Adv Neural Inf Process Syst

1999

;

8

:

959

–

65

.

9.

Burges

CJ

,

Ragno

RJ

,

Le

QV

. Learning to rank with nonsmooth cost functions. In:

International Conference on Neural Information Processing Systems, 2006

,

193

–

200

. MIT Press. Vancouver, Canada

10.

Herbrich

R

,

Graepel

T

,

Obermayer

K

.

Large margin rank boundaries for ordinal regression

.

Adv Neural Inf Process Syst

2000

;

88

:

115

–

132

.

11.

Cao

Z

,

Qin

T

,

Liu

T

, et al. Learning to rank: from pairwise approach to listwise approach. In:

International Conference on Machine Learning, 2007

,

129

–

36

. Association for Computing Machinery, New York, NY, United States. Corvalis Oregon.

12.

Joachims

T

. Optimizing search engines using clickthrough data. In:

Knowledge Discovery and Data Mining

,

2002

,

133

–

42

. Association for Computing Machinery, New York, NY, United States. Edmonton Alberta Canada.

13.

Mork

J

,

Jimenoyepes

A

,

Aronson

A

, et al.

The NLM medical text indexer system for indexing biomedical literature

. In:

Proceedings of BioASQ CLEF

. CEUR Workshop Proceedings. Valencia, Spain.

2013

(

1094

).

14.

Trieschnigg

D

,

Pezik

P

,

Lee

V

, et al.

MeSH up: effective MeSH text classification for improved document retrieval

.

Bioinformatics

2009

;

25

(

11

):

1412

–

8

.

15.

Sohn

S

,

Kim

W

,

Comeau

DC

, et al.

Optimal training sets for Bayesian prediction of MeSH (R) assignment

.

J Am Med Inform Assoc

2008

;

15

(

4

):

546

–

53

.

16.

Ruch

P

.

Automatic assignment of biomedical categories: toward a generic approach

.

Bioinformatics

2006

;

22

(

6

):

658

–

64

.

17.

Aronson

AR

.

Effective mapping of biomedical text to the UMLS metathesaurus: the MetaMap program

.

J Am Med Inform Assoc

2001

:

17

–

21

.

18.

Kim

W

,

Aronson

AR

,

Wilbur

WJ

.

Automatic MeSH term assignment and quality assessment

.

J Am Med Inform Assoc

2001

;

8

(1):

319

–

23

.

19.

Aronson

AR

,

Mork

JG

,

Gay

CW

, et al. The NLM indexing initiative’s medical text indexer. In:

Medinfo 2004: Proceedings of the 11th World Congress on Medical Informatics

,

2004

,

268

–

72

. IOS Press, Nieuwe Hemweg 6B, 1013 BG Amsterdam, Netherlands. San Francisco, CA.

20.

Huang

M

,

Neveol

A

,

Lu

Z

.

Recommending MeSH terms for annotating biomedical articles

.

J Am Med Inform Assoc

2011

;

18

(

5

):

660

–

7

.

21.

Mao

Y

,

Lu

Z

. NCBI at the 2013 BioASQ challenge task: learning to rank for automatic MeSH indexing.

Technical Report

,

2013

.

22.

Mao

Y

,

Lu

Z

.

MeSH now: automatic MeSH indexing at PubMed scale via learning to rank

.

J Biomed Semantics

2017

;

8

(

1

):

15

.

23.

Liu

K

,

Peng

S

,

Wu

J

, et al.

MeSHLabeler: improving the accuracy of large-scale MeSH indexing by integrating diverse evidence

.

Bioinformatics

2015

;

31

(

12

):

i339

–

47

.

24.

Peng

S

,

You

R

,

Wang

H

, et al.

DeepMeSH: deep semantic representation for improving large-scale MeSH indexing

.

Bioinformatics

2016

;

32

(

12

):

i70

–

9

.

25.

Dai

S

,

You

R

,

Lu

Z

, et al.

FullMeSH: improving large-scale MeSH indexing with full text

.

Bioinformatics

2020

;

36

(

5

):

1533

–

41

.

26.

Murzin

AG

,

Brenner

SE

,

Hubbard

T

, et al.

SCOP: a structural classification of proteins database for the investigation of sequences and structures

.

J Mol Biol

1995

;

247

(

4

):

536

–

40

.

27.

Chen

J

,

Guo

M

,

Wang

X

, et al.

A comprehensive review and comparison of different computational methods for protein remote homology detection

.

Brief Bioinform

2018

;

19

(

2

):

231

–

44

.

28.

Wang

JY

,

Bensmail

H

,

Gao

X

.

Multiple graph regularized protein domain ranking

.

BMC Bioinformatics

2012

;

13

(

1

):

307

.

29.

Altschul

SF

,

Gish

W

,

Miller

W

, et al.

Basic local alignment search tool

.

J Mol Biol

1990

;

215

(

3

):

403

–

10

.

30.

Altschul

SF

,

Madden

TL

,

Schaffer

AA

, et al.

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs

.

Nucleic Acids Res

1997

;

25

(

17

):

3389

–

402

.

31.

Kuang

R

,

Weston

J

,

Noble

WS

, et al.

Motif-based protein ranking by network propagation

.

Bioinformatics

2005

;

21

(

19

):

3711

–

8

.

32.

Weston

J

,

Elisseeff

A

,

Zhou

DY

, et al.

Protein ranking: from local to global structure in the protein similarity network

.

Proc Natl Acad Sci U S A

2004

;

101

(

17

):

6559

–

63

.

33.

Melvin

I

,

Weston

J

,

Leslie

C

, et al.

RANKPROP: a web server for protein remote homology detection

.

Bioinformatics

2009

;

25

(

1

):

121

–

2

.

34.

Liu

B

,

Chen

J

,

Wang

X

.

Application of learning to rank to protein remote homology detection

.

Bioinformatics

2015

;

31

(

21

):

3492

–

8

.

35.

Liu

B

,

Xu

JH

,

Zou

Q

, et al.

Using distances between top-n-gram and residue pairs for protein remote homology detection

.

BMC Bioinformatics

2014

;

15

(

2

):

1

–

10

.

36.

Liu

B

,

Zhang

DY

,

Xu

RF

, et al.

Combining evolutionary information extracted from frequency profiles with sequence-based kernels for protein remote homology detection

.

Bioinformatics

2014

;

30

(

4

):

472

–

9

.

37.

Liu

B

,

Chen

JJ

,

Wang

XL

.

Protein remote homology detection by combining Chou’s distance-pair pseudo amino acid composition and principal component analysis

.

Mol Genet Genomics

2015

;

290

(

5

):

1919

–

31

.

38.

Chen

JJ

,

Liu

BQ

,

Huang

D

.

Protein remote homology detection based on an ensemble learning approach

.

Biomed Res Int

2016

:

5813645

–

5

.

39.

Liu

B

,

Chen

JJ

,

Wang

SY

.

Protein remote homology detection by combining pseudo dimer composition with an ensemble learning method

.

Curr Proteomics

2016

;

13

(

2

):

86

–

91

.

40.

Chen

JJ

,

Long

R

,

Wang

XL

, et al.

dRHP-PseRA: detecting remote homology proteins using profile-based pseudo protein sequence and rank aggregation

.

Sci Rep

2016

;

6

(

1

):

32333

–

3

.

41.

Chen

J

,

Guo

M

,

Li

S

, et al.

ProtDec-LTR2.0: an improved method for protein remote homology detection by combining pseudo protein and supervised learning to rank

.

Bioinformatics

2017

;

33

(

21

):

3473

–

6

.

42.

Liu

B

,

Zhu

Y

.

ProtDec-LTR3.0: protein remote homology detection by incorporating profile-based features into learning to rank

.

IEEE Access

2019

;

7

:

102499

–

507

.

43.

Piana

S

,

Klepeis

JL

,

Shaw

DE

.

Assessing the accuracy of physical models used in protein-folding simulations: quantitative evidence from long molecular dynamics simulations

.

Curr Opin Struct Biol

2014

;

24

:

98

–

105

.

44.

Marks

DS

,

Hopf

TA

,

Sander

C

.

Protein structure prediction from sequence variation

.

Nat Biotechnol

2012

;

30

(

11

):

1072

–

80

.

45.

Jing

X

,

Dong

Q

,

Lu

R

.

RRCRank: a fusion method using rank strategy for residue-residue contact prediction

.

BMC Bioinformatics

2017

;

18

(

1

):

390

.

46.

Zhang

Y

.

Protein structure prediction: when is it useful?

Curr Opin Struct Biol

2009

;

19

(

2

):

145

–

55

.

47.

Ghosh

S

,

Vishveshwara

S

.

Ranking the quality of protein structure models using sidechain based network properties

.

F1000Research

2014

;

3

:

17

–

7

.

48.

Pawlowski

M

,

Kozlowski

L

,

Kloczkowski

A

.

MQAPsingle: a quasi single-model approach for estimation of the quality of individual protein structure models

.

Proteins

2016

;

84

(

8

):

1021

–

8

.

49.

Wang

QG

,

Shang

C

,

Xu

D

, et al.

New Mds and clustering based algorithms for protein model quality assessment and selection

.

Int J Art Intell Tools

2013

;

22

(

5

):

1360006

–

6

.

50.

Jing

X

,

Dong

Q

.

MQAPRank: improved global protein model quality assessment by learning-to-rank

.

BMC Bioinformatics

2017

;

18

(

1

):

1

–

8

.

51.

Sleator

RD

,

Walsh

P

.

An overview of in silico protein function prediction

.

Arch Microbiol

2010

;

192

(

3

):

151

–

5

.

52.

Hamp

T

,

Kassner

R

,

Seemayer

S

, et al.

Homology-based inference sets the bar high for protein function prediction

.

BMC Bioinformatics

2013

;

14

(

3

):

1

–

10

.

53.

Gillis

J

,

Pavlidis

P

.

Characterizing the state of the art in the computational assignment of gene function: lessons from the first critical assessment of functional annotation (CAFA)

.

BMC Bioinformatics

2013

;

14

(

3

):

1

–

12

.

54.

Ashburner

M

,

Ball

CA

,

Blake

JA

, et al.

Gene ontology: tool for the unification of biology. The gene ontology consortium

.

Nat Genet

2000

;

25

(

1

):

25

–

9

.

55.

You

R

,

Zhang

Z

,

Xiong

Y

, et al.

GOLabeler: improving sequence-based large-scale protein function prediction by learning to rank

.

Bioinformatics

2018

;

34

(

14

):

2465

–

73

.

56.

You

R

,

Yao

S

,

Xiong

Y

, et al.

NetGO: improving large-scale protein function prediction with massive network information

.

Nucleic Acids Res

2019

;

47

(

1

):

379

–

W387

.

57.

Stock

M

,

Fober

T

,

Hullermeier

E

, et al.

Identification of functionally related enzymes by learning-to-rank methods

.

IEEE/ACM Trans Comput Biol Bioinform

2014

;

11

(

6

):

1157

–

69

.

58.

Ding

H

,

Takigawa

I

,

Mamitsuka

H

, et al.

Similarity-based machine learning methods for predicting drug-target interactions: a brief review

.

Brief Bioinform

2014

;

15

(

5

):

734

–

47

.

59.

Xiao

X

,

Min

JL

,

Wang

P

, et al.

iGPCR-drug: a web server for predicting interaction between GPCRs and drugs in cellular networking

.

PLoS One

2013

;

8

(

8

):

e72234

.

60.

Luo

YA

,

Zhao

XB

,

Zhou

JT

, et al.

A network integration approach for drug-target interaction prediction and computational drug repositioning from heterogeneous information

.

Nat Commun

2017

;

8

(

1

):

573

–

3

.

61.

Hu

J

,

Li

Y

,

Yang

JY

, et al.

GPCR-drug interactions prediction using random forest with drug-association-matrix-based post-processing procedure

.

Comput Biol Chem

2016

;

60

:

59

–

71

.

62.

Agarwal

S

,

Dugar

D

,

Sengupta

S

.

Ranking chemical structures for drug discovery: a new machine learning approach

.

J Chem Inf Model

2010

;

50

(

5

):

716

–

31

.

63.

Rathke

F

,

Hansen

K

,

Brefeld

U

, et al.

StructRank: a new approach for ligand-based virtual screening

.

J Chem Inf Model

2011

;

51

(

1

):

83

–

92

.

64.

Ohue

M

,

Suzuki

SD

,

Akiyama

Y

.

Learning-to-rank technique based on ignoring meaningless ranking orders between compounds

.

J Mol Graph Model

2019

;

92

:

192

–

200

.

65.

Liu

J

,

Ning

X

.

Multi-assay-based compound prioritization via assistance utilization: a machine learning framework

.

J Chem Inf Model

2017

;

57

(

3

):

484

–

98

.

66.

Zhang

W

,

Ji

L

,

Chen

Y

, et al.

When drug discovery meets web search: learning to rank for ligand-based virtual screening

.

J Chem

2015

;

7

(

1

):

5

–

5

.

67.

Suzuki

SD

,

Ohue

M

,

Akiyama

Y

.

PKRank: a novel learning-to-rank method for ligand-based virtual screening using pairwise kernel and RankSVM

.

Artif Life Robotics

2018

;

23

(

2

):

205

–

12

.

68.

Dorr

A

,

Rosenbaum

L

,

Zell

A

.

A ranking method for the concurrent learning of compounds with various activity profiles

.

J Chem

2015

;

7

(

1

):

2

–

2

.

69.

Liu

J

,

Ning

X

.

Differential compound prioritization via bidirectional selectivity push with power

.

J Chem Inf Model

2017

;

57

(

12

):

2958

–

75

.

70.

Rahangdale

A

,

Raut

S

. Gene-expression based predictor for drug selection and prioritization using learning-to-rank. In:

International Conference on Bioinformatics, 2018

. IEEE, 345 E 47th st, New York, NY 10017 USA. Allahabad, India.

71.

He

Y

,

Liu

J

,

Ning

X

.

Drug selection via joint push and learning to rank

.

IEEE/ACM Trans Comput Biol Bioinform

2020

;

17

(

1

):

110

–

23

.

72.

Yuan

Q

,

Gao

J

,

Wu

D

, et al.

DrugE-rank: improving drug-target interaction prediction of new candidate drugs or targets by ensemble learning to rank

.

Bioinformatics

2016

;

32

(

12

):

i18

–

27

.

73.

Ru

X

,

Wang

L

,

Li

L

, et al.

Exploration of the correlation between GPCRs and drugs based on a learning to rank algorithm

.

Comput Biol Med

2020

;

119

:

103660

.

74.

Shivani

A

,

Shiladitya

S

. Ranking genes by relevance to a disease. In:

Proceedings of the 8th International Conference on Computational Systems Bioinformatics, 2009

, Vol.

8

,

37

–

46

. CBS 2009 On-line Proceedings. San Francisco, United States.

75.

Lee

PF

,

Soo

VW

. An ensemble rank learning approach for gene prioritization. In:

Conference Proceedings: Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013

,

3507

–

10

. IEEE, 345 E 47th st, New York, NY 10017 USA. Osaka, Japan.

76.

Raj

MR

,

Sreeja

A

.

Analysis of computational gene prioritization approaches

.

Procedia Comput Sci

2018

;

143

:

395

–

410

.

77.

Qeli

E

,

Omasits

U

,

Goetze

S

, et al.

Improved prediction of peptide detectability for targeted proteomics using a rank-based algorithm and organism-specific data

.

J Proteomics

2014

;

108

:

269

–

83

.

78.

Leaman

R

,

Islamaj Dogan

R

,

Lu

Z

.

DNorm: disease name normalization with pairwise learning to rank

.

Bioinformatics

2013

;

29

(

22

):

2909

–

17

.

79.

Wu

JJ

,

Huang

JX

,

Ye

Z

.

Learning to rank diversified results for biomedical information retrieval from multiple features

.

Biomed Eng Online

2014

;

13

(

2

):

1

–

10

.

80.

Shang

Y

,

Hao

HH

,

Wu

JJ

, et al.

Learning to rank-based gene summary extraction

.

BMC Bioinformatics

2014

;

15

(

12

):

1

–

8

.

81.

Guan

W

,

Ozakin

A

,

Gray

A

, et al.

Learning protein folding energy functions

. In:

International Conference on Data Mining, 2011

.

1062

–

7

. IEEE Computer Society. Vancouver, Canada.

82.

Chen

Z

,

Zhao

P

,

Li

FY

, et al.

iFeature: a python package and web server for features extraction and selection from protein and peptide sequences

.

Bioinformatics

2018

;

34

(

14

):

2499

–

502

.

83.

Liu

B

,

Liu

FL

,

Wang

XL

, et al.

Pse-in-one: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences

.

Nucleic Acids Res

2015

;

43

(

W1

):

W65

–

71

.

84.

Lovric

M

,

Molero

JM

,

Kern

R

.

PySpark and RDKit: moving towards big data in cheminformatics

.

Mol Inform

2019

;

38

(

6

):

4

.

85.

Wang

JY

,

Cui

X

,

Yu

G

, et al.

When sparse coding meets ranking: a joint framework for learning sparse codes and ranking scores

.

Neural Comput Appl

2019

;

31

(

3

):

701

–

10

.

86.

Li

Y

,

Kuwahara

H

,

Yang

P

, et al.

PGCN: disease gene prioritization by disease and gene embedding through graph convolutional neural networks

.

bioRxiv

2019

;

00

:

532226

.