key: cord-0020224-uddx1hmu
title: Deep Learning Application for Analyzing of Constituents and Their Correlations in the Interpretations of Medical Images
authors: Ursuleanu, Tudor Florin; Luca, Andreea Roxana; Gheorghe, Liliana; Grigorovici, Roxana; Iancu, Stefan; Hlusneac, Maria; Preda, Cristina; Grigorovici, Alexandru
date: 2021-07-30
journal: Diagnostics (Basel)
DOI: 10.3390/diagnostics11081373
sha: eadf330184ff70c68bc3eba96d960b1f18d33e62
doc_id: 20224
cord_uid: uddx1hmu

The time and attention a doctor can give each patient is strained by the growing volume of medical data that must be interpreted and filtered for diagnostic and therapeutic purposes; this has encouraged the development of deep learning (DL) models as constructive and effective support. DL has developed exponentially in recent years, with a major impact on the interpretation of medical images. This has driven the growth, diversification and quality of scientific data, the development of knowledge-construction methods and the improvement of the DL models used in medical applications. Most research papers focus on describing, highlighting or classifying a single constituent element of the DL models used in the interpretation of medical images, and do not provide a unified picture of the importance and impact of each constituent on model performance. The novelty of our paper consists primarily in its unitary treatment of the constituent elements of DL models, namely the data, the tools used by DL architectures or purpose-built combinations of DL architectures, and in highlighting their "key" features for completing tasks in current medical image interpretation applications. Using the "key" characteristics specific to each constituent of DL models, and correctly determining their correlations, may be the subject of future research aimed at increasing the performance of DL models in the interpretation of medical images.

The performance of deep learning (DL) architectures has improved continuously through increases in the number, quality and diversity of data resources comparable to medical data, through specific methods for integrating data into DL models according to the objectives for which they were built, and through refinement of the construction of DL models used in medical applications. DL has developed exponentially in medicine, and its applications to the interpretation of medical images are still evolving. DL has achieved strong results in the diagnosis, classification, detection, segmentation and reconstruction of medical images [1], as well as in correlating image-based diagnosis with patient survival, predicting new directions of development [2]. The novelty of our paper consists in the unitary treatment of the constituent elements of DL models, namely the data, the tools used by DL architectures, and purpose-built combinations of DL architectures.

Medical data (image types, time-series images, audio-video data) are unstructured information that requires labeling; they make data extraction difficult because they carry high levels of noise and variability, so classical deep learning architectures achieve low performance in the interpretation of medical images.
The interpretation of medical images in diagnostic radiology using deep learning architectures has applications in cancer diagnosis, with satisfactory results in the detection of breast cancer, lung cancer, glaucoma and skin cancer. CT, PET-CT, MRI, X-ray, ultrasound, diagnostic biopsy, mammography and spectrography are the imaging and exploratory investigations most used in image interpretation, for extracting characteristics, reducing or enlarging image size, and grouping, segmenting and classifying images; through integration methods, they contribute to the performance of deep learning models (see Figure 2) [3]. Acronyms: MRI, magnetic resonance imaging; CT, computed tomography; SLO, scanning laser ophthalmoscopy; ChestX-ray14, chest X-ray dataset for weakly-supervised classification and localization of common thorax diseases.

Larger datasets, compared with the small size of many medical datasets, result in better deep learning models [4]. Large, well-annotated datasets include ImageNet, COCO and (open-source) medical datasets (see Figure 3).
Acronyms: MRI, magnetic resonance imaging; CT, computed tomography; SLO, scanning laser ophthalmoscopy; ADNI, the Alzheimer's Disease Neuroimaging Initiative; ACDC, Automated Cardiac Diagnosis Challenge; ABIDE, the Autism Brain Imaging Data Exchange; ChestX-ray14, hospital-scale chest X-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases; LIDC-IDRI, the Lung Image Database Consortium and Image Database Resource Initiative; LUNA16, algorithms for automatic detection of pulmonary nodules in computed tomography images; MURA, large dataset for abnormality detection in musculoskeletal radiographs; BraTS2018, machine learning algorithms for brain tumor segmentation, progression assessment and overall survival prediction in the BraTS challenge; STARE, locating blood vessels in retinal images; DDSM, Digital Database for Screening Mammography; DeepLesion, automated mining of large-scale lesion annotations and universal lesion detection with deep learning; Cardiac MRI, cardiac magnetic resonance images; ISIC, International Skin Imaging Collaboration.

The knowledge of experienced clinical imagists, who follow certain characteristics in images (contrast, color, appearance, topology, shape, edges, etc.), contributes to the performance of medical image interpretation through deep learning models, namely anomaly detection by identifying characteristics in the image, image segmentation, image reconstruction, and the combination of two different images into one [5].

The knowledge of imaging doctors can be classified as follows: 1. high-level medical data (diagnostic patterns); 2. low-level medical data (image regions, disease characteristics); 3. specific data identified by doctors (attention maps, hand-highlighted features) [9]. The type and volume of medical data, the labels, the category of field knowledge and the methods of their integration into DL architectures implicitly determine performance in medical applications. We further present the types of medical images and data used in diagnosis-classification, segmentation, detection, reconstruction, retrieval and, respectively, the generation of medical reports.
Natural images: from natural datasets such as ImageNet (over 14 million images tagged in more than 20,000 categories) and COCO (over 200,000 annotated images in 80 categories). Medical images: from medical datasets of the same disease acquired in similar or different ways, or of different diseases [10]. High-level medical data (diagnostic patterns) and low-level medical data (image regions, disease characteristics), together with specific data identified by doctors (attention maps, hand-highlighted features), increase the diagnostic performance of deep learning networks (no comparative studies have been conducted). Large natural-image datasets (ImageNet) are incorporated for the detection of characteristics in medical images; both natural and medical images are used in multiple applications. Among multi-modal medical images, PET images are incorporated for the detection of lesions in CT scans.

Natural images: ImageNet, the PASCAL VOC "static data" set, the Sports-1M video dataset [11]. Medical images (CT, MRI, Angio-CT, fundus images, annotated retinal images) are used in multiple applications. External medical data and images of other diseases: the 3DSeg-8 dataset [12]. High-level and low-level medical data, e.g., anatomical aspects of the image, shape, position and typology of lesions, are integrated into segmentation tasks; an example is the ISBI 2017 dataset used in skin lesion segmentation. Many applications use additional data, with satisfactory results, to improve CT image segmentation tasks and, in turn, applications that use MRI [13]. Medical data from doctors, hand-crafted and hand-highlighted features, are first processed from the reference images; such features are used with the BRATS2015 dataset in input-level fusion image segmentation applications.

Two-stage models for lesion and organ detection consist of a region proposal network (RPN), which proposes candidate object locations, and a detection network that selects among the region proposals; examples are Faster R-CNN [21] and Mask R-CNN [18,22]. One-stage models are faster and simpler: they skip the region-proposal step and run detection directly, considering the probability that an object appears at every point in the image; examples are YOLO (You Only Look Once) [23], SSD (Single Shot MultiBox Detector) [9] and RetinaNet [24]. In combined FCN and GAN architectures, PET images are first generated from CT scans, and the synthesized PET images are then used in a false-positive reduction layer [18,25]. Three categories of segmentation models can be distinguished: FCN-based models [26], U-Net-based models [27] and GAN-based models [28]. FCN variants, cascaded FCN [29,30], parallel FCN [31] and recurrent FCN [32], also achieve medical image segmentation goals with good results. U-Net [27] and its derivatives segment medical images with good results. U-Net is based on the FCN structure and consists of a series of convolutional and deconvolutional layers, with short connections between layers of equal resolution. U-Net and variants such as UNet++ [33] and recurrent U-Net [34] perform well in many medical image segmentation tasks [18,35].
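As a minimal, illustrative sketch (not from the paper) of how the two-stage detectors mentioned above are typically run, the following Python/PyTorch code loads a COCO-pretrained Faster R-CNN from torchvision. It assumes torchvision >= 0.13 (for the `weights` argument); the random input tensor and the 0.5 confidence threshold are placeholders for illustration only.

```python
# Inference sketch for a two-stage detector (Faster R-CNN with ResNet-50 FPN).
import torch
import torchvision

# Load a detector pre-trained on COCO; the RPN proposes candidate boxes,
# the detection head classifies and refines them.
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

# Dummy RGB input; real scans would be converted to 3-channel floats in [0, 1].
image = torch.rand(3, 512, 512)

with torch.no_grad():
    # One dict per image: "boxes" (x1, y1, x2, y2), "labels", "scores".
    predictions = model([image])

boxes = predictions[0]["boxes"]    # candidate object/lesion locations
scores = predictions[0]["scores"]  # confidence per proposal
keep = scores > 0.5                # illustrative confidence threshold
print(boxes[keep])
```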
A GAN is a type of mixed (supervised and unsupervised) architecture, called semi-supervised, composed of two neural networks, a generator and a discriminator (classifier), which compete with each other in an adversarial training process [28]. In segmentation models, the generator is used to predict the target mask based on encoder-decoder structures (such as FCN or U-Net) [18], while the discriminator serves as a shape regularizer that helps the generator achieve satisfactory segmentation results [16,33]. GANs are also used to generate synthetic instances of different classes.

Deep auto-encoders (AEs) belong to unsupervised learning: they use unlabeled input data, require no a priori knowledge, and the results to be obtained from processing the input are unknown, so they learn to organize information without an error signal to evaluate candidate solutions [36,37]. The main feature of the autoencoder is that the input and output layers have the same size and the output must reproduce the input, while the hidden layers are smaller, so input patterns are progressively encoded and decoded throughout the process. The autoencoder can thus extract the fundamental characteristics of the input, and is used to reduce data dimensionality, but also to reduce noise in input data (such as images). AEs are often used for data (image and signal) reconstruction, denoising or augmentation [37,38]. Medical applications of DL models, according to the scope for which they were used, are summarized in Table 1 [39].

Table 1. Medical applications of DL models according to the scope for which they were used [39].

| Task | Application | DL model(s) |
|---|---|---|
| Classification | Benefit from unlabelled data for lung tumour stratification | DBN [40] |
| | Introduction of a transfer learning approach in rectal cancer prediction | CNN [41] |
| | Identification of bladder tumour sub-types from histopathological images | ResNet [42] |
| | Improvement in breast tumour estimation by considering a large set of risk factors | CNN [43] |
| | Estimation of the cancer grade | CNN [44] |
| | Estimation of the cancer type | CNN [45,46], ResNet [47] |
| | Limitation of overfitting | GAN [48], ResNet [49] |
| | Analysis of the particular characteristics of the heart by using echocardiograms | ResNet [50] |
| | Improvement in bone image quality | U-Net [51] |
| | Analysis of the impact of gender on skeletal muscles | CNN [52] |
| | Automatic estimation of brain disease risk | AlexNet [53], CNN [54] |
| | Improvement of accuracy and efficiency in COPD diseases | ResNet [55], VGGNet + CNN [56], DBN [57] |
| | Analysis of interstitial lung diseases | CNN [58] |
| | Estimation of the normal levels of the pancreas | CNN [59,60] |
| | Improvement in image quality | CNN [61], CNN + LSTM [62] |
| | Improvement in accuracy in abdominal ultrasounds | CNN [63] |
| Detection | Optimal localization of lung cancer sub-types | CNN [64] |
| | Low-cost object detection for malaria | YOLO [65] |
| | Improvement in image accuracy in neoplasia analysis | ResNet [66] |
| Segmentation | Analysis of colour contrast and parameter variability issues in pancreatic tumour | U-Net [67] |
| | Impact of dimension variations on DL model performance in thyroid melanomas | U-Net [68] |
| | Limitation of the overfitting problem in bone cancer | CNN [69], GAN + U-Net [70] |
| | Improvement in image accuracy in lung and prostate cancer | U-Net [71,72], GAN [73] |
| | DL model for multi-step integration and registration error reduction in atrial fibrillation analysis | CNN + LSTM [74] |
| | Accuracy in the analysis of irregular pelvic hematoma images | U-Net [75] |
| | Improvement in aortic disease analysis with the introduction of new accuracy measures | U-Net [76] |
| | Introduction of the transfer learning approach in atrium study | U-Net [49] |
| | Analysis of the impact of image quality in osteoarthritis | U-Net [77], RCNN [78] |
| | Introduction of transfer learning and attention mechanism in the study of the knees | VGGNet + U-Net [79] |
| | Improvement in image accuracy of the cartilage | U-Net [80], HNN [15], U-Net + GAN [81], RCNN |
| | Combination of the region-based approach with U-Net for bone diseases | RCC + U-Net [82] |
| | Limitation of overfitting in white matter analysis | GAN [83] |
| | Colour quality improvement in orbital analysis | U-Net [84] |
| | Segmentation of lung lobe using different types of datasets | U-Net [85] |
| | Analysis of image effects in neoplasia and catheter detection | U-Net [66], RNN [86] |
| Reconstruction | Improvement in the signal-to-noise ratio, multi-data integration | CNN [87] |
| | Improvement in image quality at high levels in the study of coronary diseases | CNN [88] |
| | Application of CNNs to computed tomography for chest digital images | CNN [89] |

CNNs (convolutional neural networks) are popular in areas where the shape of an object is an important feature, such as image analysis [5,39,94,95], particularly the study of cancers and bodily injuries in the medical sector [96,97], and video analysis [39,98]. A CNN contains convolutional layers, pooling layers, dropout layers and an output layer, hierarchically positioned so that each learns specific characteristics in the image [99]. In image analysis, CNNs perform poorly when high-resolution datasets are considered [100] and when localization over large patches is required, especially in medical images [101,102]. Figure 4 synthesizes the classification of DL models according to the characteristics and tasks for which they were designed; we describe them below [102].

DL architectures classification [103]. Supervised DL models: CNN performance is strongly influenced by the selection of hyper-parameters, and any small change in hyper-parameters affects overall CNN performance, so careful parameter selection is an extremely significant concern during development of the optimization scheme. Impressive and robust hardware resources, such as GPUs, are needed for effective CNN training, and also for exploring the effectiveness of CNNs in intelligent and embedded systems. Exploiting depth and various structural adaptations significantly improves CNN learning capacity.
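To make the layer stack just described concrete, here is a minimal PyTorch sketch (not from the paper) of a CNN with convolutional, pooling, dropout and output layers; all sizes, channel counts and the two-class head are illustrative assumptions.

```python
# Minimal CNN with the layer types described above.
import torch
import torch.nn as nn

class SmallCNN(nn.Module):
    def __init__(self, num_classes: int = 2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),  # convolutional layer
            nn.ReLU(),
            nn.MaxPool2d(2),                             # pooling layer
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Dropout(0.5),                             # dropout layer
            nn.Linear(32 * 56 * 56, num_classes),        # output layer
        )

    def forward(self, x):
        return self.classifier(self.features(x))

# e.g., a batch of four 224x224 single-channel scans
logits = SmallCNN()(torch.rand(4, 1, 224, 224))
```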
Replacing the traditional layer configuration with blocks leads to significant progress in CNN performance, as shown in the recent literature. Today, the development of new and efficient block architectures is the main trend in research on new CNN architectures; HRNet is just one example showing that there are always ways to improve an architecture. Cloud-based platforms are expected to play a key role in the future development of DL computing applications [104]. Several deep learning computer-assisted diagnosis (CAD) systems for digital breast tomosynthesis (DBT) are currently available, and many new systems will be developed. However, there are still many challenges to overcome. As Wang et al. [105] recently demonstrated, published models for full-field digital mammography (FFDM) classification fail when applied to different datasets, even when those datasets include acquisitions made with similar equipment. For FFDM, deep learning-based detection models have been shown to perform with almost human precision [106]. As more studies and data become available, there is no reason to believe this should be different for DBT. However, the trained radiologist can adapt when analyzing different datasets, indicating that high-performance deep learning models still lack the "key" characteristics that differentiate disease from normal [107].

Image analysis performance is enhanced by the following architectures, AlexNet, VGGNet, ResNet, YOLO and U-Net, which we describe below. AlexNet was proposed by Krizhevsky et al. [97] for the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) in 2012 [39]. AlexNet [103] consists of 8 layers, 5 convolutional layers and 3 dense, fully connected layers, with overlapping pooling, dropout, data augmentation, ReLU activations after each convolutional and fully connected layer, and SGD with momentum [97]. AlexNet is used for image recognition in image analysis and is usually applied to problems involving semantic segmentation and high-resolution classification tasks [39,70,73]. VGG (Visual Geometry Group) consists of 13 convolutional layers (in VGG16) or 16 convolutional layers (in VGG19), 3 dense layers, pooling and ReLU units, with very small receptive fields [108]. VGG is used for object recognition, classification of medical images [109,110] and image segmentation [18,36]; it loses accuracy when the depth becomes too great. ResNet (Residual Neural Network) contains gated units or gated recurrent units and bears a strong similarity to elements recently applied with success in RNNs [103]. ResNet is characterized by residual mapping, the identity function, and a two-layer residual block in which one layer learns the residual and the other the identity function; it achieves a high level of performance in image classification [111] and audio analysis tasks [39,112]. GoogLeNet is built from a 22-layer deep CNN with 4 million parameters and contains several layer filters and stacked convolutional layers [113]; it uses batch normalization, image distortions and RMSprop [103]. U-Net, developed by Ronneberger [101], addresses the localization problem of a standard CNN by extracting data features and then reconstructing the original dimension through an upsampling operation. U-Net is a type of encoder-decoder network in which the decoded output lies in the input space.
U-Net is used in single-stage segmentation and classification [114], specifically in locating cancerous lesions [38,115,116]. SegNet [39,117] is a U-Net variant that uses max-pooling indices in the upsampling step, which reduces U-Net's space complexity [118]. RNNs, developed by Rumelhart et al. [119], efficiently use the correlations existing between the input data of a prediction problem to process sequential data; they are applied in text analysis [84,119,120], in electronic medical records to predict diseases [121,122] and in speech recognition [123]. RNN variants are unidirectional, learning from the past to predict the future, and bidirectional, which also use future context to reconstruct the past. RNN variants include LSTM, GRU, recursive NNs and bidirectional RNNs (BiRNN). LSTMs were introduced by Hochreiter and Schmidhuber [39,103,124] and consist of a forget gate, which alleviates the vanishing and exploding gradient problem, an input gate and an output gate, the latter two controlling the flow of data into and out of the cell. They have been used in speech recognition [45], path prediction [46] and medical diagnosis [64], in which the authors proposed an LSTM network, called DeepCare, combining different types of data to identify clinical diseases. GRUs (gated recurrent units), created by Kyunghyun Cho et al. in 2014 [48], solve the problem of the growing time complexity of LSTMs when large amounts of data are used. The GRU consists of a reset gate, which decides how much information from the past is transmitted to the future, and an update gate, which decides how much information from the past can be forgotten. GRUs and LSTMs have similar applications, especially in speech recognition [39,125]. Bidirectional recurrent neural networks (BRNNs), introduced by Schuster and Paliwal [44], are characterized by a hidden state updated using past information, as in a classic RNN, and also information related to future moments. They have been applied in handwriting and speech recognition, where they detect missing parts of a sentence from knowledge of the other words [41,126]. BM (Boltzmann machine) models are a family of RNNs that are easy to implement and can reproduce many probability distributions; BMs are used in image classification, and BMs combined with other models are used to locate objects [39,40,127]. In image classification, BMs are used to identify the presence of a tumor [128]. BM models are slow and ineffective when the data size increases exponentially, owing to the complete connectivity between neurons [129]. The restricted BM was proposed to address this: relaxing the connections between neurons of the same layer, or allowing only one-way connections between neurons, solves the problem of the classic BM model [5]. AEs, developed by Rumelhart et al. [119], consist of an encoder and a decoder, and aim to reduce data dimensionality through meaningful representations while learning data characteristics for reconstructing the output. They are used in medical image analysis [72,130], natural language processing [67] and video analysis [68]. Additional AE variants found in the literature include the variational AE (VAE). In a VAE, the encoder outputs the probability density function of the input in the feature space, and after the encoding stage, new data are sampled from this PDF.
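As an illustration of the VAE mechanism just described (encoder outputs distribution parameters, a sample is drawn and decoded back to the input size), here is a minimal PyTorch sketch, not taken from any cited work; the flattened 784-dimensional input and layer sizes are illustrative assumptions.

```python
# Minimal variational autoencoder with the reparameterization trick.
import torch
import torch.nn as nn

class TinyVAE(nn.Module):
    def __init__(self, dim=784, latent=16):
        super().__init__()
        self.enc = nn.Linear(dim, 64)
        self.mu = nn.Linear(64, latent)       # mean of the latent Gaussian
        self.logvar = nn.Linear(64, latent)   # log-variance of the latent Gaussian
        self.dec = nn.Sequential(nn.Linear(latent, 64), nn.ReLU(),
                                 nn.Linear(64, dim), nn.Sigmoid())

    def forward(self, x):
        h = torch.relu(self.enc(x))
        mu, logvar = self.mu(h), self.logvar(h)
        # sample from the encoded distribution (reparameterization trick)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)
        return self.dec(z), mu, logvar

recon, mu, logvar = TinyVAE()(torch.rand(8, 784))
# Training would minimize reconstruction error plus KL(q(z|x) || N(0, I)).
```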
Differently from the DAE and the SAE, a VAE is not a regularized AE but belongs to the generative class [39]. The GAN is used to generate synthetic training data from original data using a latent distribution [131]. It consists of two networks: a generator, which produces fake data from input data, and a discriminator, which differentiates fake data from real data and separates them in order to increase the quality of the generated data. GANs have two problems: mode collapse, and the fact that training can become very unstable [103]. DBN: the deep belief network, created by Hinton [132], consists of two networks built on each other: a belief network, represented by an acyclic graph composed of layers of stochastic binary units with weighted connections, and restricted Boltzmann machines, which are stochastic. DBNs are applied in image recognition and speech recognition; in classification, to detect lesions in medical diagnosis; in video recognition, to identify the presence of persons [133]; in speech recognition, to understand missing words in a sentence [134]; and in applications on physiological signals, to recognize human emotion [39,135,136]. DTN contains a feature extraction layer, which learns a shared feature subspace in which the marginal distributions of source and target samples are drawn close, and a discrimination layer that matches conditional distributions by classified transduction [103,106]. TDSN contains two parallel hidden representations that are combined using a bilinear mapping [137]; this arrangement provides better generalization than a single-module architecture. The biases of the generalizers with respect to the learning set are inferred; it works effectively, better than a cross-validation strategy, when used with multiple generalizers rather than individual ones. DIM maximizes the mutual information between the input and output of a highly flexible convolutional encoder [103,138] by training another neural network that maximizes a lower bound on a divergence between the marginal product of the encoder's input and output. Estimates obtained by this second network can be used to maximize the mutual information of the features in the encoder input. DIM's memory requirement is lower because it requires only an encoder, not a decoder. DL models can be combined in five different ways depending on the type of data involved in the problem to be solved, among them three types of hybrid architecture (HA): the integrated model, the built-in model and the ensemble model. In the integrated model, the output of the convolution layer is transmitted directly as input to other architectures, as in the residual attention network, the recurrent convolutional neural network (RCNN) and the inception recurrent residual convolutional neural network (IRRCNN) [103,139]. In the built-in model (the improved common hybrid CNN-BiLSTM), the dimensionality-reduction model and the classification model work together, the results of one being the inputs of the other. In the ensemble model (EJH-CNN-BiLSTM), several basic models are combined. The transfer learning (TL) model is trained on, and applied to, the same type of problem; CNN models that use TL include VGG (e.g., VGG16 or VGG19), GoogLeNet (e.g., InceptionV3), Inception Network (Inception-v4), residual neural networks (e.g., ResNet50) and AlexNet. Joint attention-based DL combines max pooling and attention sharing [103,140].
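To illustrate the generator/discriminator competition described above, here is a minimal PyTorch sketch (not from the paper) of one adversarial training step; the two tiny MLPs, the flattened 784-dimensional "images" and the random batch are illustrative assumptions.

```python
# One adversarial training step for a minimal GAN.
import torch
import torch.nn as nn

latent_dim = 64
G = nn.Sequential(nn.Linear(latent_dim, 128), nn.ReLU(),
                  nn.Linear(128, 784), nn.Tanh())        # generator
D = nn.Sequential(nn.Linear(784, 128), nn.LeakyReLU(0.2),
                  nn.Linear(128, 1))                      # discriminator

opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()

real = torch.rand(32, 784)  # stand-in for a batch of flattened real images

# Discriminator step: separate real samples from generated ones.
fake = G(torch.randn(32, latent_dim)).detach()
loss_d = bce(D(real), torch.ones(32, 1)) + bce(D(fake), torch.zeros(32, 1))
opt_d.zero_grad(); loss_d.backward(); opt_d.step()

# Generator step: produce samples the discriminator labels as real.
fake = G(torch.randn(32, latent_dim))
loss_g = bce(D(fake), torch.ones(32, 1))
opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```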
Because the output of a CNN is a 3D value and an RNN works with 2D data, a reshaping layer is placed between the CNN and the RNN to convert the CNN output into an array. CNN + RNN has been successfully applied in text analysis, to identify missing words [141], and in image analysis, to increase the speed of magnetic resonance image storage [49,50]. CNN + RNN variants are obtained by replacing the standard RNN component with an LSTM component [39,48,65]. The AE + CNN architecture combines an AE, as a pre-training model when using data with high noise levels, with a CNN as a feature-extractor model. AE + CNNs have applications in image analysis, to classify noisy medical images [76], and in the reconstruction of medical images [86,130]. GAN + CNN combines a GAN, as a pre-training model to moderate the overfitting problem, with a CNN used as a feature extractor; it has applications in image analysis [39,88,142]. The DL architectures applied especially in image analysis are CNN, AE and GAN. CNNs preserve the spatial structure of the data and are used as feature extractors (especially U-Net), AEs reduce the characteristics of complex images in the analysis process, and GANs are pre-training architectures that select input categories to control overfitting.

The U-Net + Kite-Net + Attention U-Net + HarDNet-MSEG architecture, the DL model proposed by Luca et al. [143], is a combined model designed around the key features of the architectures involved: U-Net is enhanced with a context-aggregation encoder block and still retains the low-level image features that result from U-Net, but generates slightly finer segmentation without added cost thanks to the context-aggregation blocks; Kite-Net contains a unit with attention gates and a Kite-Net decoder, adding Kite-Net's benefit of attention to detail; a partial decoder like the one in the HarDNet-MSEG architecture is used as the new U-Net decoder to reduce training time; and Attention U-Net suppresses irrelevant regions and key features without adding significant computing cost, giving a slightly smoother segmentation of image features. This combined DL model has not been demonstrated in practice, being a project [143,144].

We further highlight acquisitions in the study of deep learning and its applications in medical image analysis [41]. References to image labeling and annotation, to the development of new deep learning models with increased performance, and to new approaches to medical image processing are easy to identify [155]. Table 2 [37] exemplifies applications in medicine and the performance of DL models depending on the types of medical images and the therapeutic areas in which they were used. We exemplify the methods of incorporating medical knowledge and data according to the purpose of DL models in medical applications, namely diagnosis-classification, detection, segmentation, reconstruction and retrieval of medical images, and generation of medical reports (see Figure 5). Transfer learning uses multimodal medical images and natural images. Multi-task learning uses medical data from other diseases. Curriculum learning uses training patterns to incorporate medical data from doctors.
Network design uses the diagnostic patterns in medical data from doctors. The attention mechanism uses the areas doctors focus on. Decision-level fusion uses the features doctors focus on. Multi-task learning and network design likewise draw on medical data from doctors. When interpreting medical images, imaging doctors use patterns or procedures in diagnosing diseases, and incorporating these patterns and procedures into deep learning networks increases performance. The types of medical data used in deep learning models for diagnosing disease are: hand-crafted features (appearance, structures, shapes), related diagnostic information, and other types of diagnostic-related information.

(1) The training pattern consists of curriculum learning, in which tasks and images evolve from simple to complex during training; the curriculum involves a suite of training samples ranked in ascending order of learning difficulty. A training model based on curriculum learning in a deep learning network is developed in [162]. (2) General diagnostic models of doctors, namely the patterns and procedures used by imaging doctors when interpreting medical images: radiologists interpret chest X-ray images in three stages, an overview, local lesion regions, and a subsequent combination of the general data [163]. (3) The use of the radiologists' diagnostic pattern for the diagnosis of thoracic disease, by extracting and combining global and local traits, is carried out in [163]. Target regions or "attention maps": imaging doctors focus on specific areas in the diagnosis of diseases, and these attention maps indicate the target areas when interpreting images. (4) Attention features (appearance, structure, shapes): "handcrafted characteristics", as produced by doctors, include asymmetry, edge, color, margin, shape, micro-calcification, echo pattern, acoustic attenuation and lateral acoustic shadow; likewise, the benign-malignant risk of pulmonary nodules is classified by six nodule characteristics: calcification, sphericity, edge, spiculation, texture and others. (5) Related diagnostic information (decision-level fusion, feature-level fusion, input-level fusion, features as labels). Decision-level fusion: a CNN classifier model automatically extracts handcrafted characteristics and CNN-extracted characteristics (contrast, texture, spiculation of the image) and combines the results of the two classifiers by fusion at the decision level [164].
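As a minimal sketch of the decision-level fusion just described (not the model of [164]), the PyTorch code below averages the posterior estimates of two independent classifiers, one on deep features and one on handcrafted descriptors; the feature dimensions, the two linear heads and the equal fusion weights are illustrative assumptions.

```python
# Decision-level fusion of a deep-feature classifier and a handcrafted-feature classifier.
import torch
import torch.nn as nn

cnn_head = nn.Linear(512, 2)          # classifier on CNN-extracted features
handcrafted_head = nn.Linear(7, 2)    # classifier on handcrafted descriptors

deep_feats = torch.rand(8, 512)       # e.g., from a CNN feature extractor
hc_feats = torch.rand(8, 7)           # e.g., contrast, texture, spiculation, ...

p_cnn = torch.softmax(cnn_head(deep_feats), dim=1)
p_hc = torch.softmax(handcrafted_head(hc_feats), dim=1)

# Fusion at the decision level: average (or weight) the two posteriors.
p_fused = 0.5 * p_cnn + 0.5 * p_hc
prediction = p_fused.argmax(dim=1)
```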
Feature-level fusion. The feature-level fusion model combines two handcrafted features, parameter-free threshold adjacency statistics and the gray-level co-occurrence matrix, with five groups of deep learning features extracted from five different deep models [18,37]. Input-level fusion. In input-level fusion, handmade features are turned into patches that describe specific features and are used as input for the CNN, then combined in solving the problem; in some models these patches are used as input to DScGAN to increase diagnostic performance. Using features as labels of CNNs: image classification labels and labels of handmade features are included in deep learning models through the multi-task learning architecture to increase their performance. (6) Other types of diagnostic-related information (additional labels, additional clinical diagnostic reports). Additional category labels for medical images, normal, malignant or benign, or the condition of the lesions, incorporated into a multi-task learning structure, can improve performance on the main classification task [18]. Additional clinical diagnostic reports: the clinical report is a summary of the descriptions made by the doctor during the imaging examination. Four categories can be exemplified: training patterns, diagnostic patterns, target regions, and hand-crafted features (appearance, structures, shapes).

CASED performs curriculum adaptive sampling to solve the problem of highly imbalanced data: it lets the model first distinguish nodules from their immediate proximity and subsequently enlarges the hard-to-classify global context, up to uniform sampling of categories over the empirical data pool. In this way, CASED achieves top performance and is used in the detection of pulmonary nodules in thoracic CT [165]. A model on LUNA16, also based on curriculum learning, is used in cardiac detection [166]. Radiologists use patterns to locate lesions in medical images, namely: 1. combining images acquired in different settings (brightness and contrast); 2. using bilateral, transverse and adjacent images. Radiologists combine images collected in different settings (brightness and contrast) to locate lesions by visual interpretation of CT images; in the same way, a model is built with multi-view features (FPN), brightness and contrast, combined using an attention module that identifies the position, with an increase in accuracy on NIH DeepLesion [167]. Bilateral information is compared by radiologists when interpreting images. Handmade characteristics, e.g., locations, structures and shapes, are represented as "hand-crafted characteristics" for identifying target objects, nodules or lesions in medical images. The description of the target regions, e.g., information from radiological reports and additional labels, is extracted from the radiological information and coupled with curriculum learning, and the results are used by the network in ascending order of difficulty. Transfer learning uses data from natural images to improve performance in medical image segmentation; the transfer of the knowledge acquired by a CNN architecture originally trained to segment WM hyper-intensities on old low-resolution data to new data from the same scanner, but with better image resolution, is studied in [168]. Multimodal learning, in which MRI and CT are used simultaneously by a pre-trained deep learning architecture.
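To make the input-level fusion described above concrete, here is a minimal PyTorch sketch (not from any cited model) in which handcrafted feature maps are stacked with the image as extra input channels; the random tensors standing in for an edge map and an anatomical prior are illustrative assumptions.

```python
# Input-level fusion: handcrafted maps concatenated with the image as channels.
import torch
import torch.nn as nn

image = torch.rand(4, 1, 128, 128)        # original grayscale patches
edge_map = torch.rand(4, 1, 128, 128)     # e.g., a handcrafted edge/shape map
shape_prior = torch.rand(4, 1, 128, 128)  # e.g., an anatomical prior mask

# Stack handcrafted maps with the image along the channel dimension.
fused_input = torch.cat([image, edge_map, shape_prior], dim=1)  # 3 channels

net = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=3, padding=1)
features = net(fused_input)  # downstream layers see image + prior knowledge
```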
Training pattern: for the segmentation of lesions in medical images, deep learning models use curriculum learning. Diagnostic pattern: specific patterns used by doctors and embedded in the network. Characteristics of the image (shape, location, topology): radiologists rely on certain characteristics of the image, its shape, position and lesion typology, when interpreting medical images. There are three ways of incorporating lesion features from medical imaging into deep learning architectures: 1. incorporating the characteristics of the lesions in the post-processing stage; 2. incorporating the characteristics of the lesions as regularization terms in the loss function; 3. learning the characteristics of the lesion through generative models. For input fusion, handmade characteristics are transformed into input patches; subsequently, the original image patches and the tagged patches are fed into a deep segmentation network [18].

The objective of reconstruction is to obtain a diagnostic image from a series of measurements. Deep learning architectures use knowledge from natural images (a pre-trained VGG model based on ImageNet) or medical data. Deep learning models for image captioning have been successfully applied to the automatic generation of medical reports [169,170]. Some templates from radiologists' reports are used during the sentence-generation process [80,167]. A model-agnostic method attempts to learn the short text description that explains the decision-making process [171] and to transfer the visual characteristics of medical images to a graph of anomalies [18]. A module incorporates the pre-built graph over multiple disease findings to help generate reports using the IU-RR dataset [18,172]. Imaging doctors combine data from different stages and experiences, as opposed to DL models, which incorporate the same types and modes of handcrafted features. Data quality and volume, annotations and labels, and the identification and automatic extraction of specific medical terms can help deep learning models perform image-analysis tasks [18]. Simultaneous incorporation of different types of medical knowledge, features and labels into DL architectures increases their performance (see Table 3) [102].

Table 3. Applications in medicine, methods of incorporation of types of data, datasets and their correlation.
| Data source and incorporation method | Applications |
|---|---|
| Feature-level fusion | diagnosis of pulmonary nodules [183]; classification of breast cancer in histological images [146]; diagnosis of glaucoma [184]; diagnosis-classification of skin lesions [185]; diagnosis-classification of lung nodules [18,186]; diagnosis of brain tumors [187] |
| Incorporation of patch characteristics | diagnosis-classification of lung nodules [56]; diagnosis-classification of thyroid disease [188] |
| DSc-GAN | diagnosis of thyroid nodules [18,189]; diagnosis of breast cancer in multi-sequence MRI [190] |
| As labels of CNNs | diagnosis-classification of lung nodules [191]; differentiation (benign-malignant) of lung nodules in CT scans [8]; diagnosis of glioma [192]; prediction of a sensitive, specific, balanced merged result for glaucoma images [193] |
| Additional clinical diagnosis reports (abstract descriptions) | TieNet classifies common thoracic diseases in chest X-rays [194]; facilitates the interpretation of pathological images of bladder cancer [133] |
| Natural image datasets (ImageNet and COCO); transfer learning (fixed feature extraction, initialization) | diagnosis-detection of lymph nodes [18,195]; of polyps and pulmonary embolism [196]; of breast tumors [65]; of colorectal polyps [197,198] |
| Medical image datasets (PET, CT, mammography, X-ray); multi-task learning | PET image applications incorporated for the diagnosis-detection of lesions in CT images of the liver [25]; diagnosis-detection of liver tumors [199]; of breast masses [200]; of pulmonary nodules in CT images [18,201]; of retinal diseases in fundus images [202]; of colitis in CT images [203]; intervertebral disc detection in X-ray images [18,204]; architectural distortions in mammograms [18,205]; breast tumors in mammograms [18,206]; pulmonary nodules in CT [207]; various lesions (liver, lung, bone, abdominal) in CT images [18,208]; malignant lesions of the liver, reducing average false positives per case by 28% [18,25]; breast masses from digital tomosynthesis [200,207] |
| Areas of interest, specific data from doctors ("attention maps"); models that explicitly incorporate attention maps | diagnosis-detection of thoracic disease [173]; of mammograms [18,214,215] |
| Hand-crafted features; attention features | diagnosis-detection of mammographic lesions [125]; of pulmonary nodules [216]; of thyroid nodules, with size and shape attributes of the nodules [18,189]; of lymph nodes in oncological imaging [217]; of lung lesions [18,218] |
| Natural image datasets [18,196] | prenatal diagnosis-segmentation of ultrasound images [222]; diagnosis-segmentation of the gland in histopathological images [18,117]; of the proximal femur in 3D MRI [18,223]; of multiple sclerosis [224] |
| Report generation | some templates from radiologists' reports are used during the sentence-generation process [80,167]; a model-agnostic method learns the short text description that explains the decision-making process [18,171]; transfers the visual characteristics of medical images to a graph of anomalies, then retrieves text templates based on the anomalies and their attributes for thoracic X-ray images [18,167]; incorporates the pre-built graph (modeled with a graph CNN) over multiple disease findings to help generate reports using the IU-RR dataset [18,172] |

In this paper, as a research novelty, we approached the constituent elements of DL models in a unitary way:
• an updated presentation of the data types and DL models used in medical image analysis;
• the correlation and contribution to DL model performance of the constituent elements: data type, incorporation methods and DL architectures;
• the features and "key" tasks of DL models for the successful completion of tasks in applications in the interpretation of medical images.

The quality and volume of the data, annotations and labels, and the identification and automatic extraction of specific terms from reports, guides and books in the medical field can increase the diagnostic accuracy of doctors and help deep learning models perform image-analysis tasks. Doctors use a descriptive language, namely contour, contrast, appearance, localization, topology, etc., or compare bilateral images; incorporating these representations and image attributes into DL architectures increases their performance [102]. Imaging doctors combine data from different stages and experiences, as opposed to DL models, which incorporate the same types and modes of handcrafted features [18].

The diagnostic pattern and the training pattern simultaneously incorporate high-level and low-level knowledge (handcrafted features, anatomical priors). High-level medical data are incorporated as input images, and low-level medical data are learned using specific network structures [18,237]; along with direct network design, information from low-level medical data can also be used to design training curricula when combined with the training pattern [18,173]. Simultaneous incorporation of different types of medical knowledge can increase the performance of deep learning models in medical applications.

DL can support the solving of complex problems with uncertain options in investigations and therapy, and could help medically by filtering and providing data from the literature. This leads to personalized medicine for the patient's disease, with diagnostic and therapeutic options based on scientific evidence. Another aspect is the time invested by the doctor in patient care: time is gained through the constructive and effective support of DL in medical decision-making and synthesis activities. The use of "key" characteristics specific to each constituent of DL models, and the correct determination of their correlations, may be the subject of future research aimed at increasing the performance of DL models in the interpretation of medical images.

Problems in medical image analysis can be categorized as follows:
• identification, automatic extraction and standardization of specific medical terms;
• representation of medical knowledge;
• incorporation of medical knowledge.
Problems in medical image analysis are also related to:
• medical images provided as data for deep learning models, which require quality, volume, specificity and labelling;
• data provided by doctors: descriptive data and labels are ambiguous for the same medical references and non-standardized;
• laborious data-processing time, a problem to be solved in the future;
• the lack of clinical trials demonstrating the benefits of DL medical applications in reducing morbidity and mortality and improving patient quality of life [39,102,264,265].

A full analysis of the mechanism behind medical applications, from data and databases, through methods of incorporating knowledge into DL models, to the improvement of DL models and their performance in medical applications, leads to the following problems to be solved: identification and automatic extraction of specific terms from medical documents, representation of medical knowledge, and incorporation of medical knowledge. Incorporating specific medical terms and the descriptive attributes corresponding to diseases in medical images into DL models improves their performance, and therefore requires solving these same problems of identification and automatic extraction of specific terms from medical documents, presentation of medical knowledge, and incorporation of medical knowledge. Problems in medical image analysis are related to the quality, volume, specificity and labelling of the medical images used for a particular DL task. In addition, the provision of data from doctors, handmade and ambiguous expressions for the same medical references, uncertain segment boundaries in images, low image resolution, annotations, labels and laborious data-processing time are problems to solve in the future. Another problem is the lack of clinical trials demonstrating the benefits of DL medical applications in reducing morbidity and mortality and improving the quality of life of patients.

Future directions consist of domain adaptation, knowledge graphs, generative models, and network architecture search techniques. Domain adaptation consists of transferring information from a source domain to a target domain; adversarial learning [266], for example, narrows the domain shift between source and target domain in input space [267], feature space [268,269] and output space [270,271]. It can be used to transfer knowledge from one medical dataset to another [212], even when they have different imaging modalities or belong to different diseases [18,168,272]. UDA (unsupervised domain adaptation), which uses medical labels, has demonstrated performance in disease diagnosis and organ segmentation [18,81,188,255,273]. The knowledge graph, which has the specific capability of incorporating multimodal medical data, achieves performance in medical image analysis and the creation of medical reports [167]. Graphs of medical data describing the relationships between different types of knowledge, between different diseases, and between medical datasets and a type of medical data help deep learning models perform [274]. Generative models, GAN and AE, are used for segmentation tasks in particular. GAN uses MRI datasets for CT image segmentation [18,225,272]; GAN is a type of unsupervised deep learning network used in medical image analysis. AEs are used to extract characteristics and shape priors of objects such as organs or lesions, completely unsupervised, and are easily incorporated into the network training process [18,85,237].
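As a minimal sketch of the adversarial domain adaptation idea just described (not a specific published model), the PyTorch code below uses a gradient reversal layer so that a shared feature extractor learns features that a domain discriminator cannot tell apart, drawing source and target feature distributions together; the tiny MLPs, 256-dimensional inputs and random labels are illustrative assumptions.

```python
# Adversarial feature alignment with a gradient reversal layer.
import torch
import torch.nn as nn

class GradReverse(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        return x.view_as(x)
    @staticmethod
    def backward(ctx, grad):
        return -grad  # reverse gradients flowing back into the extractor

features = nn.Sequential(nn.Linear(256, 64), nn.ReLU())  # shared extractor
classifier = nn.Linear(64, 2)                            # disease classifier
domain_disc = nn.Linear(64, 2)                           # source vs. target

src, tgt = torch.rand(16, 256), torch.rand(16, 256)      # two datasets
f_src, f_tgt = features(src), features(tgt)

# Task loss on labeled source data only.
task_loss = nn.functional.cross_entropy(
    classifier(f_src), torch.randint(0, 2, (16,)))

# Domain loss on both domains; reversed gradients push the extractor
# toward features that do not reveal the domain.
f_all = GradReverse.apply(torch.cat([f_src, f_tgt]))
dom_labels = torch.cat([torch.zeros(16), torch.ones(16)]).long()
dom_loss = nn.functional.cross_entropy(domain_disc(f_all), dom_labels)

(task_loss + dom_loss).backward()
```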
In traditional machine learning, the learning process is separate and is carried out only on specific models, datasets and tasks; knowledge is therefore not retained or transferred between models. In deep learning, by contrast, transfer learning can use knowledge such as the weights and features of a pre-trained model to prepare a new model, and to address tasks that have a smaller amount of data. Transfer learning with deep learning models is faster, has improved accuracy and/or needs less training data [275]. A new transfer learning approach to address the lack of training data in medical imaging tasks is the technique called dual transfer learning: the learned characteristics are used to improve the performance of other tasks, such as classifying skin lesions as benign or malignant or, in the case of breast lesions, classifying histological mammary images into four classes: invasive carcinoma, in situ carcinoma, benign tumor and normal tissue [276].

Cloud computing provides a solution for managing the enormous amount of data; it also helps increase efficiency and reduce costs, and offers the flexibility to train DL architectures [104]. With the recent development of computing tools, including chips for neural networks and mobile GPUs, we will see more deep learning applications on mobile devices, and it will be easier for users to use DL [104]. The network architecture search (NAS) technique can automatically identify a suitable network architecture for computer vision tasks [277] and promises similar use and performance in the medical field [18,278].

With audacity, hope and confidence in the realization of our scientific desires, we, the authors, launch an appeal to the international scientific forum, with the aim that the following ideas be put into practice at the initiative of standard-setting researchers in the field, "voices heard and listened to", who have the power to flesh them out:
• the establishment of a federative institution integrating scientific data and products specific to the field;
• value categorization of industry-specific achievements;
• launching challenges to be developed and completed;
• facilitating the free circulation of discoveries, methods and formulas of scientific products within this federative institution;
• establishing the board of the federative institution through the input and integration of "consequential brains" in the field;
• the creation of a hub of ideas, coordinated within the federation board, with themes assigned for development to specific teams;
• joint effort for an idea launched within the federative institution;
• an inventory of functional, performant applications and methods in the specific field;
• the creation of a financing system to support and implement ideas specific to the field;
• integration of researchers with notable ideas and performance but limited funding or access to knowledge, owing to their belonging to geographical areas or institutions under-represented internationally in the specific field.

Funding: Scientific research funded by the University of Medicine and Pharmacy "Gr. T. Popa" of Iasi, based on contract number 4714. The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
References

Deep Learning in Medical Image Registration: A Survey
Challenges related to artificial intelligence research in medical imaging and the importance of image analysis competitions
Deep Learning Method to Detect Plaques in IVOCT Images
The Unreasonable Effectiveness of Data
An overview of deep learning in medical imaging focusing on MRI
Expert knowledge-infused deep learning for automatic lung nodule detection
Combining deep learning and hand-crafted features for skin lesion classification
Risk Stratification of Lung Nodules Using 3D CNN-Based Multi-task Learning. arXiv 2017
Single Shot MultiBox Detector
Multi-task deep convolutional neural network for cancer diagnosis
Learning Spatiotemporal Features with 3D Convolutional Networks
Transfer Learning for 3D Medical Image Analysis. arXiv 2019
Multi-modal Learning from Unpaired Images: Application to Multi-organ Segmentation in CT and MRI
Convolutional Recurrent Neural Networks for Dynamic MR Image Reconstruction
A Deep Cascade of Convolutional Neural Networks for Dynamic MR Image Reconstruction
Automatic Liver Segmentation Using an Adversarial Image-to-Image Network
Limited-Angle Diffuse Optical Tomography Image Reconstruction Using Deep Learning
A survey on incorporating domain knowledge into deep learning for medical image analysis
Synergistic Reconstruction and Synthesis via Generative Adversarial Networks for Accelerated Multi-Contrast MRI. arXiv 2018
Densely Connected Convolutional Networks
Towards Real-Time Object Detection with Region Proposal Networks. arXiv 2019
Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV)
You Only Look Once: Unified, Real-Time Object Detection
Focal Loss for Dense Object Detection
Cross-modality synthesis from CT to PET using FCN and GAN networks for improved automated lesion detection
Fully convolutional networks for semantic segmentation
Convolutional Networks for Biomedical Image Segmentation
Generative adversarial nets
Towards Image-Guided Pancreas and Biliary Endoscopy: Automatic Multi-organ Segmentation on Abdominal CT with Dense Dilated Networks
Automatic Liver and Lesion Segmentation in CT Using Cascaded Fully Convolutional Neural Networks and 3D Conditional Random Fields
Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation
Fine-Grained Recurrent Neural Networks for Automatic Prostate Segmentation in Ultrasound Images
UNet++: A Nested U-Net Architecture for Medical Image Segmentation
Recurrent Residual Convolutional Neural Network based on U-Net (R2U-Net) for Medical Image Segmentation. arXiv 2018
Deep Learning with Lung Segmentation and Bone Shadow Exclusion Techniques for Chest X-Ray Analysis of Lung Cancer. Advances in Computer Science for Engineering and Education
Comparison of Supervised and Unsupervised Learning Algorithms for Pattern Classification
A survey of deep learning models in medical therapeutic areas
Adaptive Augmentation of Medical Data Using Independently Conditional Variational Auto-Encoders
A survey on deep learning in medicine: Why, how and when?
Large scale deep learning for computer aided detection of mammographic lesions
Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis
Leveraging uncertainty information from deep neural networks for disease detection
Attention to Lesion: Lesion-Aware Convolutional Neural Network for Retinal Optical Coherence Tomography Image Classification
Applying Data-driven Imaging Biomarker in Mammography for Breast Cancer Screening: Preliminary Study
Marginal Space Deep Learning: Efficient Architecture for Detection in Volumetric Image Data
Lung Pattern Classification for Interstitial Lung Diseases Using a Deep Convolutional Neural Network
Fast Convolutional Neural Network Training Using Selective Data Sampling: Application to Hemorrhage Detection in Color Fundus Images
On the properties of neural machine translation: Encoder-decoder approaches
Automated mammographic breast density estimation using a fully convolutional network
Automated image quality evaluation of T2-weighted liver MRI utilizing deep learning architecture
Deep image mining for diabetic retinopathy screening
Automated Quality Assessment of Colour Fundus Images for Diabetic Retinopathy Screening in Telemedicine
Super-resolution musculoskeletal MRI using deep learning
Fusing texture, shape and deep model-learned information at decision level for automated classification of lung nodules on chest CT
Automated image quality evaluation of structural brain MRI using an ensemble of deep learning networks
Knowledge-based Collaborative Deep Learning for Benign-Malignant Lung Nodule Classification on Chest CT
Gland Segmentation in Histopathology Images Using Deep Networks and Handcrafted Features
Computer-Aided Diagnosis with Deep Learning Architecture: Applications to Breast Lesions in US Images and Pulmonary Nodules in CT Scans
Transfer learning from RF to B-mode temporal enhanced ultrasound features for prostate cancer detection
Deep Doubly Supervised Transfer Network for Diagnosis of Breast Cancer with Imbalanced Ultrasound Imaging Modalities
Training Medical Image Analysis Systems like Radiologists
An Adaptive Curriculum Learning Framework for Unbiased Glaucoma Diagnosis
Incorporating the Knowledge of Dermatologists to Convolutional Neural Networks for Skin Lesion Diagnosis
Liver Fibrosis: Deep Convolutional Neural Network for Staging by Using Gadoxetic Acid-enhanced Hepatobiliary Phase MR Images
Automated Breast Ultrasound Lesions Detection Using Convolutional Neural Networks
Deep learning with non-medical training used for chest pathology identification
Deep learning predictions of survival based on MRI in amyotrophic lateral sclerosis
Deep learning is effective for the classification of OCT images of normal versus Age-related Macular Degeneration
Collaborative Learning of Cross-channel Clinical Attention for Radiotherapy-Related Esophageal Fistula Prediction from CT
Semi-Supervised Classification with Graph Convolutional Networks. arXiv 2016
Detecting Cardiovascular Disease from Mammograms with Deep Learning
Detecting and Locating Gastrointestinal Anomalies Using Deep Learning and Iterative Cluster Unification
Finding Structure in Time
Introducing Margin Information into CNN for Breast Cancer Diagnosis in Ultrasound Images
Short-Term Lesion Change Detection for Melanoma Screening with Novel Siamese Neural Network
Articulated Multi-Instrument 2-D Pose Estimation Using Fully Convolutional Networks
The Segmentation of the Left Ventricle of the Heart from Ultrasound Data Using Deep Learning Architectures and Derivative-Based Search Methods
A computer-aided diagnosis system for differentiation and delineation of malignant regions on whole-slide prostate histopathology image using spatial statistics and multidimensional DenseNet
A preliminary examination of the diagnostic value of deep learning in hip osteoarthritis
Computer-aided assessment of breast density: Comparison of supervised deep learning and feature-based statistical learning
Hybrid adversarial-discriminative network for leukocyte classification in leukemia
Digital mammographic tumor classification using transfer learning from deep convolutional neural networks
Pattern Classification for Gastrointestinal Stromal Tumors by Integration of Radiomics and Deep Convolutional Features
Refining diagnosis of Parkinson's disease with deep learning-based interpretation of dopamine transporter imaging
Shape Constrained Network for Eye Segmentation in the Wild
Convolutional Neural Network Architectures for the Automated Diagnosis of Celiac Disease
Improving Arterial Spin Labeling by Using Deep Learning
Accurate Segmentation of Cervical Cytoplasm and Nuclei Based on Multiscale Convolutional Network and Graph Partitioning
A hybrid learning approach for semantic labeling of cardiac CT slices and recognition of body position
A collaborative computer aided diagnosis (C-CAD) system with eye-tracking, sparse attentional model, and deep learning
Deep Learning for Prediction of Obstructive Disease From Fast Myocardial Perfusion SPECT: A Multicenter Study
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
Embedding Human Knowledge into Deep Neural Network via Attention Map. arXiv 2019
Artificial intelligence in healthcare: Past, present and future
Deep Learning and Medical Diagnosis: A Review of Literature
Neurocomputing: Picking the human brain
ImageNet Classification with Deep Convolutional Neural Networks
Extracting Structured Data from Web Pages
Fusion in Breast Cancer Histology Classification
Time Series Analysis for Psychological Research
Backpropagation Applied to Handwritten Zip Code Recognition
Unified Analysis Specific to the Medical Field in the Interpretation of Medical Images through the Use of Deep Learning. E-Health Telecommun.
A comprehensive survey of deep learning in the field of medical imaging and medical natural language processing: Challenges and research directions
Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions
Inconsistent Performance of Deep Learning Models on Mammogram Classification
International evaluation of an AI system for breast cancer screening
Applying deep learning in digital breast tomosynthesis for automatic breast cancer detection: A review
Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv
Deep learning for EEG-based Motor Imagery classification: Accuracy-cost trade-off
Long short-term memory
Deep Medical Image Reconstruction with Autoencoders using Deep Boltzmann Machine Training. EAI Endorsed Trans. Pervasive Health Technol. 2020, 6, e2
Medical Image Denoising Using Convolutional Denoising Autoencoders
Places: An Image Database for Deep Scene Understanding
Classification before Segmentation: Improved U-Net Prostate Segmentation
An Auto-Encoder Strategy for Adaptive Image Segmentation
Unsupervised pathology detection in medical images using conditional variational autoencoders
Deep Features Learning for Medical Image Analysis with Convolutional Autoencoder Neural Network
Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images
Learning representations by back-propagating errors
Retinal Lesion Detection with Deep Learning Using Image Patches
Pre-trained convolutional neural networks as feature extractors toward improved malaria parasite detection in thin blood smear images
Prediction of Tissue Outcome and Assessment of Treatment Effect in Acute Ischemic Stroke Using Deep Learning
Prediction of Bispectral Index during Target-controlled Infusion of Propofol and Remifentanil: A Deep Learning Approach
Multi-Site Diagnostic Classification of Schizophrenia Using Discriminant Deep Learning
Discriminating solitary cysts from soft tissue lesions in mammography using a pretrained deep convolutional neural network
Identification of autism spectrum disorder using deep learning and the ABIDE dataset
Disc-Aware Ensemble Network for Glaucoma Screening From Fundus Image
Correction: Acral melanoma detection using a convolutional neural network for dermoscopy images
Deep neural networks show an equivalent and often superior performance to dermatologists in onychomycosis diagnosis: Automatic construction of onychomycosis datasets by region-based convolutional deep neural network
Based on DICOM RT Structure and Multiple Loss Function Deep Learning Algorithm in Organ Segmentation of Head and Neck Image
Combining deep learning and level set for the automated segmentation of the left ventricle of the heart from cardiac cine magnetic resonance
Distilling the knowledge in a neural network
Classification of breast cancer histology images using Convolutional Neural Networks
Deep Transfer Network: Unsupervised Domain Adaptation. arXiv 2015
Tensor deep stacking networks
Learning deep representations by mutual information estimation and maximization
Breast Cancer Classification from Histopathological Images with Inception Recurrent Residual Convolutional Neural Network
Automatic Recognition of fMRI-Derived Functional Networks Using 3-D Convolutional Neural Networks
Automatic Knee Osteoarthritis Diagnosis from Plain Radiographs: A Deep Learning-Based Approach
A Sparse-View CT Reconstruction Method Based on Combination of DenseNet and Deconvolution
Designing a High-Performance Deep Learning Theoretical Model for Biomedical Image Segmentation by Using Key Elements of the Latest U-Net-Based Architectures
A Simple Encoder-Decoder Polyp Segmentation Neural Network that Achieves Over 0.9 Mean Dice and 86 FPS
Convolutional Neural Network (CNN) for gland images classification
Improve the performance of transfer learning without fine-tuning using dissimilarity-based multi-view learning for breast cancer histology images
Advanced Endoscopic Navigation: Surgical Big Data, Methodology, and Applications
Opportunities and challenges in developing deep learning models using electronic health records data: A systematic review
Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis
Convolution Neural Network: A Shallow Dive in to Deep Neural Net Technology
Harnessing the Power of Machine Learning in Dementia Informatics Research: Issues, Opportunities, and Challenges
Strategies to Improve Performance of Convolutional Neural Network on Histopathological Images Classification
Computer-Aided Histopathological Image Analysis Techniques for Automated Nuclear Atypia Scoring of Breast Cancer: A Review
Malaria detection using deep residual networks with mobile microscopy
Distributed machine learning cloud teleophthalmology IoT for predicting AMD disease progression
Effects of Hypertension, Diabetes, and Smoking on Age and Sex Prediction from Retinal Fundus Images
Automatic AMD identification in OCT volumetric data
Classification of Medical Images and Illustrations in the Biomedical Literature Using Synergic Deep Learning. arXiv 2017
A Deep Convolutional Neural Network for Lung Cancer Diagnostic. arXiv 2018
Assessment of the Robustness of Convolutional Neural Networks in Labeling Noise by Using Chest X-Ray Images from Multiple Centers
Curriculum learning
Diagnose like a Radiologist: Attention Guided Convolutional Neural Network for Thorax Disease Classification
Comparison and Fusion of Deep Learning and Radiomics Features of Ground-Glass Nodules to Predict the Invasiveness Risk of Stage-I Lung Adenocarcinomas in CT Scan
CASED: Curriculum Adaptive Sampling for Extreme Data Imbalance
Curriculum Deep Reinforcement Learning with Different Exploration Strategies: A Feasibility Study on Cardiac Landmark Detection
Knowledge-Driven Encode, Retrieve, Paraphrase for Medical Image Report Generation
Transfer Learning for Domain Adaptation in MRI: Application in Brain Lesion Segmentation
On the Automatic Generation of Medical Imaging Reports
Producing radiologist-quality reports for interpretable artificial intelligence. arXiv 2018
Collaborative Unsupervised Domain Adaptation for Medical Image Diagnosis
Attention-Guided Curriculum Learning for Weakly Supervised Classification and Localization of Thoracic Diseases on Chest Radiographs
Medical-based Deep Curriculum Learning for Improved Fracture Classification
Curriculum learning for annotation-efficient medical image analysis: Scheduling data with prior knowledge and uncertainty
Learn Like a Pathologist: Curriculum Learning by Annotator Agreement for Histopathology Image Classification
Learning to Recognize Thoracic Disease in Chest X-Rays With Knowledge-Guided Deep Zoom Neural Networks
Dual-Ray Net: Automatic Diagnosis of Thoracic Diseases Using Frontal and Lateral Chest X-rays
Mammographic Multi-view Mass Identification Networks
Semi-Supervised Medical Image Classification With Relation-Driven Self-Ensembling Model
Breast tumor classification using different features of quantitative ultrasound parametric images
A deep feature fusion methodology for breast cancer diagnosis demonstrated on three imaging modality datasets
Lung nodule classification by jointly using visual descriptors and deep features. In Medical Computer Vision and Bayesian and Graphical Models for Biomedical Imaging
Glaucoma diagnosis based on both hidden features and domain knowledge through deep learning models. Knowledge-Based Syst.
Deep Learning and Handcrafted Method Fusion: Higher Diagnostic Accuracy for Melanoma Dermoscopy Images
Characterization of Lung Nodule Malignancy Using Hybrid Shape and Appearance Features
Brain tumor detection using fusion of hand crafted and deep learning features
Unsupervised domain adaptation via disentangled representations: Application to cross-modality liver segmentation
Automated detection and classification of thyroid nodules in ultrasound images using clinical-knowledge-guided convolutional neural networks
A knowledge-driven feature learning and integration method for breast cancer diagnosis on multi-sequence MRI
Automatic Scoring of Multiple Semantic Attributes With Multi-Task Feature Leverage: A Study on Pulmonary Nodules in CT Images
Center-focusing multi-task CNN with injected features for classification of glioma nuclear images
Difficulty-Aware Glaucoma Classification with Multi-rater Consensus Modeling
Deep active self-paced learning for accurate pulmonary nodule segmentation
A new 2.5D representation for lymph node detection using random sets of deep convolutional neural network observations
Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning?
Deep transfer learning of virtual endoluminal views for the detection of polyps in CT colonography
Automatic Detection and Classification of Colorectal Polyps by Transferring Low-Level CNN Features From Nonmedical Domain
Synthesizing liver contrast-enhanced MRI to improve tumor detection
Hierarchical Convolutional Neural Networks for Segmentation of Breast Tumors in MRI With Application to Radiogenomics
Pulmonary Nodule Detection in CT Images: False Positive Reduction Using Multi-View Convolutional Networks
Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs
Detection and diagnosis of colitis on computed tomography using deep convolutional neural networks
Intervertebral disc detection in X-ray images using faster R-CNN
Domain specific convolutional neural nets for detection of architectural distortion in mammograms
Automated breast cancer diagnosis using deep learning and region of interest detection (BC-Droid)
Detection and attention: Diagnosing pulmonary lung cancer from CT by imitating physicians
One stage lesion detection based on 3D context convolutional neural networks
A deep learning approach to characterize 2019 coronavirus disease (COVID-19) pneumonia in chest CT images
Thrombus detection in CT brain scans using a convolutional neural network
Context-aware convolutional neural networks for stroke sign detection in non-contrast CT scans
A New Three-stage Curriculum Learning Approach for Deep Network Based Liver Tumor Segmentation
Digital breast tomosynthesis versus digital mammography: Integration of image modalities enhances deep learning-based breast mass classification
Classification and Detection in Mammograms with Weak Supervision via Dual Branch Deep Neural Net
Weakly-Supervised Self-Training for Breast Cancer Localization
Automatic detection of lung nodules: False positive reduction using convolution neural networks and handcrafted features
Lymph Node Gross Tumor Volume Detection in Oncology Imaging via Relationship Learning Using Graph Neural Network
Evaluating several ways to combine handcrafted features-based system with a deep learning system using the LUNA16 Challenge framework
Brain tumor segmentation with Deep Neural Networks
Automatic liver and tumor segmentation of CT and MRI volumes using cascaded fully convolutional neural networks
Deep convolutional networks for pancreas segmentation in CT imaging
Cascaded fully convolutional networks for automatic prenatal ultrasound image segmentation
3D U-net with multi-level deep supervision: Fully automatic segmentation of proximal femur in 3D MR images
One-shot domain adaptation in multiple sclerosis lesion segmentation using convolutional neural networks
Semantic-aware generative adversarial nets for unsupervised domain adaptation in chest X-ray segmentation
Knowledge distillation from multi-modal to mono-modal segmentation networks
Unsupervised domain adaptation in brain lesion segmentation with adversarial networks
Annotation-free cardiac vessel segmentation via knowledge transfer from retinal images
Generative adversarial networks to segment skin lesions
Generative adversarial learning for reducing manual annotation in semantic segmentation on large scale microscopy images: Automated vessel segmentation in retinal fundus image as test case
Unsupervised anomaly detection with generative adversarial networks to guide marker discovery
An adaptive sampling scheme to efficiently train fully convolutional networks for semantic segmentation
Curriculum semi-supervised segmentation
Semi-supervised self-taught deep learning for finger bones segmentation
Weakly supervised vessel segmentation in X-ray angiograms by self-paced learning from noisy labels with suggestive annotation
Joint learning for pulmonary nodule segmentation, attributes and malignancy prediction
Learning Shape Priors for Robust Cardiac MR Segmentation from Multi-view Images
End-to-end boundary aware networks for medical image segmentation
DeepTarget: Gross tumor and clinical target volume segmentation in esophageal cancer radiotherapy
Automated sub-cortical brain structure segmentation combining spatial and deep convolutional features
Cascading handcrafted features and Convolutional Neural Network for IoT-enabled brain tumor segmentation
Combining Deep Learning with Handcrafted Features for Cell Nuclei Segmentation
Medical knowledge constrained semantic breast ultrasound image segmentation
Cardiac Segmentation With Strong Anatomical Guarantees
Cardiac MRI segmentation with strong anatomical guarantees
Cardiac segmentation from LGE MRI using deep neural network incorporating shape and spatial priors
Star shape prior in fully convolutional networks for skin lesion segmentation
Learning and incorporating shape models for semantic segmentation
Semi-supervised segmentation of liver using adversarial learning with deep atlas prior
Anatomically Constrained Neural Networks (ACNNs): Application to Cardiac Image Enhancement and Segmentation
Anatomical priors in convolutional networks for unsupervised biomedical segmentation
DPA-DenseBiasNet: Semi-supervised 3D fine renal artery segmentation with dense biased network and deep priori anatomy
Shape Mask Generator: Learning to Refine Shape Priors for Segmenting Overlapping Cervical Cytoplasms
Combining shape priors with conditional adversarial networks for improved scapula segmentation in MR images
Liver Segmentation in CT with MRI Data: Zero-Shot Domain Adaptation by Contour Extraction and Shape Priors
Agan: An anatomy corrector conditional generative adversarial network
FocusNetv2: Imbalanced large and small organ segmentation with adversarial shape constraint for head and neck CT images
Deep De-Aliasing Generative Adversarial Networks for Fast Compressed Sensing MRI Reconstruction
Content-Based Brain Tumor Retrieval for MR Images Using Transfer Learning
A sequential search-space shrinking using CNN transfer learning and a Radon projection pool for medical image retrieval
A comparative study for chest radiograph image retrieval using binary texture and deep learning classification
Visualizing and enhancing a deep learning framework using patients age and gender for chest X-ray image retrieval
SiNC: Saliency-injected neural codes for representation and efficient retrieval of medical radiographs
The Use of Artificial Intelligence on Segmental Volumes, Constructed from MRI and CT Images, in the Diagnosis and Staging of Cervical Cancers and Thyroid Cancers: A Study Protocol for a Randomized Controlled Trial
The Use of Artificial Intelligence on Colposcopy Images, in the Diagnosis and Staging of Cervical Precancers: A Study Protocol for a Randomized Controlled Trial
Opportunities and obstacles for deep learning in biology and medicine
Unsupervised domain adaptation with residual transfer networks
Adversarial discriminative domain adaptation
Taking a closer look at domain shift: Category-level adversaries for semantics consistent domain adaptation
Learning to adapt structured output space for semantic segmentation
Adversarial Domain Adaptation from CT to MRI for Lung Cancer Segmentation
Integrate domain knowledge in training CNN for ultrasonography breast cancer diagnosis
Knowledge graph and text jointly embedding
Towards a Better Understanding of Transfer Learning for Medical Imaging: A Case Study
Novel Transfer Learning Approach for Medical Imaging with Limited Labeled Data
A survey on neural architecture search. arXiv 2019
Organ at risk segmentation for head and neck cancer using stratified learning and neural architecture search