Handbook on Neural Information Processing


Deep Learning of RepresentationsIntroductionWhy Learn Representations?Why Distributed Representations?Why Deep?Why Semi-supervised or Unsupervised Learning?Deep Learning of Representations: A Review and Recent TrendsGreedy Layerwise Pre-trainingUndirected Graphical Models and Boltzmann MachinesThe Restricted Boltzmann MachineThe Zoo: Auto-Encoders, Sparse Coding, Predictive Sparse Decomposition, Denoising Auto-Encoders, Score Matching, and MoreConvolutional ArchitecturesLocal Receptive Fields and Weight SharingFeature PoolingLearning Invariant Feature SetsDealing with Factors of Variation: Invariant FeaturesInvariance via SparsityTeasing Apart Explanatory Factors via Slow Features AnalysisLearning to Pool FeaturesBeyond Learning Invariant FeaturesDisentangling Factors of VariationOn the Importance of Top-Down ConnectionsConclusionReferencesRecurrent Neural NetworksIntroductionArchitectureConnectionist Network TopologiesNon-local Recurrent Globally Feedforward (NLRGF) NetworksSpecific ArchitecturesTime Delay Neural Networks (TDNN)Williams-Zipser Recurrent NetworksPartially-Connected Recurrent NetworksState-Space Recurrent NetworksSecond-Order Recurrent NetworksNonlinear Autoregressive Model with Exogenous Inputs (NARX) Recurrent NetworksMemoryDelayed Activations as MemoryShort-Term Memory and Generic PredictorTypes of Memory KernelsModular Components with Different ParametersModular Components with Identical ParametersLearningRecurrent Back-Propagation: Learning with Fixed PointsTraditional Back-Propagation AlgorithmBack-Propagation through Time: Learning with Non-fixed PointsReal-Time Recurrent LearningLong-Term DependenciesModelingFinite State AutomataBeyond Finite State AutomataApplicationsNatural Language ProcessingIdentification and Control of Dynamical SystemsConclusionReferencesSupervised Neural Network Models for Processing GraphsGraphsNeural Models for Graph ProcessingThe Graph Neural Network ModelProcessing DAGs with Recursive Neural NetworksDPAGsDAGs-LESupervised Learning for Graph Neural NetworksLearning ObjectiveLearning Procedure for GNNsGradient Computation for Graph Neural NetworksOutput networkState transition functionLearning Procedure for Recursive Neural NetworksForward stepBackward stepSummaryReferencesTopics on Cellular Neural NetworksThe CNN ConceptThe ArchitectureMathematical DescriptionOther Tasks CNN’s Can Accomplish - The CNN Universal MachineA Particular ArchitectureThe Architecture and the EquationsThe Decoupling TechniqueParticular CasesImplementation IssuesUsing OTA’sUsing Log-Domain TechniquesLog-Domain Transistor Level SimulationsComparison between 1D OTA and Log-Domain ImplementationsA “Toy” Application: 1D “Edge” DetectionA 2D Log-Domain ArchitectureComparison between 2D OTA and Log-Domain Implementations for Applications Using 64x64 Log-Domain ArchitecturesImage SegmentationTwo-Grid Coupled CNN’sThe Architecture and the EquationsThe CellsThe InterconnectionsThe EquationsThe Decoupling TechniqueBoundary Conditions (BC’s) and Their Influence on Pattern FormationDispersion CurveTuring Pattern Formation MechanismBoundary Conditions in 2D CNN’sAn ApplicationResults Obtained with Ideal Circular FiltersResults with CNN Spatial FiltersReferencesApproximating Multivariable Functions by Feedforward Neural NetsIntroductionDictionaries and Variable-Basis ApproximationThe Universal Approximation PropertyQuadratic Rates of ApproximationGeometric Rates of ApproximationApproximation of Balls in Variational NormsBest Approximation and Non-continuity of ApproximationTractability of ApproximationA Shift in Point-of-View: Complexity and DimensionMeasuring Worst-Case Error in ApproximationGaussian RBF Network TractabilityPerceptron Network TractabilityDiscussionSummary of Main NotationsReferencesBochner Integrals and Neural NetworksIntroductionVariational Norms and CompletenessBochner IntegralsSpaces of Bochner Integrable FunctionsMain TheoremAn Example Involving the Bessel PotentialApplication: A Gamma Function InequalityTensor-Product InterpretationAn Example Involving Bounded Variation on an IntervalPointwise-Integrals vs. Bochner IntegralsEvaluation of Bochner IntegralsEssential Boundedness Is Needed for the Main TheoremConnection with Sup NormSome Concluding RemarksAppendix I: Some Banach Space BackgroundAppendix II: Some Key TheoremsReferencesSemi-supervised LearningIntroductionSemi-supervised LearningSelf-TrainingSSL with Generative ModelsSemi-supervised SVMs (S3VMs)Semi-supervised Learning with GraphsSemi-supervised Learning with Committees (SSLC)SSLC with Multiple ViewsSSLC with Single ViewFor ClassificationFor RegressionCombination with Active LearningSSL with GraphsSSL with Generative ModelsSSL with CommitteesConclusionReferencesStatistical Relational LearningIntroductionAttribute-Value LearningRelational LearningMapping Relational Data to Attribute-Value DataSummary of This SectionRelational Learning: Tasks and FormalismsInductive Logic ProgrammingLearning from GraphsMulti-relational Data MiningNeural Network Based Approaches to Relational LearningCIL P-2 Relational Neural Networks-3 Graph Neural NetworksStatistical Relational LearningStructuring Graphical ModelsApproaches in the Relational Database SettingApproaches in the Logical SettingProbabilistic LogicsExamples of FormalismsOther ApproachesGeneral Remarks and ChallengesUnderstanding Commonalities and DifferencesParameter Learning and Structure LearningScalabilityRecommended ReadingReferencesKernel Methods for Structured DataA Gentle Introduction to Kernel MethodsMathematical FoundationsKernelsSupervised Learning with KernelsKernel Machines for Structured InputSVM for Binary ClassificationSVM for RegressionSmallest Enclosing HypersphereKernel Principal Component AnalysisKernels on Structured DataBasic KernelsKernel CombinationKernels on Discrete StructuresStringsTreesGraphsKernels from Generative ModelsDynamic Alignment KernelsFisher KernelMarginalized KernelsKernels on Logical RepresentationsKernels on Ground TermsKernels on Proof TreesLearning KernelsLearning Kernel CombinationsLearning Logical KernelsSupervised Kernel Machines for Structured OutputConclusionsReferencesMultiple Classifier Systems: Theory, Applications and ToolsMCS TheoryMCS ArchitecturesConditional TopologyHierarchical (Serial) TopologyMultiple (Parallel) TopologyHybrid TopologyCombining RulesMajority VotingWeighted Majority VotingBehavior-Knowledge SpaceBayesian CombinationThe Dempster-Shafer ApproachStrategies for Constructing a Classifier EnsembleBoostingBaggingStacked GeneralizationRandom Subspace MethodError-Correcting Output Codes (ECOC)ApplicationsRemote-Sensing Data AnalysisDocument AnalysisBiometricsFigure and GroundMedical Diagnosis SupportChemistry and BiologyTime Series Prediction/AnalysisImage and Video AnalysisComputer and Network SecurityMiscellaneaToolsTool CategorizationWekaKNIMEPRToolsConclusionsReferencesSelf Organisation and Modal Learning: Algorithms and ApplicationsIntroductionSnap-Drift Neural NetworkDescriptionArchitectureAlgorithmSnap-Drift Self-Organising MapDescriptionArchitectureAlgorithmApplicationsApplications of SDNN and SDSOM to Publicly Available DataDescriptionofDataExperiments and ResultsApplication of SDNN to E-LearningConclusions and Future WorkReferencesIntroductionBayesian NetworksBayesian Network TheoryBayesian Network ModelingAn Example Application: Medical DiagnosisModelingReasoningKnowledge RepresentationDiagnostic ReasoningDiscussionBonaparte: A Bayesian Network for Disaster Victim IdentificationLikelihood Ratio of Two HypothesesDNA ProfilesA Bayesian Network for Kinship AnalysisAllele ProbabilitiesObservationsInferenceThe ApplicationSummaryA Petrophysical Decision Support SystemProbabilistic ModelingThe Prior and the Observation ModelBayesian InferenceDecision SupportThe ApplicationSummaryDiscussionReferencesRelevance Feedback in Content-Based Image Retrieval: A SurveyIntroductionContent-Based Image RetrievalLow-Level Feature ExtractionSimilarity MeasureMinkowski-Form DistanceKullback-Leibler DivergenceEarth Mover’s DistanceClassification MethodsArtificial Neural NetworksSVMEnsemble LearningBaggingBoostingCurrent DatabasesCorel Image GalleryTinyImageImageNetShort-Term Learning RFOne-ClassTwo-ClassMulti-classLong-Term Learning RFLatent Semantic Indexing-Based TechniquesCorrelation-Based ApproachesClustering-Based AlgorithmsFeature Representation-Based MethodsSimilarity Measure Modification-Based ApproachesOthersSummaryReferencesLearning Structural Representations of Text Documents in Large Document CollectionsIntroductionRepresentation of Unstructured or Semi-structured Text DocumentsGeneral Framework for Processing Graph Structured DataSelf Organizing Maps for StructuresGraph Neural NetworksClustering of the Wikipedia DatasetDiscussion of ResultsRanking of DocumentsRelated WorkConclusionsReferencesNeural Networks in BioinformaticsIntroductionAnalyzing DNA SequencesExample ApplicationConclusionPeptide Sequence AnalysisExample ApplicationConclusionDiagnostic PredictionsExample ApplicationConclusionConclusionReferences
 
Next >