articles+ search results

205 articles+ results

View results as:
Number of results to display per page

1. Polos: Multimodal Metric Learning from Human Feedback for Image Captioning

2. Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine

3. DialMAT: Dialogue-Enabled Transformer with Moment-Based Adversarial Training

4. Fully Automated Task Management for Generation, Execution, and Evaluation: A Framework for Fetch-and-Carry Tasks with Natural Language Instructions in Continuous Space

5. JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models

6. Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions

7. Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks

8. Prototypical Contrastive Transfer Learning for Multimodal Language Understanding

9. Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query

14. Visual Explanation of Deep Q-Network for Robot Navigation by Fine-tuning Attention Branch

16. LatteGAN: Visually Guided Language Attention for Multi-Turn Text-Conditioned Image Manipulation

17. Operational solar flare prediction model using Deep Flare Net

18. Target-dependent UNITER: A Transformer-Based Multimodal Language Comprehension Model for Domestic Service Robots

19. Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions

22. CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation

23. Predicting and Attending to Damaging Collisions for Placing Everyday Objects in Photo-Realistic Simulations

24. Alleviating the Burden of Labeling: Sentence Generation by Attention Branch Encoder-Decoder Network

25. Reliable Probability Forecast of Solar Flares: Deep Flare Net-Reliable (DeFN-R)

28. A Multimodal Target-Source Classifier with Attention Branches to Understand Ambiguous Instructions for Fetching Daily Objects

29. Multimodal Attention Branch Network for Perspective-Free Sentence Generation

30. Understanding Natural Language Instructions for Fetching Daily Objects Using GAN-Based Multimodal Target-Source Classification

32. A Multimodal Classifier Generative Adversarial Network for Carry and Place Tasks from Ambiguous Language Instructions

33. SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Networks by Drones

34. Deep Flare Net (DeFN) model for solar flare prediction

38. Grounded Language Understanding for Manipulation Instructions Using GAN-Based Classification


Books, media, physical & digital resources


Course- and topic-based guides to collections, tools, and services.