An autonomous vehicle is able to navigate city streets and other less-busy environments by recognizing pedestrians, other vehicles and potential obstacles through artificial intelligence. This is achieved with the help of artificial neural networks, which are trained to “see” the car’s surroundings, mimicking the human visual perception system.
But unlike humans, cars using artificial neural networks have no memory of the past and are in a constant state of seeing the world for the first time – no matter how many times they’ve driven down a particular road before. This is especially problematic in adverse weather conditions, when the car cannot safely rely on its sensors.
Researchers at the Cornell Ann S. Bowers College of Computing and Information Science and the College of Engineering have produced three concurrent research papers with the goal of overcoming this limitation by providing the car with the ability to create “examples” of past experiences and use them. in future navigation.
Doctoral student Yurong You is the lead author of “HINDSIGHT 20/20: Leveraging Past Traversals to Aid 3D Perception,” which you presented virtually in April at ICLR 2022, the International Conference on Learning Representations. “Learning representations” includes deep learning, a kind of machine learning.
“The fundamental question is, can we learn from repeated traversals?” said senior author Kilian Weinberger, professor of computer science at Cornell Bowers CIS. “For example, a car may mistake a weirdly shaped tree for a pedestrian its first time its laser scanner perceives it from a distance, but once it is close enough, the object category will become clear. So the second time you drive past the very same tree, even in the fog or snow, you would hope that the car has now learned to recognize it correctly. “
“In reality, you rarely drive for a very first time,” said co-author Katie Luo, a doctoral student in the research group. “Either you or someone else has driven it recently, so it just seems natural to collect and experience it.”
Spearheaded by doctoral student Carlos Diaz-Ruiz, the group compiled a dataset driving a car equipped with LiDAR (Light Detection and Ranging) sensors repeatedly along a 15-kilometer loop in and around Ithaca, 40 times over an 18-month period. The traversals capture changing environments (highway, urban, campus), weather conditions (sunny, rainy, snowy) and times of day.
This resulting dataset – which the group refers to as Ithaca365, and which is the subject of one of the other two papers – has more than 600,000 scenes.
“It deliberately exposes one of the key challenges in self-driving cars: poor weather conditions,” said Diaz-Ruiz, co-author of the Ithaca365 paper. “If the street is covered by snow, humans can rely on params, but without examples a neural network is heavily disadvantaged.”
HINDSIGHT is an approach that uses neural networks to compute descriptors of objects as their car passes them. It then compresses these descriptions, which the group has dubbed SQuaSH (Spatial-Quantized Sparse History) features, and stores them on a virtual map, similar to a “memory” stored in a human brain.
The next time the self-driving car traverses the same location, it can query every local LiDAR point of the SQuaSH database along the route and “remember” what it learned last time. The database is constantly updated and shared across vehicles, thus enriching the information available to perform recognition.
“This information can be added as features to any LiDAR-based 3D object detector;” You said. “Both the detector and the SQuaSH representation can be jointly trained without any additional supervision, or human annotation, which is time- and labor-intensive.”
While HINDSIGHT still assumes that the artificial neural network is already trained to detect objects and augments it with the capability to create it, MODEST (Mobile Object Detection with Ephemerality and Self-Training) – the third publication of the subject – goes even further.
Here, the authors let the car learn the entire concept of pipeline from scratch. Initially the artificial neural network in the vehicle has never been exposed to any objects or streets at all. Through multiple traversals of the same route, it can learn what parts of the environment are stationary and which are moving objects. Slowly it teaches itself what constitutes other traffic participants and what is safe to ignore.
The algorithm can then detect these objects reliably – even on roads that were not part of the initial repeated traversals.
The researchers hope that both approaches can dramatically reduce the cost of developing autonomous vehicles (which currently relies heavily on costly human annotated data) and make such vehicles more efficient by learning to navigate the locations in which they are used.
Both Ithaca365 and MODEST will be presented at the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), held June 19-24 in New Orleans.
Other contributors include Mark Campbell, the John A. Mellowes ’60 Professor in Mechanical Engineering at the Sibley School of Mechanical and Aerospace Engineering, assistant professors Bharath Hariharan and Wen Sun, from computer science at Bowers CIS; Former postdoctoral researcher Wei-Lun Chao, now an assistant professor of computer science and engineering at Ohio State; and doctoral students Cheng Perng Phoo, Xiangyu Chen and Junan Chen.
The research for all three papers was supported by grants from the National Science Foundation; the Office of Naval Research; and the Semiconductor Research Corporation.
Editor’s Note: This article was originally published in The Cornell Chronicle.
General Chat Chat Lounge