9:30 Opening Comments
9:45 Keynote Speaker - Kristen Grauman
10:30 Break
11:00 Spotlight Presentations:
  • 11:00 - Mehrnoosh Sameki (5 min talk + 2 min Q&A)
  • 11:07 - Shay Sheinfeld (5 min talk + 2 min Q&A)
  • 11:14 - Apeksha Kumava (5 min talk + 2 min Q&A)
  • 11:21 - Sneha Mehta (5 min talk + 2 min Q&A)
  • 11:28 - Darius Lam (5 min talk + 2 min Q&A)
  • 11:35 - Abhisek Dash (5 min talk + 2 min Q&A)
11:45 Invited speaker - Ariel Shamir
12:15 Invited speaker - Kavita Bala
12:45 Lunch sponsored by Evolv Technology
2:15 Sponsor talk - Brendan McCord
2:30 Invited speaker - Sanja Fidler
3:00 Invited speaker - Kotaro Hara
3:30 Break
4:00 Poster Session
5:00 Open Discussion
5:15 Best Paper Award (sponsored by Evolv Technology) & Closing Comments

Accepted Papers

Encore Track Papers (previously appearing at other venues)

Keynote Speaker

Kristen Grauman

University of Texas at Austin

"My research interests are in computer vision and machine learning. In general, the goal of computer vision is to develop the algorithms and representations that will allow a computer to autonomously analyze visual information. I am especially interested in learning and recognizing visual object categories, and scalable methods for content-based retrieval and visual search."

Invited Speakers

Kavita Bala


Crowdsourcing for Material Recognition in the Wild

Human beings are good at perceiving subtle distinctions in material appearance (e.g., is this fabric silk or velvet?), but computers lag far behind in the task of material recognition. I will describe how we collected several large-scale, crowdsourced datasets of materials and reflectance comparisons from consumer photographs, and how we used these datasets to achieve state-of-the-art results in material recognition, intrinsic image decomposition, and material-based image browsing and design.

Kavita Bala is a Professor in the Computer Science Department and Program of Computer Graphics at Cornell University. Prof. Bala specializes in computer graphics and computer vision, leading research projects in material perception, recognition, and acquisition; realistic rendering; perception; and computational lighting design.

Ariel Shamir

Efi Arazi School of Computer Science at the Interdisciplinary Center Israel

Passive Human Computation

Image and video processing have gone a long way in recent years with automatic algorithms. However, some simple tasks that even a child can do, still pose a challenge for automatic algorithms. Especially if these tasks involve some semantic understanding. Children and humans often learn by observing others. In this talk I will present some examples where algorithms for image and video processing learn semantic understanding by observing human behaviors. This can be seen as a form of passive human computation - humans must participate in the computation process, but they are not actively computing anything.

Ariel Shamir is interested in geometric modeling, computer graphics, fabrication, visualization, and machine learning. Besides his professorial work, he is or was affiliated with Mitsubishi Electric Research Labs, Disney Research, NASA, Lawrence Livermore National Labs, and a number of high-tech companies in Israel. Prof. Shamir has been a part of many seminal projects in computer vision and computational photography such as Seam Carving, Sketch2Photo, and 3-Sweep.

Sanja Fidler

University of Toronto

"My work is in the area of Computer Vision. My main research interests are 2D and 3D object detection, particularly scalable multi-class detection, object segmentation and image labeling, and (3D) scene understanding. I am also interested in the interplay between language and vision: generating sentential descriptions about complex scenes, as well as using textual descriptions for better scene parsing (e.g., in the scenario of the human-robot interaction)."

Kotaro Hara

University of Maryland, College Park

Using Crowdsourcing, Computer Vision, and Google Street View to Collect Sidewalk Accessibility Data

Poorly maintained sidewalks pose considerable accessibility challenges for people with mobility impairments. Despite comprehensive civil rights legislation of Americans with Disabilities Act, many city streets and sidewalks in the U.S. remain inaccessible. The problem is not just that sidewalk accessibility fundamentally affects where and how people travel in cities, but also that there are few, if any, mechanisms to determine accessible areas of a city a priori.

In this talk, I will introduce a scalable data collection method for acquiring street-level accessibility information using a combination of crowdsourcing, computer vision, automatic workflow controller, and Google Street View. Our work shows that by combining crowdsourcing and automated methods, we can increase data collection efficiency by 13% compared to a fully manual approach without sacrificing accuracy. Our overarching goal is to transform the ways in which accessibility information is collected and visualized for every sidewalk, street, and building facade in the world.

"My research focuses on design of systems to collect and deliver information about accessibility of streets and sidewalks that help people with mobility impairments. Using geo-tagged street-level imagery like Google Street View as data source, we locate sidewalk accessibility problems with crowdsourcing and computer vision technologies."

M. Brendan McCord

Director at Evolv Technology

AI + IQ: Building Best of Breed Security Systems

Evolv, a Boston-based startup with a mission to keep people safe, is developing a new real-time threat detection and prevention platform called Mosaiq that utilizes the judgment of distributed human "Agents" to augment and train state of the art deep neural networks. We'll review the challenges in dividing tasks between AI and IQ as well as in obtaining high quality real-time human judgments. Mr. McCord and Evolv Principal Software Engineer Mr. Brandon Wolfe will share learnings from product development and pose open-ended technical problems to the audience.

Brendan McCord is Director at Evolv Technology, a startup backed by Bill Gates, General Catalyst, and Lux Capital that is comprised of a multi-disciplinary team of experts. Evolv's goal is to build the most advanced threat detection system the world has ever known by combining powerful sensors, AI, and human IQ. Mosaiq was built for today's increasingly dangerous world, and it introduces a proactive approach to security that shatters the status quo.