US20210229673A1 - Seamless driver authentication using an in-vehicle camera in conjunction with a trusted mobile computing device - Google Patents
Seamless driver authentication using an in-vehicle camera in conjunction with a trusted mobile computing device Download PDFInfo
- Publication number
- US20210229673A1 US20210229673A1 US16/764,322 US201916764322A US2021229673A1 US 20210229673 A1 US20210229673 A1 US 20210229673A1 US 201916764322 A US201916764322 A US 201916764322A US 2021229673 A1 US2021229673 A1 US 2021229673A1
- Authority
- US
- United States
- Prior art keywords
- user
- vehicle
- feature data
- image
- face
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 claims abstract description 105
- 239000013598 vector Substances 0.000 claims description 68
- 239000002131 composite material Substances 0.000 claims description 11
- 238000010801 machine learning Methods 0.000 claims description 2
- 230000001815 facial effect Effects 0.000 description 61
- 230000008569 process Effects 0.000 description 31
- 230000004044 response Effects 0.000 description 26
- 210000003128 head Anatomy 0.000 description 23
- 238000004891 communication Methods 0.000 description 21
- 230000006870 function Effects 0.000 description 17
- 230000015654 memory Effects 0.000 description 15
- 238000010586 diagram Methods 0.000 description 12
- 230000003287 optical effect Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 6
- 238000013136 deep learning model Methods 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 239000004973 liquid crystal related substance Substances 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 4
- 238000013179 statistical model Methods 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 238000013500 data storage Methods 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000001537 neural effect Effects 0.000 description 3
- 238000004806 packaging method and process Methods 0.000 description 3
- 229920003023 plastic Polymers 0.000 description 3
- 239000004033 plastic Substances 0.000 description 3
- 238000010897 surface acoustic wave method Methods 0.000 description 3
- 230000000007 visual effect Effects 0.000 description 3
- 229910000831 Steel Inorganic materials 0.000 description 2
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 2
- 229910052782 aluminium Inorganic materials 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 229920001971 elastomer Polymers 0.000 description 2
- 239000000835 fiber Substances 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 239000010959 steel Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 125000000391 vinyl group Chemical group [H]C([*])=C([H])[H] 0.000 description 2
- 229920002554 vinyl polymer Polymers 0.000 description 2
- 241000282412 Homo Species 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000004378 air conditioning Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 210000000887 face Anatomy 0.000 description 1
- 210000004247 hand Anatomy 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 238000012776 robust process Methods 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000001931 thermography Methods 0.000 description 1
- 238000003325 tomography Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 210000000707 wrist Anatomy 0.000 description 1
Images
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60R—VEHICLES, VEHICLE FITTINGS, OR VEHICLE PARTS, NOT OTHERWISE PROVIDED FOR
- B60R25/00—Fittings or systems for preventing or indicating unauthorised use or theft of vehicles
- B60R25/20—Means to switch the anti-theft system on or off
- B60R25/2027—Means to switch the anti-theft system on or off with data signals passing through the human body
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/08—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to drivers or passengers
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60R—VEHICLES, VEHICLE FITTINGS, OR VEHICLE PARTS, NOT OTHERWISE PROVIDED FOR
- B60R25/00—Fittings or systems for preventing or indicating unauthorised use or theft of vehicles
- B60R25/20—Means to switch the anti-theft system on or off
- B60R25/24—Means to switch the anti-theft system on or off using electronic identifiers containing a code not memorised by the user
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60R—VEHICLES, VEHICLE FITTINGS, OR VEHICLE PARTS, NOT OTHERWISE PROVIDED FOR
- B60R25/00—Fittings or systems for preventing or indicating unauthorised use or theft of vehicles
- B60R25/20—Means to switch the anti-theft system on or off
- B60R25/25—Means to switch the anti-theft system on or off using biometry
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/30—Authentication, i.e. establishing the identity or authorisation of security principals
- G06F21/31—User authentication
- G06F21/32—User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
-
- G06K9/00241—
-
- G06K9/00288—
-
- G06K9/00845—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/59—Context or environment of the image inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions
- G06V20/597—Recognising the driver's state or behaviour, e.g. attention or drowsiness
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
- G06V40/164—Detection; Localisation; Normalisation using holistic features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/172—Classification, e.g. identification
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/08—Network architectures or network communication protocols for network security for authentication of entities
- H04L63/0861—Network architectures or network communication protocols for network security for authentication of entities using biometrical features, e.g. fingerprint, retina-scan
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W12/00—Security arrangements; Authentication; Protecting privacy or anonymity
- H04W12/06—Authentication
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/08—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to drivers or passengers
- B60W2040/0809—Driver authorisation; Driver identity check
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W12/00—Security arrangements; Authentication; Protecting privacy or anonymity
- H04W12/60—Context-dependent security
- H04W12/69—Identity-dependent
- H04W12/72—Subscriber identity
Definitions
- Vehicles such as automobiles, motorcycles, aircraft, and watercraft, may include one or more computing systems for performing functions and providing occupants of the vehicles with information, entertainment, assistance, and/or environmental control.
- an automobile may include an entertainment system for playing music, videos, or other content, a navigation system for providing information and navigational assistance, a temperature control system for heating or cooling the in-vehicle cabin, a control system for adjusting various components or features of the car, such as a sun roof or window shades, or an “infotainment system” that performs some or all of these aforesaid functions.
- Modern vehicles are equipped with an infotainment head unit (IHU) having a display device (e.g., presence-sensitive display) and compute engine, which is configured to execute an operating system and one or more applications.
- IHU infotainment head unit
- the present application describes techniques for performing a seamless authentication of a driver of a vehicle using an in-vehicle camera and a trusted mobile computing device that is communicatively coupled to the vehicle.
- the trusted mobile computing device may initially perform an enrollment process to capture data (e.g., facial feature data associated with a captured image) of a face of a known user in a variety of different poses.
- the trusted device may assign each group of data of the known user to a respective pose bucket of a group of pose buckets.
- the trusted device enrolls the data by associating data of the known user with a user account for the known user.
- the trusted device may subsequently receive authentication data (e.g., computed test data, such as computed facial feature information) of an unknown user who is situated inside of the vehicle, where an authentication image is captured using the in-vehicle camera of the vehicle, and where computed data (e.g., feature data) is obtained from the image and used by the trusted device.
- authentication data e.g., computed test data, such as computed facial feature information
- computed data e.g., feature data
- the trusted device may authenticate the unknown user by comparing the authentication feature data for the unknown user to enrolled feature data of the known user. Based on the comparison, the trusted device sends a result of the authentication process to the vehicle, which may then perform login operations at the infotainment head unit of the vehicle that are customized for the authenticated user (e.g., based on that user's profile and/or account information).
- the trusted mobile computing device may send the enrolled feature data to the vehicle computing system, which then may be configured to perform authentication of the unknown user in the vehicle by comparing the authentication feature data for the unknown user to the received enrolled feature data of the known user.
- a method includes establishing, by a mobile computing device, a connection with a vehicle computing system of a vehicle, and, after establishing the connection, receiving, by the mobile computing device and from the vehicle computing system, first feature data associated with at least one image of a face of a user of the vehicle, wherein the at least one image of the face of the user of the vehicle is captured by an image capture device connected to at least a portion of the vehicle.
- the example method further includes determining, by the mobile computing device and based on a comparison between the first feature data and second feature data associated with at least one image of a face of a previously enrolled user of the mobile computing device, a match between the user of the vehicle and the previously enrolled user, authenticating, by the mobile computing device and based on the match, the user of the vehicle and sending, by mobile computing device and to the vehicle computing system, authentication data for the user of the vehicle, wherein the authentication data is indicative of the match.
- a computer-readable storage medium stores instructions that, when executed, cause at least one processor to: establish a connection with a vehicle computing system of a vehicle; after establishing the connection, receive, from the vehicle computing system, first feature data associated with at least one image of a face of a user of the vehicle, wherein the at least one image of the face of the user of the vehicle is captured by an image capture device connected to at least a portion of the vehicle; determine, based on a comparison between the first feature data and second feature data associated with at least one image of a face of a previously enrolled user of the mobile computing device, a match between the user of the vehicle and the previously enrolled user; authenticate, based on the match, the user of the vehicle; and send, to the vehicle computing system, authentication data for the user of the vehicle, wherein the authentication data is indicative of the match.
- a mobile computing device includes at least one processor and at least one computer-readable storage device.
- the at least one computer-readable storage device store instructions that, when executed by the at least one processor, cause the at least one processor to: establish a connection with a vehicle computing system of a vehicle; after establishing the connection, receive, from the vehicle computing system, first feature data associated with at least one image of a face of a user of the vehicle, wherein the at least one image of the face of the user of the vehicle is captured by an image capture device connected to at least a portion of the vehicle; determine, based on a comparison between the first feature data and second feature data associated with at least one image of a face of a previously enrolled user of the mobile computing device, a match between the user of the vehicle and the previously enrolled user; authenticate, based on the match, the user of the vehicle; and send, to the vehicle computing system, authentication data for the user of the vehicle, wherein the authentication data is indicative of the match.
- a method in another example, includes establishing, by a vehicle computing system of a vehicle, a connection with a mobile computing device, wherein the vehicle computing system includes an infotainment head unit, determining, by the vehicle computing system, a presence of a user inside the vehicle, and, after determining the presence of the user inside the vehicle, capturing, by the vehicle computing system using an image capture device that is connected to at least a portion of the vehicle, at least one image of a face of the user.
- the example method further includes determining, by the vehicle computing system, first feature data associated with the at least one image of the face of the user, receiving, by the vehicle computing system, authentication data for the user, wherein the authentication data indicates a match between the user and a previously enrolled user based on a comparison between the first feature data and second feature data associated with at least one image of a face of the previously enrolled user, and accessing, by the vehicle computing system and based on the authentication data for the user, user account information to log the user into the infotainment head unit.
- a computer-readable storage medium stores instructions that, when executed, cause at least one processor to: establish a connection with a mobile computing device; determine a presence of a user inside a vehicle; after determining the presence of the user inside the vehicle, capture, using an image capture device that is connected to at least a portion of the vehicle, at least one image of a face of the user; determine first feature data associated with the at least one image of the face of the user; receive authentication data for the user, wherein the authentication data indicates a match between the user and a previously enrolled user based on a comparison between the first feature data and second feature data associated with at least one image of a face of the previously enrolled user; and access, based on the authentication data for the user, user account information to log the user into a infotainment head unit.
- a vehicle computing system includes at least one processor and at least one computer-readable storage device.
- the at least one computer-readable storage device store instructions that, when executed by the at least one processor, cause the at least one processor to: establish a connection with a mobile computing device, wherein the vehicle computing system includes an infotainment head unit; determine a presence of a user inside a vehicle; after determining the presence of the user inside the vehicle, capture, using an image capture device that is connected to at least a portion of the vehicle, at least one image of a face of the user; determine first feature data associated with the at least one image of the face of the user; receive authentication data for the user, wherein the authentication data indicates a match between the user and a previously enrolled user based on a comparison between the first feature data and second feature data associated with at least one image of a face of the previously enrolled user; and access, based on the authentication data for the user, user account information to log the user into the infotainment head unit.
- FIG. 1 is a conceptual diagram illustrating a side view of an interior of a vehicle that includes an example vehicle computing system that is configured to communicate with a mobile computing device for authenticating a user of a vehicle, in accordance with one or more aspects of the present disclosure.
- FIG. 2 is a conceptual diagram illustrating further details of an interior of a vehicle, in accordance with one or more aspects of the present disclosure.
- FIG. 3 is a conceptual diagram illustrating an example computing device that performs facial enrollment operations, in accordance with one or more aspects of the present disclosure.
- FIG. 4 is a block diagram illustrating an example computing system, in accordance with one or more aspects of the present disclosure.
- FIG. 5 is a diagram illustrating an example driver onboarding and authentication process, in accordance with one or more aspects of the present disclosure.
- FIG. 6 is a diagram illustrating an example facial enrollment process, in accordance with one or more aspects of the present disclosure.
- FIG. 7 is a flowchart illustrating example operations performed by an example computing system, in accordance with one or more aspects of the present disclosure.
- FIG. 8 is a flowchart illustrating example operations performed by an example vehicle computing system, in accordance with one or more aspects of the present disclosure.
- IHU infotainment head unit
- a display device e.g., presence-sensitive display
- compute engine which is configured to execute an operating system and one or more applications.
- the IHU enables a user to have a rich, personalized experience while driving, and the applications provided by the IHU may enable users to listen to preferred music, browse emails, or pick favored and/or frequent destinations, to name only a few examples.
- the user's personalized experience may be achieved by having user-specific profiles stored in per-user accounts on the IHU.
- a key fob or an external device e.g., mobile phone
- a key fob or an external device may identify a user and log them in to the IHU.
- the mere presence of key fob or external device does not necessarily achieve the authentication of a particular, known or previously identified user of the vehicle.
- the IHU may further prompt the user of a known, trusted device to enter a personal passcode or password on the user's device to enable explicit permission for login into the IHU. This approach, however, may potentially negate the user's expectation of a seamless, “get-in-and drive” experience.
- Modern automobiles support different levels of autonomous driving. Some automobiles use in-car cameras to monitor the driver's attentiveness, or to determine safe handoff to and from the autonomous mode.
- One approach is to leverage in-car cameras to provide face-based identification, similar to the face unlock feature on personal mobile devices.
- this face-based authentication process will typically involve an enrollment step, which may involve a user moving the user's face in a directed pattern in order for the camera to capture facial/biometric features of the user in different poses. Each pose may be associated with a distinct group of facial or other biometric features that are associated with that pose.
- Performing enrollment directly in a vehicle, using its own camera may have certain challenges, as the user may not receive any visual or other directions from the vehicle for moving the user's head while at the same looking at the driver-facing camera.
- the present application describes techniques for performing a seamless authentication of a driver of a vehicle using an in-vehicle camera and a trusted mobile computing device that is communicatively coupled to the vehicle.
- the techniques may provide a reliable authentication mechanism and a seamless user experience, while also ensuring user privacy of facial image information.
- the trusted mobile computing device may initially perform an enrollment process to capture image data (e.g., computed data such as facial feature data) of a face of a known user in a variety of different poses.
- image data refers to data (e.g., feature data) that is computed or otherwise determined from a captured image of, e.g., a face of a user.
- the trusted device may assign each group of image data of the known user to a respective pose bucket of a group of pose buckets.
- the trusted device enrolls the image data by associating the image data of the known user with a user account for the known user.
- the trusted device may subsequently receive authentication image data (e.g., test image data, such as computed facial feature data associated with an image) of an unknown user who is situated inside of the vehicle, where the authentication image is captured using the in-vehicle camera of the vehicle, and where computed image data (e.g., feature data) is obtained from the image and used by the trusted device.
- the trusted device may authenticate the unknown user by comparing the authentication feature data for the unknown user to enrolled feature data of the known user.
- the trusted device sends a result of the authentication process to the vehicle, which may then perform login operations at the IHU that are customized for the authenticated user (e.g., based on that user's profile and/or account information).
- the trusted mobile computing device may send the enrolled feature data to the vehicle computing system, which then may be configured to perform authentication of the unknown user in the vehicle by comparing the authentication feature data for the unknown user to the received enrolled feature data of the known user.
- the computing device may determine a pose bucket associated with authentication feature data of the unknown user's face, select feature data of the known user that is included in the same pose bucket as the pose bucket associated with the authentication feature data, and compare the selected feature data to the authentication feature data to determine whether the unknown user is the known user.
- the computing device may compare the authentication feature data to each of the groups of enrolled feature data to determine which of the enrolled feature data of the known user are most similar to the authentication feature data for the unknown user. The computing device may determine whether the unknown user is the known user based on the most similar enrolled feature data of the known user, regardless of the pose buckets.
- the trusted device may more accurately perform facial recognition to authenticate an unknown user, such as the current user of a vehicle. For instance, enrolling feature data of a known user that are included in several pose buckets may increase the probability that the pose bucket associated with the authentication feature data (e.g., the pose of the unknown user's face) is similar to the pose buckets that include the enrolled feature data of the known user (e.g., the pose of the known user in the enrolled feature data). Increasing the probability that the pose of the authentication feature data of the unknown user is similar to the pose of one or more enrolled feature data of a known user may reduce the probability of falsely rejecting the unknown user when the unknown user is in fact a known, authorized user.
- the pose bucket associated with the authentication feature data e.g., the pose of the unknown user's face
- the enrolled feature data of the known user e.g., the pose of the known user in the enrolled feature data
- increasing the probability that the pose of the authentication feature data of the unknown user is similar to the pose of one or more enrolled feature data of a known user may reduce the probability of falsely accepting the unknown user when the unknown user is not a known, authorized user. In this way, the computing device may more accurately authenticate feature data of unknown users regardless of the pose of the unknown user.
- FIG. 1 is a conceptual diagram illustrating a side view of an interior of a vehicle (e.g., automobile), which includes an example vehicle computing system 100 .
- FIG. 1 shows a cross-sectional view of a vehicle interior in addition to components of vehicle computing system 100 .
- Vehicle computing system 100 is configured to detect and process user input.
- the vehicle illustrated in FIG. 1 may be an automobile, but aspects of the present disclosure may also be applicable to other types of vehicles, including trucks, motorcycles, aircraft, watercraft, trains, or other vehicles.
- a driver 150 may normally occupy seat 152 .
- Seat 152 of the automobile may be positioned directly behind steering wheel 154 of the vehicle such that an occupant of seat 152 may physically control steering wheel 154 .
- the seat 152 is positioned within the vehicle illustrated in FIG. 1 under roof 158 .
- Steering wheel 154 may protrude from dashboard 156 .
- At least one front passenger seat may be laterally positioned adjacent to seat 152 .
- Other passenger seats may be positioned behind seat 152 or in front of seat 152 .
- Vehicle computing system 100 may include, but is not limited to, presence-sensitive panel 102 and camera 104 , as well as display 112 and control unit 106 .
- One or more components of vehicle computing system 100 such as presence-sensitive panel 102 and camera 104 , may be directly and physically accessible to occupants seated in the front driver and front passenger seats of the automobile, and may be located within, near, or on center console 101 . Such components may be within easy reach of such occupants, and may also or alternatively be positioned in another passenger area of the vehicle, such as a back seat.
- a component may be within easy reach if a vehicle occupant does not need to change positions in his or her seat in order to reach the component with an outstretched arm. Stated another way, for many drivers, for example, the usual positions of the steering wheel, stick shift, and center console may be considered within easy reach of the driver.
- presence-sensitive panel 102 and camera 104 may function as input devices for vehicle computing system 100 .
- one or more components of vehicle computing system 100 that might not necessarily require physical access by occupants of the vehicle (such as, in some examples, display 112 and control unit 106 ), may be positioned in or on or integrated into dashboard 156 .
- vehicle computing system 100 may include display 112 that may output a graphical user interface.
- additional cameras may be provided within the vehicle.
- steering wheel 154 may include a user-facing camera 111 .
- additional user-facing cameras may be positioned on other elements or components of the vehicle, such as on dashboard 156 , roof 158 , console 101 or panel 102 , and/or display 112 .
- additional cameras may be positioned on a rear-view mirror or the windshield of the vehicle.
- the in-car cameras e.g., 104 / 111
- the in-car cameras may be mounted or otherwise connected to one or more portions or components of the vehicle.
- User 150 Seated on seat 152 is user 150 .
- User 150 may be a driver, but user 150 could also be a passenger or other vehicle occupant.
- FIG. 1 user 150 is shown in a position that may often be considered a front seat (characterized, e.g., by steering wheel 154 and dashboard 156 ), user 150 may be seated in another location within the vehicle, including a back seat.
- user 150 may navigate or operate the vehicle, may interact with one or more components of the vehicle, and/or may provide input at input devices or presence-sensitive panel 102 or camera 104 .
- user 150 is shown interacting with presence-sensitive panel 102 .
- Presence-sensitive panel 102 may detect one or more taps, gestures, and/or other user inputs at locations of presence-sensitive panel 102 . Such taps, gestures, or other inputs may be from one or more fingers of user 150 , or may be from a stylus or other device used by user 150 . Such input may be on the surface of presence-sensitive panel 102 , or within a threshold distance of the surface of presence-sensitive panel 102 . In the illustration of FIG. 1 , the threshold distance may extend above presence-sensitive panel 102 , towards roof 158 .
- presence-sensitive panel 102 may output to UI module 108 an indication of input detected by presence-sensitive panel 102 .
- UI module 108 may determine, based on the indication of input, information about the input. Such information may, for example, indicate one or more lines, characters, or shapes corresponding to the input.
- UI module 108 may output to one or more application modules 110 information about the input.
- one or more application modules 110 may determine an operation corresponding to the input and/or perform an operation.
- one or more application modules 110 may output to display 112 information about the input, the operation, or an operation to be performed.
- vehicle computing system 100 may be housed within dashboard 156 , which may in some examples be constructed of plastic, vinyl, rubber, aluminum, steel, or any other suitable material.
- Control unit 106 may include at least one processor and/or at least one storage device, and may be housed within housing 105 , which may also be constructed of plastic, vinyl, rubber, aluminum, steel, or any other suitable material.
- housing 105 may also be a rigid case that encloses and otherwise protects one or more electrical components that provide functionality for vehicle computing system 100 .
- housing 105 may be affixed, mounted or otherwise integrated with the automobile dashboard or console.
- Control unit 106 may provide an operating environment or platform for one or one more modules, such as a combination of hardware, firmware, and software, as further illustrated in FIG. 4 .
- control unit 106 may include one or more processors and storage devices that may execute instructions and store data of one or more modules.
- Control unit 106 may also be operably coupled to one or more other software and/or hardware components, including presence-sensitive panel 102 , camera 104 , and display 112 to control, configure, and/or communicate information with the components, to name only a few example operations.
- Display 112 may function as an output device, such as a display device, using any one or more of a liquid crystal display (LCD), dot matrix display, light emitting diode (LED) display, organic light-emitting diode (OLED) display, e-ink, or similar monochrome or color display capable of outputting visible information to a user or vehicle occupant.
- display 112 may also function as an input device, so that it serves as both an input and output device.
- display 112 may include an integrated presence-sensitive input device and a display device.
- display 112 may function as a presence-sensitive input device using a presence-sensitive screen, such as a resistive touchscreen, a surface acoustic wave touchscreen, a capacitive touchscreen, a projective capacitance touchscreen, a pressure-sensitive screen, an acoustic pulse recognition touchscreen, or another presence-sensitive screen technology.
- a presence-sensitive screen such as a resistive touchscreen, a surface acoustic wave touchscreen, a capacitive touchscreen, a projective capacitance touchscreen, a pressure-sensitive screen, an acoustic pulse recognition touchscreen, or another presence-sensitive screen technology.
- display 112 may present output to a user.
- display 112 may present various user interfaces of applications (e.g., a navigation application) executing at vehicle computing system 100 .
- An occupant of the vehicle such as a driver, may provide user input to interact with one or more of such applications.
- Vehicle computing system 100 may operate to assist, inform, entertain, or perform other tasks that require user interactions with occupants of a vehicle.
- Vehicle computing system 100 may, in some examples, be referred to as an infotainment head unit (IHU), an infotainment system, or a subcomponent thereof.
- vehicle computing system 100 may include one or more application modules 110 that perform functions or process information, on behalf of one or more occupants of the vehicle.
- vehicle computing system 100 may provide a navigation service that provides directions to destinations.
- Vehicle computing system 100 may also provide an information retrieval service that provides information in response to queries and/or as preemptive assistance or recommendations.
- Vehicle computing system 100 may also provide vehicle data about the vehicle, or multimedia such as audio or video. Mentioned are only a few examples of the functionality that may be provided by vehicle computing system 100 , and vehicle computing system 100 may provide many additional capabilities. In this and other ways, vehicle computing system 100 may improve the driving or riding experience for one or more occupants of the vehicle.
- vehicle computing system 100 may be controlled through input detected by presence-sensitive panel 102 , through input detected by camera 104 , and/or through input detected by a combination of presence-sensitive panel 102 and camera 104 . Vehicle computing system 100 may also be controlled through input detected by one or more additional input devices (e.g., microphones, physical buttons or switches, or other types of input devices).
- additional input devices e.g., microphones, physical buttons or switches, or other types of input devices.
- Presence-sensitive panel 102 may, in some examples, function simply as an input device for touch input, provided by user input that may occur directly and physically at presence-sensitive panel 102 .
- presence-sensitive panel 102 may function as a multi-touch presence-sensitive input device using a presence-sensitive device, such as a resistive touchscreen or touch panel, a surface acoustic wave touchscreen or touch panel, a capacitive touchscreen or touch panel, a projective capacitance touchscreen or touch panel, a pressure-sensitive screen or touch panel, an acoustic pulse recognition touchscreen or touch panel, or another presence-sensitive screen or touch panel technology.
- a presence-sensitive device such as a resistive touchscreen or touch panel, a surface acoustic wave touchscreen or touch panel, a capacitive touchscreen or touch panel, a projective capacitance touchscreen or touch panel, a pressure-sensitive screen or touch panel, an acoustic pulse recognition touchscreen or touch panel, or another presence-sensitive screen or touch panel technology.
- presence-sensitive panel 102 may detect an object at and/or near, or within range of the presence-sensitive component(s) associated with presence-sensitive panel 102 .
- presence-sensitive panel 102 may detect an object, such as a finger or stylus that is within 2 cm or less of presence-sensitive panel 102 .
- Presence-sensitive panel 102 may determine a location (e.g., an (x,y) coordinate) of the presence-sensitive input device at which the object was detected.
- presence-sensitive panel 102 may detect an object 6 inches or less from presence-sensitive panel 102 ; other ranges are also possible.
- Presence-sensitive panel 102 may detect a user's finger, stylus, or similar using capacitive, inductive, and/or optical recognition techniques.
- presence-sensitive panel 102 may be positioned in center console 101 above camera 104 , and center console 101 may be transparent to camera 104 so that camera 104 may capture images directly above presence-sensitive panel 102 even though presence-sensitive panel 102 physically obscures the lens or field-of-view of camera 104 .
- camera 104 may be an infrared camera that captures images by receiving infrared light and presence-sensitive panel 102 may be transparent to infrared light such that camera 104 is able to receive the infrared light originating between the roof 158 and presence-sensitive panel 102 .
- camera 104 might not be positioned directly under presence-sensitive panel 102 , and camera 104 may be positioned elsewhere within the vehicle.
- steering wheel 154 may include a user-facing camera 111 .
- additional user-facing cameras may be positioned on other elements or components of the vehicle, such as on dashboard 156 , roof 158 , console 101 or panel 102 , and/or display 112 .
- presence-sensitive panel 102 may function as both an input device and as an output device.
- presence-sensitive panel 102 may include an integrated presence-sensitive input device and a display device, and could be any one or more of a liquid crystal display (LCD), dot matrix display, light emitting diode (LED) display, organic light-emitting diode (OLED) display, e-ink, or similar monochrome or color display capable of outputting visible information to a user or vehicle occupant.
- presence-sensitive panel 102 may be implemented by two separate components: a presence-sensitive input device for receiving input and a display device for providing output.
- presence-sensitive panel 102 may still be positioned in center console 101 above camera 104 , and center console 101 may still be transparent to camera 104 so that camera 104 may capture images directly above presence-sensitive panel 102 , even if positioned under presence-sensitive panel 102 .
- Camera 104 and/or camera 111 may be one or more of any appropriate type of image acquisition or capture device, such as a camera or charge-coupled device.
- camera 104 may be one or more infrared cameras with a high field-of-view and shallow depth of focus, and may be a backlit infrared camera oriented to point generally upward within the vehicle, having a particular field-of-view.
- camera 104 may be or may further include one or more other types of cameras or image sensors, which may include one or more other infrared cameras, thermographic cameras, thermal imaging cameras, light-sensitive cameras, range sensors, tomography devices, radar devices, red-green-blue (RGB) cameras, or ultrasonic cameras.
- RGB red-green-blue
- camera 104 may be any image capture device appropriate for application of computer vision techniques.
- the resulting image may include two-dimensional images, three-dimensional volumes, or an image sequence. Pixel values typically correspond to light intensity in one or more spectral bands, but might also be related to various physical measures, such as depth, absorption or reflectance of sonic or electromagnetic waves, or nuclear magnetic resonance.
- Camera 104 may be configured to capture movements of an occupant of the vehicle, such as a driver, as the occupant moves an arm, wrist, hand, stylus, and/or fingers as he or she gestures in, for example, a field-of-view, and may be configured to capture images of the face of user 150 .
- vehicle computing system 100 may include user interface (UI) module 108 and application modules 110 .
- UI module 108 and application modules 110 may perform operations described herein using software, hardware, firmware, or a mixture of both hardware, software, and firmware residing in and executing by vehicle computing system 100 or at one or more other remote computing devices.
- UI module 108 and application modules 110 may be implemented as hardware, software, and/or a combination of hardware and software.
- Vehicle computing system 100 may execute UI module 108 , application modules 110 , or one or more other modules as or within a virtual machine executing on underlying hardware.
- UI module 108 and application modules 110 may be implemented in various ways.
- UI module 108 and application modules 110 may be implemented as a downloadable or pre-installed application or “app.”
- UI module 108 and application modules 110 may be implemented as part of an operating system of vehicle computing system 100 .
- Application modules 110 may include functionality to perform any variety of operations on vehicle computing system 100 .
- application modules 110 may include a navigation application, weather application, a phone dialer application, an information retrieval application, a multimedia application, a vehicle information application, an email application, a text messing application, instant messaging application, social networking application, weather application, stock market application, emergency alert application, sports application, to name only a few examples.
- vehicle computing system 100 may be configured to perform operations including those relating to climate control systems (e.g., heating and air conditioning), audio or infotainment systems, seat, window, sunshade, or windshield wipers, cruise control, in-cabin display system, steering wheel controls, headrest, arm rest, side or rear view mirrors, collision sensors.
- climate control systems e.g., heating and air conditioning
- infotainment systems e.g., audio and infotainment systems
- seat, window, sunshade, or windshield wipers e.g., cruise control, in-cabin display system, steering wheel controls, headrest, arm rest, side or rear view mirrors, collision sensors.
- Such operations may be controlled by one or more application modules 110 , or may be controlled by other systems within the vehicle. In some examples, such operations may be limited to non-safety features of the vehicle.
- such operations may encompass one or more features of the vehicle that may be considered safety-related (e.g., turning on a turn-signal, adjusting a mirror, adjusting or fastening/disconnecting a seat belt, adjusting cruise control features, accelerating, braking).
- safety-related e.g., turning on a turn-signal, adjusting a mirror, adjusting or fastening/disconnecting a seat belt, adjusting cruise control features, accelerating, braking).
- one or more of application modules 110 may be operable by a remote computing device (e.g., mobile computing device 170 ) that is communicatively coupled to vehicle computing system 100 .
- a remote computing device e.g., mobile computing device 170
- an application module executing at a remote computing device may cause the remote computing device to send the content and intent information using any suitable form of data communication (e.g., wired or wireless network, short-range wireless communication such as Near Field Communication or BLUETOOTH, etc.).
- a remote computing device may be a computing device that is separate from a computing device included in vehicle computing system 100 .
- the remote computing device may be operatively coupled to vehicle computing system 100 by a network.
- An example of a remote computing device may include, but is not limited to a server, smartphone, tablet computing device, smart watch, and desktop computer.
- a remote computing device may or may not be an integrated component of vehicle computing system 100 .
- mobile computing device 170 which may include a display device 172 and one or more image capture devices 173 , and which may execute one or more applications 174 .
- Examples of mobile computing device 170 may include, but are not limited to, a mobile phone, a tablet computer, a personal digital assistant (PDA), a laptop computer, a portable gaming device, a portable media player, an e-book reader, a wearable device (e.g., a watch, a wrist-mounted computing device, a head-mounted computing device), or other type of mobile computing device.
- Mobile computing device 170 may be or include one or more processors.
- FIG. 1 further illustrates a mobile, wearable device 107 (e.g., smartwatch), which is worn by user 150 , and which may be communicatively coupled (e.g., via one or more wireless connections) to mobile computing device 170 and/or system 100 .
- a mobile, wearable device 107 e.g., smartwatch
- IHU system 100 may communicate with wearable device 107 and/or mobile computing device 170 using a wireless communication protocol (e.g., BLUETOOTH, WIFI, BLUETOOTH Low Energy (BLE)).
- a wireless communication protocol e.g., BLUETOOTH, WIFI, BLUETOOTH Low Energy (BLE)
- BLE BLUETOOTH Low Energy
- this device may be recognized as a trusted device with respect to IHU system 100 , and is assigned a unique identifier by IHU system 100 .
- This trusted device, and its corresponding unique identifier are associated by IHU system 100 with user 150 of the vehicle, and any profile and/or account information for user 150 .
- UI module 108 of vehicle computing system 100 may receive from presence-sensitive panel 102 one or more indications of user input detected at presence-sensitive panel 102 . Generally, each time presence-sensitive panel 102 detects user input at a particular location of presence-sensitive panel 102 , UI module 108 may receive an indication of user input or information about the user input from presence-sensitive panel 102 . UI module 108 may assemble the information received from presence-sensitive panel 102 into a set of one or more events, such as a sequence of one or more touch events or gesture events. Each gesture event in the sequence may include data or components that represent parameters (e.g., when, where, originating direction) characterizing a presence and/or movement of input at presence-sensitive panel 102 .
- parameters e.g., when, where, originating direction
- Each gesture event in the sequence may include a location component corresponding to a location of presence-sensitive panel 102 , a time component related to when presence-sensitive panel 102 detected user input at the location, and/or an action component related to whether the gesture event corresponds to a lift up or a push down at the location.
- UI module 108 may determine one or more characteristics of the user input based on the sequence of gesture events and include information about these one or more characteristics within each gesture event in the sequence of gesture events. For example, UI module 108 may determine a start location of the user input, an end location of the user input, a density of a portion of the user input, a speed of a portion of the user input, a direction of a portion of the user input, and a curvature of a portion of the user input. UI module 108 may transmit indications of user input from presence-sensitive panel 102 to other modules, such as application modules 110 . UI module 108 may determine one or more single- or multi-touch gestures provided by a user.
- UI module 108 may also act as an intermediary between various components of vehicle computing system 100 to make determinations based on input detected by presence-sensitive panel 102 and generate output presented by display 112 .
- UI module 108 may receive data from one or more application modules 110 and cause display 112 to output content, such as a graphical user interface, for display.
- UI module 108 of vehicle computing system 100 may also receive from camera 104 one or more indications of user input detected by camera 104 . Generally, each time camera 104 detects a user gesture or movement, UI module 108 may receive an indication of user input or information about the user input from camera 104 . UI module 108 may assemble the information received from camera 104 into a set of one or more events, such as a sequence of movements or gesture events. Each gesture event in the sequence may include data or components that represents parameters (e.g., when, where in three dimensional space, originating direction, direction in three dimensional space, hand or arm orientation or posture) characterizing a presence, gesture, and/or movement captured by camera 104 within a field-of-view.
- parameters e.g., when, where in three dimensional space, originating direction, direction in three dimensional space, hand or arm orientation or posture
- Each gesture event in the sequence may include a location component corresponding to a three-dimensional location within a field-of-view, a time component related to when camera 104 detected user input within the three-dimensional space, an action component related to what type of gesture was made, and/or one or more images captured by camera 104 .
- the arrangement and/or placement of presence-sensitive panel 102 and camera 104 within the vehicle may provide an ergonomic and comfortable way for a driver (or other vehicle occupant) to interact with vehicle computing system 100 .
- presence-sensitive panel 102 and camera 104 may detect different types of input
- the positioning of presence-sensitive panel 102 and camera 104 in accordance with one or more aspects of this disclosure may be such that input detected by presence-sensitive panel 102 may be perceived by a vehicle occupant to be a natural extension of input detected by camera 104 .
- input detected by camera 104 may be perceived by a vehicle occupant to be a natural extension of input detected by presence-sensitive panel 102 .
- such a system may provide a particularly natural or easy user interface for a vehicle occupant to use.
- a vehicle occupant may find interacting with vehicle computing system 100 to be relatively instinctive.
- a mobile computing device may establish a connection with vehicle computing system 100 of the vehicle illustrated in FIG. 1 .
- the mobile computing device may receive, from vehicle computing system 100 , first feature data associated with at least one image of a face of user 150 of the vehicle.
- the at least one image of the face of user 150 of the vehicle is captured by an image capture device (e.g., camera 104 / 111 ) connected to at least a portion of the vehicle.
- the mobile computing device may determine, based on a comparison between the first feature data and second feature data associated with at least one image of a face of a previously enrolled user of the mobile computing device, a match between the user of the vehicle and the previously enrolled user, as described further below in reference to FIG. 3 .
- the mobile computing device authenticates, based on the match, user 150 of the vehicle.
- the mobile computing device the sends, to vehicle computing system 100 , authentication data for user 150 of the vehicle, where the authentication data is indicative of the match.
- vehicle computing system 100 may establish a connection with a mobile computing device (e.g., mobile computing device 170 , wearable device 107 ), where the vehicle computing system includes an infotainment head unit, and determine a presence of a user inside the vehicle. After determining the presence of the user inside the vehicle, vehicle computing system 100 may capture, using an image capture device (e.g., camera 104 / 111 ) that is connected to at least a portion of the vehicle, at least one image of a face of the user, and determine first feature data associated with the at least one image of the face of the user.
- an image capture device e.g., camera 104 / 111
- Vehicle computing system 100 may receive authentication data for the user, where the authentication data indicates a match between the user and a previously enrolled user based on a comparison between the first feature data and second feature data associated with at least one image of a face of the previously enrolled user. Vehicle computing system 100 may then access, based on the authentication data for the user, user account information to log the user into the infotainment head unit.
- vehicle computing system 100 may send, to the mobile computing device, the first feature data associated with the at least one image of the face of the user. After sending the first feature data, vehicle computing system 100 receives, from the mobile computing device, the authentication data for the user.
- the trusted mobile computing device may send the enrolled feature data to vehicle computing system 100 , which then may be configured to perform authentication of the unknown user in the vehicle by comparing the authentication feature data of the unknown user to the received enrolled feature data of a known user.
- vehicle computing system 100 receives, from the mobile computing device, the second feature data associated with the at least one image of the face of the previously enrolled user, and compares the first feature data and the second feature data. Vehicle computing system 100 then determines, based on the comparing, the match between the user and the previously enrolled user, where the authentication data comprises an indication of the match.
- a computing device and/or a computing system analyzes information (e.g., facial image information, etc.) associated with a computing device and a user of a computing device, only if the computing device receives permission from the user of the computing device to analyze the information.
- information e.g., facial image information, etc.
- the user may be provided with an opportunity to provide input to control whether programs or features of the computing device and/or computing system can collect and make use of user information (e.g., information about a user's current location, current speed, etc.), or to dictate whether and/or how to the device and/or system may receive content that may be relevant to the user.
- certain data may be treated in one or more ways before it is stored or used by the computing device and/or computing system, so that personally identifiable information is removed.
- a user's identity may be treated so that no personally identifiable information can be determined about the user, or a user's geographic location may be generalized where location information is obtained (such as to a city, ZIP code, or state level), so that a particular location of a user cannot be determined.
- location information such as to a city, ZIP code, or state level
- FIG. 2 is a conceptual diagram illustrating further details of an interior of a vehicle, in accordance with one or more aspects of the present disclosure.
- a portion of steering wheel 154 from FIG. 1 is shown, and a finger of user 150 is interacting with panel 102 of console 101 .
- camera 104 has acquired facial images of user 150 , and provided information associated with these images to mobile computing device 170 .
- Mobile computing device 170 has authenticated user 150 based on a comparison of received facial feature data with stored facial feature data of authorized, known users stored by mobile computing device 170 , to identify a match with the stored facial feature data of user 150 .
- Mobile computing device 170 sends authentication data for user 150 to system 100 .
- System 100 may send feature data to mobile computing device 170 that comprises actual image information or, in various examples, comprises feature data (e.g., feature vector data) for various features of the images of user 150 that are captured by image capture device 173 .
- feature data e.g., feature vector data
- this authentication data may comprise a unique identifier associated with user 150 and/or mobile computing device 170 .
- System 100 may use this received authentication data to access the user profile, user preference, login credentials, and/or account data for user 150 that is stored in system 100 (e.g., for logging user 150 into IHU system 100 ).
- System 100 may then display personalized or customized data display 112 of system 100 for user 150 based upon the accessed information (e.g., preferences) that are particular to user 150 .
- This personalized information may include user interface elements 208 and 204 . Cursor 206 may overlap one of user interface elements 204 .
- presence-sensitive panel 102 may detect one or more taps or inputs at presence-sensitive panel 102 , and system 100 may determine, based on this input, that the input corresponds to selection of the user interface elements 204 overlapped by cursor 206 .
- display 112 may be a presence-sensitive panel that operates both as an input device and an output device.
- display 112 may detect one or more inputs at or near a location on display 112 where display 112 presents user interface element 204 .
- System 100 may identify, based on the input, a selected user interface element 204 corresponding to the input.
- system 100 may perform an operation, which may include displaying information or updating a graphical user interface at display 112 .
- presence-sensitive panel 102 also acts as a display
- computing device 200 may additionally or alternatively display information at presence-sensitive panel 102 or update a graphical user interface displayed at presence-sensitive panel 102 .
- Display 112 of IHU system 100 may be configured to display personalized or customized information for any particular user that is authenticated by mobile computing device 170 that is communicatively coupled to system 100 .
- display 112 may be configured to display different customized information for each user, depending on the setup of the user profile or account for each user that is authenticated by mobile computing device 170 .
- system 100 may also customize various other settings of vehicle that are associated with operation of the vehicle and/or IHU system 100 (e.g., temperature settings, radio settings, vehicle environment settings, etc.).
- FIG. 3 is a conceptual diagram illustrating an example computing device 310 that performs facial enrollment operations, in accordance with one or more aspects of the present disclosure.
- Computing device 310 may be one example of mobile computing device 170 shown in FIG. 1 that is configured to save enrolled image data of authorized users.
- image data refers to data (e.g., feature data) that is computed or otherwise determined from a captured image of, e.g., a face of a user.
- Computing device 310 may be a mobile device, such as a smart phone, a tablet computer, a laptop computer, computerized watch, computerized eyewear, computerized gloves, or any other type of portable computing device. Additional examples of computing device 310 include other mobile and non-mobile devices, such as desktop computers, televisions, personal digital assistants (PDA), portable and non-portable gaming systems, digital media players or micro-consoles, e-book readers, mobile television platforms, automobile navigation and entertainment systems, or any other types of wearable and non-wearable, mobile or non-mobile computing devices.
- PDA personal digital assistants
- computing device 310 includes a presence-sensitive display (PSD) 312 , one or more image capture devices 314 , facial recognition module (FRM) 322 , and enrolled information 328 .
- Enrolled information 328 may comprise a data store.
- FRM 322 may perform operations described using software, hardware, firmware, or a mixture of hardware, software, and firmware residing in and/or executing at computing device 310 .
- Computing device 310 may execute FRM 322 with multiple processors or multiple devices.
- Computing device 310 may execute FRM 322 as a virtual machine executing on underlying hardware.
- FRM 322 may execute as one or more services of an operating system or computing platform.
- FRM 322 may execute as one or more executable programs at an application layer of a computing platform.
- PSD 312 of computing device 310 may function as respective input and/or output devices for computing device 310 .
- PSD 312 may be implemented using various technologies. For instance, PSD 312 may function as input devices using presence-sensitive input screens, such as resistive touchscreens, surface acoustic wave touchscreens, capacitive touchscreens, projective capacitance touchscreens, pressure sensitive screens, acoustic pulse recognition touchscreens, or another presence-sensitive display technology.
- PSD 312 may detect input from a user of computing device 310 .
- PSD 312 may detect one or more gestures performed on or within a threshold distance of PSD 312 (e.g., a user touching PSD 312 with a finger or a stylus or moving the finger or stylus within a threshold distance of a surface of PSD 312 ).
- PSD 312 may also function as output (e.g., display) devices using any one or more display devices, such as liquid crystal displays (LCD), dot matrix displays, light emitting diode (LED) displays, organic light-emitting diode (OLED) displays, e-ink, or similar monochrome or color displays capable of outputting visible information to a user of computing device 310 .
- PSD 312 may output information (e.g., to a user) as a user interface (e.g., a graphical user interface), which may be associated with functionality provided by computing device 310 .
- PSD 312 may display various user interfaces related to an application module or other features of computing platforms, operating systems, applications, and/or services executing at or accessible from computing device 310 .
- Image capture devices 314 may be one example of image capture devices 173 ( FIG. 1 ) and may include one or more cameras, such as digital cameras, still cameras, motion picture cameras, and the like. Image capture devices 314 may include any other devices capable of capturing and storing still or moving images. In some examples, image capture devices 314 may be capable of digitally recording images via an electronic image sensor. Image capture devices 314 may be configured to generate data indicative of an image in response to detecting visible light (e.g., light visible by humans, with wavelengths between approximately 380 nanometers to approximately 740 nanometers) or near infrared (NIR) light (e.g., light adjacent to the visible light spectrum, such as light with wavelengths between approximately 750 nanometers and approximately 3400 nanometers).
- visible light e.g., light visible by humans, with wavelengths between approximately 380 nanometers to approximately 740 nanometers
- NIR near infrared
- computing device 310 includes an image capture device 314 configured to generate data indicative of 2-dimensional images.
- computing device 310 includes a plurality of image capture device 314 configured to generate data indicative of 3-dimensional images. In this way, a plurality of image capture devices 314 may generate data indicative of 2-dimensional images, 3-dimensional images, or both.
- FRM 322 may perform facial recognition to authenticate a user of computing device 310 .
- FRM 322 may perform an enrollment process (e.g., one time, such as during an initial setup of computing device 310 ) and periodically execute an authentication process to determine whether an unknown user is, in fact, a known user.
- image capture devices 314 may determine one or more feature data 330 A- 330 H (collectively, “feature data 330 ” or, more generally, “image data 330 ”) of a known user 338 (e.g., a user logged in to a user account associated) and PSD 312 may output a graphical user interface (GUI) that includes one or more images 333 of the known user 338 .
- GUI graphical user interface
- Known user 338 may also be referred to as an enrolled user.
- each of feature data 330 may be associated with a particular pose of user 338 (e.g., feature data 330 A for a “pose A” of user 338 , feature data 330 B for a “pose B” of user 338 ).
- Known user 338 may be able to view images 333 within the GUI as data is captured by image captures devices 314 , and may adjust his or her head and/or computing device 310 to enable image capture devices 314 to capture images 333 of the face of known user 338 in a variety of different poses.
- Image capture devices 314 may output data associated with images 333 to FRM 322 .
- user 338 may face image capture devices 314 and press a button (e.g., a physical button or a graphical button displayed by PSD 312 ) to cause image captures devices 314 to capture images 333 .
- a button e.g., a physical button or a graphical button displayed by PSD 312
- image capture devices 314 may automatically capture images 333 in response to the user facing image capture devices 314 .
- FRM 322 analyzes the data received from image capture devices 314 and assigns each of computed feature data 330 to one or more pose buckets 332 AA- 332 EE (collectively, pose buckets 332 ).
- Feature data 330 comprises features or characteristics of images 333 .
- Each pose bucket 332 is associated with a range of pitch angles (also referred as tilt angle) and yaw angles (also referred to as a pan angle) of the user's face.
- the pitch angle refers to an angle of the user's face relative to a horizontal axis
- the yaw angle refers to an angle of the user's face relative to a vertical axis that is perpendicular to the horizontal axis.
- each of pose buckets 332 may be associated with a respective yaw and pitch range.
- the size of each of pose buckets 332 is equal (e.g., 10 degrees).
- each of pose buckets 332 is associated with a 10-degree range of pitch angles and a 10-degree range of yaw angles.
- the size of pose buckets 332 may be different.
- pose bucket may associated with an 8-degree range of pitch angles (and/or range of yaw angles) and another pose bucket may be associated with a 10-degree range of pitch angles (and/or range of yaw angles).
- pose buckets 332 are shown in FIG. 3 as part of a table 331 .
- the yaw and pitch angles shown in table 331 represent the center of each respective pose bucket 332 .
- the center of pose bucket 332 AA is ⁇ 20 degrees in yaw and 20 degrees in pitch.
- the pose bucket 332 AA may represent ⁇ 15 to ⁇ 25 degrees in yaw, and 15 to 25 degrees in pitch.
- the center of pose bucket 332 AN is 20 degrees in yaw and 20 degrees in pitch, such that pose bucket 332 AN represents 15 degrees to 25 degrees in yaw and 15 degrees to 25 degrees in pitch.
- table 331 includes 25 pose buckets 332
- table 331 may include a different number of pose buckets 332 .
- pose buckets 332 are illustrated as part of a table 331 to aid understanding, pose buckets 332 may not be stored in a table.
- Pose buckets 332 may be stored in any data structure and may be organized in any manner.
- FRM 322 may determine which of pose buckets 332 is associated with feature data based on characteristics or landmarks of the face of known user 338 included in the data for images 333 . For instance, FRM 322 may detect landmarks in images 333 of the user's face, such as the user's eyes, nose, and mouth, and may determine the yaw and pitch angles of the user's face based on the landmarks. For example, FRM 322 may determine that feature data 330 A, which is associated with a particular pose (e.g., “POSE A”), should be included in pose bucket 332 CC based on the yaw and pitch angles of the user's face in feature data 330 A, and the range yaw and pitch angles associated with pose bucket 332 CC.
- a particular pose e.g., “POSE A”
- FRM 322 may determine that the yaw angle of the user's face in feature data 330 A is 0 degrees and the pitch angle of the known user's face is 0 degrees. FRM 322 may determine that pose bucket 332 CC is associated with a range of yaw angles from ⁇ 5 to 5 degrees and a range of pitch angles from ⁇ 5 degrees to 5 degrees (e.g., pose bucket 332 CC is centered at 0 degrees yaw and 0 degrees pitch).
- FRM 322 may determine that pose bucket 332 CC includes feature data 330 A in response to determining that the yaw and pitch angle of the user's face in feature data 330 A falls within the range of yaw and pitch angles associated with pose bucket 332 CC.
- FRM 322 may determine the yaw angle of user's face in feature data 330 B is 0 degrees (e.g., centered in the left and right directions) and the pitch angle of the user's face is 23 degrees (e.g., the known user is looking up). FRM 322 may determine that pose bucket 332 AC is associated with a range of yaw angles from ⁇ 5 to 5 degrees and a range of pitch angles from 15 degrees to 25 degrees (e.g., pose bucket 332 CC is centered at 0 degrees yaw and 20 degrees pitch).
- FRM 322 may determine that pose bucket 332 AC includes feature data 330 B in response to determining that the yaw and pitch angle of the user's face in feature data 330 B falls within the range of yaw and pitch angles associated with pose bucket 332 AC.
- FRM 322 may determine feature data 330 and determine whether a threshold number of pose buckets 332 include one of feature data 330 . For example, FRM 322 may determine a number of pose buckets 332 that include feature data 330 of the face of the known user. In the example of FIG.
- FRM 322 determines that feature data 330 are included within 19 pose buckets (e.g., 332 AB, 332 AC, 332 BA, 332 BB, 332 BC, 332 BD, 332 BE, 332 CA, 332 CB, 332 CC, 332 CD, 332 CE, 332 DB, 332 DC, 332 DD, 332 DE, 332 EB, 332 EC, and 332 ED) of 25 possible pose buckets 332 .
- 19 pose buckets e.g., 332 AB, 332 AC, 332 BA, 332 BB, 332 BC, 332 BD, 332 BE, 332 CA, 332 CB, 332 CC, 332 CD, 332 CE, 332 DB, 332 DC, 332 DD, 332 DE, 332 EB, 332 EC, and 332 ED
- FRM 322 determines whether the number of pose buckets that include feature data 330 satisfies (e.g., is greater than or equal to) a threshold number (e.g., 15 pose buckets, 17 pose buckets, 19 pose buckets, etc.; or 65% of pose buckets 332 , 75% of pose buckets 332 , 85% of pose buckets 332 , etc.). For example, FRM 322 may determine that the number of pose buckets 332 that include feature data 330 satisfies the threshold number of pose buckets in response to determining that feature data 330 are included within at least 75% of pose buckets 332 . Determining the number of pose buckets 332 that include feature data 330 satisfies the threshold number of pose buckets may indicate that feature data 330 represent the face of the known user in enough different poses to more accurately authenticate a user.
- a threshold number e.g. 15 pose buckets, 17 pose buckets, 19 pose buckets, etc.; or 65% of pose buckets 332
- image capture device 314 may capture one or more images 333 for the enrollment process.
- FRM 322 may output a graphical user interface instructing known user 338 to move his or her head to capture images 333 of different angles of the face of known user 338 .
- FRM 322 may associate feature data 330 with a user account for known user 338 .
- feature data 330 may include image templates (also referred to as embeddings) for each respective image.
- an image template may generally correspond to a statistical model of one or more features (e.g., biometric features) of a user's face.
- FRM 322 may generate an image template that includes a vector with a plurality of element values (e.g., 50 values, 100 values, 500 values, etc.).
- each element value of the vector corresponds to a feature of the user's face (e.g., distance between the eyes, nose shape, etc.).
- element values of the vector may be generated through a, e.g., non-linear machine learned model trained to generate outputs indicative of a facial identity.
- FRM 322 may apply a trained facial recognition model to the plurality of feature data 330 and output an image template (e.g., embedding) for each respective feature data 330 as a vector.
- FRM 322 associates feature data 330 with the user account by assigning data to an image template identifier and associating the respective image template identifier with a user account identifier for the user account for known user 338 .
- FRM 322 may feature data image in enrolled information 328 , which comprises a data store that includes feature data 330 associated with images 333 of user 338 .
- FRM 322 encrypts feature data 330 prior to storing it in enrolled information 328 .
- enrolled information 328 may be stored locally on computing device 310 such that the information is not transmitted over a network to any other devices (e.g., not transmitted to system 100 of the vehicle of FIG. 1 ). Further, computing device 310 may provide the user with an opportunity to delete one or more portions of feature data 330 stored in enrolled information.
- FRM 322 may perform an authentication process for an unknown user (e.g., user 150 of the vehicle of FIG. 1 ) after completing the enrollment process for known user 338 .
- FRM 322 may receive a request to authenticate an unknown user 150 of the vehicle of FIG. 1 , based on feature data of user 150 that is determined from images 333 captured by one of cameras in the vehicle (e.g., camera 104 and/or 111 ), and which is transmitted by IHU system 100 to computing device 310 , which may be one example of mobile computing device 170 of FIG. 1 .
- System 100 may send feature data 330 to computing device 310 that comprises, e.g., feature vector data for various features of the images of user 150 that are captured by image capture device 173 .
- FRM 322 may determine whether unknown user 150 is known user 338 . In some examples, FRM 322 determines whether the unknown user 150 is the known user 338 based on a comparison of the feature data indicative of authentication images of the face of unknown user 150 and the feature data 330 indicative of images 333 of the face of the known user 338 . For example, FRM 322 may compare the feature data indicative of the authentication image(s) of the face of unknown user 150 and feature data 330 that is computed or determined from images 333 of the face of the known user 338 using a pose-independent (also referred to as pose invariant) technique or a pose-dependent technique.
- pose-independent also referred to as pose invariant
- FRM 322 determines whether the unknown user 150 is the known user 338 based on the authentication feature data and feature data 330 associated with the face of the known user 338 in a pose closest to the pose of the face of the unknown user in authentication image of user 150 .
- FRM 322 determines a pose bucket of pose buckets 332 associated with authentication feature data of unknown user 150 .
- FRM 322 may determine the pose bucket associated with feature data of unknown user 150 based on characteristics or landmarks of the face of the unknown user, in a manner similar to determining the pose buckets associated with feature data 330 . For instance, FRM 322 may determine the yaw angle and the pitch angle of the face in the authentication feature data of user 150 .
- FRM 322 may determine which of pose buckets 332 include the yaw and pitch angle of the face in the authentication feature data. For instance, FRM 322 may determine the yaw and pitch angles of the face in the feature data are 20 degrees and 0 degrees respectively. FRM 322 may determine that pose bucket 332 CD is associated with a range of yaw angles from 15 to 25 degrees yaw and a range of pitch angles from ⁇ 5 to 5 degrees.
- FRM 322 may determine that the authentication feature data of the unknown user 150 is associated with pose bucket 332 CD in response to determining the yaw and pitch angles of the face in the authentication feature data are included in the range of yaw and pitch angles associated with pose bucket 332 CD.
- FRM 322 may determine feature data from feature data 330 of the face of the known user 338 that is included within the pose bucket associated with the feature data of the face of the unknown user 150 . In other words, FRM 322 may determine which of feature data 330 has the closest pose to the pose of the authentication feature data of user 150 . In one example, FRM 322 determines that the feature data of user 150 is associated with pose bucket 332 CD and selects feature data (e.g., feature data 330 G for a pose G) of feature data 330 that is included within pose bucket 332 CD. FRM 322 may determine whether user 150 is the known user 338 by determining a similarity score for the selected feature data 330 G, the similarity score indicating a similarity between feature data 330 G and the authentication feature data.
- FRM 322 determines whether the similarity score for feature data 330 G satisfies (e.g., is greater than or equal to) a threshold similarity score. FRM 322 may determine unknown user 150 is the known user 338 in response to determining that the similarity score for feature data 330 G satisfies the threshold similarity score, and may determine that unknown user 150 is not the known user in response to determining that the similarity score for feature data 330 G does not satisfy the threshold similarity score.
- FRM 322 determines whether unknown user 150 is the known user 338 regardless of the pose. In other words, in some examples, FRM 322 utilizes pose invariant techniques to determine whether the unknown user 150 is the known user 338 . For example, FRM 322 may determine a respective similarity score for each feature data of feature data 330 , where the respective similarity score indicates a similarity between the corresponding feature data of feature data 330 and the authentication feature data of unknown user 150 .
- FRM 322 selects feature data of feature data 330 based on the respective similarity scores for feature data 330 to determine whether unknown user 150 is the known user 338 .
- FRM 322 selects the feature data of feature data 330 with the similarity score indicative of a closest match to the authentication feature data.
- the score indicative of the closest match may be the lowest similarity score or the highest similarity score.
- FRM 322 selects two or more feature data of feature data 330 based on the respective similarity scores to determine whether unknown user 150 is the known user 338 .
- FRM 322 determines a composite similarity score for two or more feature data 330 .
- FRM 322 may determine the composite similarity score based on the average of the respective similarity scores for two or more of feature data 330 and may compare the composite similarity score to the threshold similarity score to determine whether unknown user 150 is the known user 338 .
- FRM 322 may compare each respective similarity score for the two or more feature data to the threshold similarity score. In such examples, FRM 322 may determine that unknown user 150 is the known user 338 in response to determining that a threshold number (e.g., 100%, 80%, 60% etc.) of the selected feature data have a similarity score that satisfies the threshold similarity score. For instance, if the set of selected feature data include three feature data of feature data 330 with the highest similarity scores, in some examples, FRM 322 determines that unknown user 150 is the known user 338 in response to determining that the similarity score for two of the three selected feature data satisfies the threshold similarity score.
- a threshold number e.g., 100%, 80%, 60% etc.
- computing device 310 may send authentication data to IHU system 100 of the vehicle. For example, computing device 310 may send data indicative of a successful authentication of unknown user 150 as the known user 338 , thereby indicating authentication of user 150 . In some cases, computing device 310 may send user profile and/or account information that is customized or personalized for user 338 (e.g., authenticated user 150 ) to system 100 . In some cases, computing device 310 may also send a unique identifier of user 338 and/or of computing device 310 to system 100 of the vehicle.
- computing device 310 may also store one or more models 329 .
- computing device 310 may utilize deep learning models to extract facial landmarks that help in identifying an individual, as described above. These models may be tailored to execute on the dedicated neural engines or processors of the personal devices.
- an image template may generally correspond to a statistical model of one or more features (e.g., biometric features) of a user's face.
- FRM 322 may generate an image template that includes a vector with a plurality of element values (e.g., 50 values, 100 values, 500 values, etc.).
- each element value of the vector corresponds to a feature of the user's face (e.g., distance between the eyes, nose shape, etc.).
- element values of the vector may be generated through a, e.g., non-linear machine learned model trained to generate outputs indicative of a facial identity.
- FRM 322 may apply a trained facial recognition model to images 333 and output an image template (e.g., embedding) for respective feature data 330 as a vector.
- element values of the vector may be generated through a non-linear machine learned model trained to generate outputs indicative of a facial identity.
- an enrollment module may apply a trained facial recognition model to images 333 and output an image template (e.g., a vector) for respective feature data 330 .
- an image template may be represented as a vector and may be generated by a trained facial recognition model.
- the vector may include a plurality of element values that each correspond to a respective feature of the user's face (e.g., distance between the eyes, nose shape, etc.).
- computing device 310 is described as enrolling feature data 330 of a known user 338 and authenticating an unknown user 150 , in some examples, one or more remote computing devices may perform all or a subset of the functionality described herein. In certain alternate examples, computing device 310 may send the enrolled feature data to a vehicle computing system (e.g., vehicle computing system 100 ), which then may be configured to perform authentication of an unknown user in the vehicle by comparing the authentication feature data of the unknown user to the received enrolled feature data of the known user.
- vehicle computing system e.g., vehicle computing system 100
- a computing device may utilize user data associated with a user of computing device 310 only if the computing device receives permission from the user of the computing device to utilize the data. For example, before a computing device or computing system can collect or may make use of information associated with a user, the user may be provided with an opportunity to provide input to control whether programs or features of the computing device and/or computing system can collect and make use of user information.
- certain information may be treated in one or more ways before it is stored or used by the computing device and/or computing system, so that personally identifiable information is removed. For example, a user's identity may be treated so that no personally identifiable information can be determined about the user.
- the computing device may store feature data and/or an image template for an image without storing the image itself, and may associate the feature data and/or image template with a user identifier that is not associated with any other user information.
- the user may have control over how information is collected about the user and used by the computing device and computing system.
- techniques of this disclosure may enable computing device 310 to capture feature data of a known user 338 that are included within several different pose buckets.
- computing device 310 may increase the number of pose buckets that include feature data of the known user 338 , which may enable computing device 310 may more accurately authenticate feature data of unknown users regardless of the pose bucket of an authentication feature data for the unknown user (e.g., user 150 ).
- increasing the number of pose buckets that include enrolled feature data may increase the probability that the pose bucket associated with the authentication feature data of the unknown user is similar to the pose bucket that includes one or more of the group of enrolled feature data of the known user 338 , which may reduce the probability of falsely rejecting the unknown user when the unknown user is in fact a known, authorized user, thus potentially improving the user experience.
- reducing the probability of false rejections may reduce the number of authentication attempts (e.g., facial recognition, finger print recognition, PIN or passcodes, etc.) used for the computing device to enter an increased access mode, which may reduce the amount of processing cycles utilized by the processor and improve battery life.
- the described techniques may reduce the probability of falsely authenticating the unknown user when the unknown user is not a known, authorized user, which may increase security of computing device 310 .
- FIG. 4 is a block diagram illustrating an example computing system 410 , in accordance with one or more aspects of the present disclosure.
- Computing system 410 may, in some cases, be a more detailed example of computing device 310 of FIG. 3 , mobile computing device 170 of FIG. 1 , and/or of vehicle computing system 100 of FIG. 1 .
- FIG. 4 illustrates only one particular example of computing system 410 , and many other examples of computing system 410 may be used in other instances and may include a subset of the components included in example computing system 410 or may include additional components not shown in FIG. 4 .
- image data refers to data (e.g., feature data) that is computed or otherwise determined from a captured image of, e.g., a face of a user.
- computing system 410 includes PSD 412 , one or more image capture devices 414 , one or more processors 430 , one or more input components 442 , one or more output components 444 , one or more communication units 446 , and one or more storage devices 448 .
- Storage devices 448 of computing system 410 include FRM 422 and enrolled information 428 .
- Communication channels 449 may interconnect each of the components 412 , 414 , 430 , 442 , 444 , 446 , and/or 448 for inter-component communications (physically, communicatively, and/or operatively).
- communication channels 449 may include a system bus, a network connection, one or more inter-process communication data structures, or any other components for communicating data (also referred to as information).
- Image capture devices 414 may include one or more cameras, such as digital cameras, still cameras, motion picture cameras, and the like. Image capture devices 414 may include any other devices capable of capturing and storing still or moving images. In some examples, image capture devices 414 may be capable of digitally recording images via an electronic image sensor. Image capture devices 414 may include one or more devices configured to detect visible light (e.g., visible light cameras), one or more devices configured to detect near-infrared light (e.g., near-infrared cameras), or a combination therein. In some examples, image capture devices 414 may generate image data indicative of 2-dimensional images, data indicative of 3-dimensional images, or a combination therein. In this way, a plurality of image capture devices 414 may capture visible light, near-infrared light, or a combination therein, and may generate image data indicative of 2-dimensional images, 3-dimensional images, or both.
- visible light e.g., visible light cameras
- near-infrared light e.g., near-infrared cameras
- One or more communication units 446 of computing system 410 may communicate with external devices by transmitting and/or receiving data.
- computing system 410 may use one or more of communication units 446 to transmit and/or receive radio signals on a radio network such as a cellular radio network.
- communication units 446 may transmit and/or receive satellite signals on a satellite network such as a Global Positioning System (GPS) network.
- GPS Global Positioning System
- Examples of communication units 446 include a network interface card (e.g. such as an Ethernet card), an optical transceiver, a radio frequency transceiver, a GPS receiver, or any other type of device that can send and/or receive information.
- communication units 446 may include short wave radios (e.g., NFC, BLUETOOTH (including BLE)), GPS, 3G, 4G, 5G, and WIFI radios found in mobile devices as well as Universal Serial Bus (USB) controllers and the like.
- short wave radios e.g., NFC, BLUETOOTH (including BLE)
- GPS 3G, 4G, 5G, and WIFI radios found in mobile devices as well as Universal Serial Bus (USB) controllers and the like.
- USB Universal Serial Bus
- One or more input components 442 of computing system 410 may receive input. Examples of input are tactile, audio, kinetic, and optical input, to name only a few examples.
- Input components 442 of computing system 410 include, in one example, a mouse, keyboard, voice responsive system, video camera, buttons, control pad, microphone or any other type of device for detecting input from a human or machine.
- input component 442 may be a presence-sensitive input component, which may include a presence-sensitive screen, touch-sensitive screen, etc.
- One or more output components 444 of computing system 410 may generate output. Examples of output are tactile, audio, and video output.
- Output components 444 of computing system 410 include a presence-sensitive screen, sound card, video graphics adapter card, speaker, cathode ray tube (CRT) monitor, liquid crystal display (LCD), or any other type of device for generating output to a human or machine.
- Output components may include display components such as cathode ray tube (CRT) monitor, liquid crystal display (LCD), Light-Emitting Diode (LED) or any other type of device for generating tactile, audio, and/or visual output.
- PSD 412 of computing system 410 may include functionality of input component 442 and/or output components 444 .
- PSD 412 may include a presence-sensitive input component 464 , such as a presence-sensitive screen or touch-sensitive screen.
- presence-sensitive input component 464 may detect an object at and/or near the presence-sensitive input component.
- Presence-sensitive input component 464 may determine a location (e.g., an (x,y) coordinate) of the presence-sensitive input component at which the object was detected.
- presence-sensitive input component 464 may detect an object two inches or less from presence-sensitive input component 464 and other ranges are also possible. Presence-sensitive input component 464 may determine the location of presence-sensitive input component 464 selected by a user's finger using capacitive, inductive, and/or optical recognition techniques.
- PSD 412 may also provide output to a user using tactile, audio, or video stimuli as described with respect to output component 444 .
- PSD 412 may include display component 462 that displays a graphical user interface. Display component 462 may be any type of output component that provides visual output, such as described with respect to output components 444 .
- PSD 412 may, in some examples, be an external component that shares a data or information path with other components of computing system 410 for transmitting and/or receiving input and output.
- PSD 412 may be a built-in component of computing system 410 located within and physically connected to the external packaging of computing system 410 (e.g., a screen on a mobile phone).
- PSD 412 may be an external component of computing system 410 located outside and physically separated from the packaging of computing system 410 (e.g., a monitor, a projector, etc. that shares a wired and/or wireless data path with a tablet computer).
- PSD 412 when located outside of and physically separated from the packaging of computing system 410 , may be implemented by two separate components: a presence-sensitive input component 464 for receiving input and a display component 462 for providing output.
- One or more storage components 448 within computing system 410 may store information for processing during operation of computing system 410 (e.g., computing system 410 may store data accessed by FRM 422 during execution at computing system 410 ).
- storage component 448 is a temporary memory, meaning that a primary purpose of storage component 448 is not long-term storage.
- Storage components 448 on computing system 410 may be configured for short-term storage of information as volatile memory and therefore not retain stored contents if powered off. Examples of volatile memories include random access memories (RAM), dynamic random-access memories (DRAM), static random-access memories (SRAM), and other forms of volatile memories known in the art.
- Storage components 448 also include one or more computer-readable storage media.
- Storage components 448 in some examples include one or more non-transitory computer-readable storage mediums.
- Storage components 448 may be configured to store larger amounts of information than typically stored by volatile memory.
- Storage components 448 may further be configured for long-term storage of information as non-volatile memory space and retain information after power on/off cycles. Examples of non-volatile memories include magnetic hard discs, optical discs, flash memories, or forms of electrically programmable memories (EPROM) or electrically erasable and programmable (EEPROM) memories.
- Storage components 448 may store program instructions and/or information (e.g., data) associated with FRM 422 .
- Storage components 448 may include a memory configured to store data or other information associated with FRM 422 and enrolled information 428 .
- processors 430 may implement functionality and/or execute instructions associated with computing system 410 .
- Examples of processors 430 include application processors, display controllers, auxiliary processors, one or more sensor hubs, and any other hardware configure to function as a processor, a processing unit, or a processing device.
- FRM 422 may be operable by processors 430 to perform various actions, operations, or functions of computing system 410 .
- storage devices 448 may include information and/or modules that are executable by processors 430 to implement corresponding functionality of vehicle computing system 100 (e.g., as illustrated and/or described in reference to system 100 in FIG. 1 and/or FIG. 5 ).
- processors 430 of computing system 410 may retrieve and execute instructions stored by storage components 448 that cause processors 430 to perform the operations described herein that are attributed to FRM 422 .
- the instructions when executed by processors 430 , may cause computing system 410 to store information within storage components 448 .
- FRM 422 may include all functionality of FRM 322 of computing device 310 of FIG. 3 and may perform similar operations as FRM 322 for performing facial recognition to authenticate a user of computing system 410 .
- FRM 422 may include an enrollment module 424 and an authentication module 426 .
- enrollment module 424 may perform an enrollment process to associate image data of a known user of computing system 410 with a user account for the known user. In some examples, enrollment module 424 may perform the enrollment process one time for a given user account, for instance, when setting up a new account.
- image capture devices 414 captures one or more image data 330 (of FIG. 1 ) of a known user (e.g., a user logged in to a user account associated) of computing system 410 and generates image data indicative of each of the images.
- Enrollment module 424 of FRM 422 may receive the image data from image capture device 414 and analyze the image data to assign each of image data 330 to one or more pose buckets 332 (of FIG. 1 ).
- Enrollment module 424 may determine which of pose buckets 332 is associated with image data based on characteristics or landmarks of the face of the known user included in the image data. For instance, enrollment module 424 may detect landmarks in the image data of the unknown user's face, such as the user's eyes, nose, and mouth, and may determine the yaw and pitch angle of the face based on the landmarks. For example, enrollment module 424 may receive image data 330 A and determine the yaw angle of the user's face in image data 330 A is approximately 0 degrees and the pitch angle of the user's face in image data 330 A is approximately 0 degrees based on the landmarks in image data 330 A.
- Enrollment module 424 may determine that pose bucket 332 CC includes a range of yaw angles from ⁇ 5 degrees to 5 degrees and a range of pitch angles from ⁇ 5 degrees to 5 degrees (e.g., pose bucket 332 CC is centered on 0 degrees yaw and 0 degrees pitch). In such examples, enrollment module 424 may determine that the yaw and pitch angles of the user's face in image data 330 A are within the range of pitch and yaw angles for pose bucket 332 CC, such that enrollment module 424 determines that image data 330 A should be included within pose bucket 332 CC. In some examples, enrollment module 424 may determine the amount of roll of the user's face in image data 330 A. For example, enrollment module 424 may determine which of pose buckets 332 include image data 330 A based on the yaw angle, pitch angle, and roll of the user's face in image data 330 A.
- enrollment module 424 determines whether image data 330 A should be included in a plurality of pose buckets 332 .
- enrollment module 424 may include image data 330 A within a given pose bucket when the yaw and pitch angles of image data 330 A are within a predefined distance (e.g., a radius of 10 degrees) of the center of the given pose bucket 332 .
- enrollment module 424 may determine that the yaw and pitch angles of the user's face in image data 330 A are 0 degrees and 0 degrees, respectively.
- the predefined distance may be 10 degrees and pose bucket 332 BC may be centered at 0 degrees yaw and 10 degrees pitch, such that enrollment module 424 may determine that the yaw and pitch angles of image data 330 A lie within the predefined distance of the center of pose bucket 332 BC.
- enrollment module 424 may include image data 330 A in pose bucket 332 BC.
- enrollment module 424 may determine that the yaw and pitch angles for image data 330 A are within the predefined distance of the center of pose buckets 332 CB, 332 CD, and 332 DC, and include image data 330 A in pose buckets 332 CB, 332 CD, and 332 DC in addition to pose buckets 332 CC and 332 BC.
- enrollment module 424 receives image data 330 B after receiving image data 330 A.
- Enrollment module 424 may determine whether to include image data 330 B in any of pose buckets 332 .
- Enrollment module 424 may determine the yaw angle of the user's face in image data 330 B is approximately 0 degrees and the pitch angle of the user's face in image data 330 B is approximately 19 degrees based on the landmarks in image data 330 B.
- Enrollment module 424 may determine that pose bucket 332 AC includes a range of yaw angles from ⁇ 5 degrees to 5 degrees and a range of pitch angles from 15 degrees to 25 degrees (e.g., pose bucket 332 AC is centered on 0 degrees yaw and 20 degrees pitch).
- enrollment module 424 may determine that the yaw and pitch angles of the user's face in image data 330 B are within the range of pitch and yaw angles for pose bucket 332 AC. Enrollment module 424 may determine whether pose bucket 332 AC includes one of image data 330 and may include image data 330 B in pose bucket 332 AC in response to determining that pose bucket 332 AC does not already include image data.
- Enrollment module 424 may determine whether to include image data 330 B in any other pose buckets 332 . For example, enrollment module 424 may determine whether the yaw and pitch angles for image data 330 B are within the predefined distance of the center of other pose buckets 332 . For instance, enrollment module 424 may determine that the predefined distance is 10 degrees and pose bucket 332 BC is centered at 0 degrees yaw and 10 degrees pitch, such that enrollment module 424 may determine that the yaw and pitch angles of image data 330 B (e.g., 0 degrees yaw, 19 degrees pitch) lie within the predefined distance of the center of pose bucket 332 BC. Enrollment module 424 may determine whether to include image data 330 B in pose bucket 332 BC in response to determining that the yaw and pitch angles for image data 330 B are within the predefined threshold of the center of pose bucket 332 BC.
- enrollment module 424 determines that pose bucket 332 BC already includes image data 330 A and determines whether to replace image data 330 A with image data 330 B in pose bucket 332 BC. In one example, enrollment module 424 may determine whether to replace image data 330 A with image data 330 B in pose bucket 332 BC based on a distance between the center of pose bucket 332 BC and the respective yaw and pitch angles for image data 330 A, 330 B.
- enrollment module 424 may determine the yaw and pitch angles for image data 330 A are 0 degrees, 0 degrees; the yaw and pitch angles for image data 330 B are 0 degrees, 19 degrees, and the yaw and pitch angles for the center of pose bucket 332 BC are 0 degrees, 10 degrees. In such examples, enrollment module 424 may determine that image data 330 B is closer to the center of pose bucket 332 BC than image data 330 A, and may replace image data 330 A with image data 330 B within pose bucket 332 BC.
- enrollment module 424 may determine whether to include image data 330 A or image data 330 B within pose bucket 332 BC based on the order of receiving the image data 330 A, 330 B. For example, enrollment module 424 may include the oldest image data (e.g., the image data that was received first) within pose bucket 332 BC. In such examples, enrollment module 424 may determine that image data 330 A should be included within pose bucket 332 BC as the image data 330 A was received first. In another example, enrollment module 424 may include the most recent image data within pose bucket 332 BC. In these examples, enrollment module 424 may determine that image data 330 B should be included within pose bucket 332 BC.
- the oldest image data e.g., the image data that was received first
- enrollment module 424 may determine that image data 330 A should be included within pose bucket 332 BC as the image data 330 A was received first.
- enrollment module 424 may include the most recent image data within pose bucket 332 BC. In these examples, enrollment module 424 may determine that image data 330 B should be included
- Enrollment module 424 may receive data indicative image data 330 from image capture devices 414 and determine whether a threshold number of pose buckets 332 include one of image data 330 . For example, enrollment module 424 may determine a number of pose buckets 332 that include image data 330 of the face of the known user. Responsive to determining the number of pose buckets 332 , enrollment module 424 determines whether the number of pose buckets that include image data 330 satisfies (e.g., is greater than or equal to) a threshold number of pose buckets.
- enrollment module 424 may determine that the number of pose buckets 332 that include image data 330 satisfies the threshold number of pose buckets in response to determining that image data 330 are included within at least 75% of pose buckets 332 . Responsive to determining that the number of pose buckets that include image data 330 does not satisfy the threshold number of pose buckets, enrollment module 424 may cause image capture devices 414 to capture one or more additional image data 330 for the enrollment process.
- enrollment module 424 may associate data indicative of the image data 330 with a user account for the known user in response to determining that the number of pose buckets that include image data 330 satisfies the threshold number of pose buckets.
- image data 330 may include the images themselves or image templates for each respective image (e.g., computed facial feature data).
- the image templates may include a vector with a plurality of element values (e.g., 50 values, 100 values, 500 values, etc.).
- each element value of the vector corresponds to a feature of the user's face (e.g., distance between the eyes, nose shape, etc.).
- element values of the vector may be generated through a non-linear machine learned model trained to generate outputs indicative of a facial identity.
- enrollment module 424 may apply a trained facial recognition model to the plurality of image data 330 and output an image template (e.g., a vector) for each respective image data 330 .
- enrollment module 424 associates the data indicative of image data 330 with the user account by assigning each image template an image template identifier and associating the respective image template identifier with a user account identifier for the user account for the known user.
- Enrollment module 424 may store the data indicative of image data 330 (e.g., the images themselves or the image templates) to enrolled information 428 .
- enrollment module 424 encrypts the data indicative of image data 330 prior to storing the data indicative of image data 330 .
- Image data 330 may be stored locally on computing system 410 such that the data is not transmitted over a network to any other devices. Further, computing system 410 may provide the user with an opportunity to delete the data indicative of image data 330 .
- authentication module 426 performs the authentication process in response to receiving data indicative of an authentication image of a face of an unknown user 150 (of FIG. 1 ) using authentication module 426 .
- the data indicative of the image of the face of the unknown user may include the image itself or an image template representing characteristics of the unknown user's face.
- authentication module 426 may determine whether unknown user 150 is a known user 338 . Authentication module 426 may determine whether unknown user 150 is a known user 338 based on the authentication image data of user 150 and one or more enrollment image data 330 . In some instances, authentication module 426 determines whether unknown user 150 is a known user 338 using a pose-independent (also referred to as pose invariant) technique or a pose-dependent technique.
- a pose-independent also referred to as pose invariant
- authentication module 426 determines whether the unknown user 150 is the known user 338 based on the authentication image data and a particular image data of image data 330 that includes the face of the known user in a pose closest to the pose of the face of unknown user 150 in the authentication image.
- the particular image data 330 that includes the face of the known user 338 in the pose closest to the pose of the face in the authentication image data may be the particular image of image data 330 that is included in the same pose bucket as a pose bucket 332 associated with the authentication image data.
- Authentication module 426 may determine a pose bucket of pose buckets 332 associated with authentication image data of unknown user 150 . For example, authentication module 426 may determine the pose bucket associated with this image data based on characteristics or landmarks of the face of the unknown user. For instance, authentication module 426 may determine the yaw angle and the pitch angle of the face in the authentication image data. Responsive to determining the yaw and pitch angles of the face in the authentication image data, authentication module 426 may determine which of pose buckets 332 include the yaw and pitch angle of the face in the authentication image data, and determine that pose bucket (e.g., pose bucket 332 CD) is the pose bucket associated with the authentication image data.
- pose bucket e.g., pose bucket 332 CD
- authentication module 426 may determine a roll of the face of unknown user 150 in the authentication image data. Authentication module 426 may determine which of pose buckets 332 is associated with the authentication image data based on the yaw angle, the pitch angle, and the roll of the face in the authentication image data.
- Authentication module 426 may determine which of image data 330 is included within the pose bucket associated with the authentication image data (e.g., pose bucket 332 CD). In an example where pose bucket 332 CD is associated with the authentication image data, authentication module 426 may query enrolled information 428 and determine that image data 330 G is included within pose bucket 332 CD. Responsive to determining that image data 330 G is included within the pose bucket associated with the authentication image data, authentication module 426 may determine whether user 150 is the known user 338 by determining a similarity score for the selected image data 330 G. In some examples, the similarity score for image data 330 G indicates a similarity between image data 330 G and the authentication image data.
- the similarity score for image data 330 G indicates a similarity between image data 330 G and the authentication image data.
- Authentication module 426 may determine a similarity score for image data 330 G based on the data indicative of image data 330 G and the data indicative of the authentication image data.
- the data indicative of image data 330 G includes an image template for image data 330 G.
- Such an image template may be represented as a vector and may be generated by a trained facial recognition model.
- the vector may include a plurality of element values that each correspond to a respective feature of the user's face (e.g., distance between the eyes, nose shape, etc.).
- the authentication image data may include a vector generated in a similar manner.
- authentication module 426 determines the similarity score by calculating an angle between a vector representing image data 330 G and a vector representing the authentication image data.
- authentication module 426 may determine the similarity score for image data 330 G by determining a cosine similarity between a vector representing image data 330 G and a vector representing the authentication image data.
- authentication module 426 determines whether the similarity score for image data 330 G satisfies a threshold similarity score. As one example, authentication module 426 determines the similarity score for image data 330 G by determining the angle between the vector representing image data 330 G and the vector representing the authentication image data, and determines that the similarity score for image data 330 G satisfies the threshold similarity score in response to determining that the similarity score is less than the threshold similarity score.
- authentication module 426 determines the similarity score for image data 330 G by determining the cosine similarity between the vector representing image data 330 G and the vector representing the authentication image data, and determines that the similarity score for image data 330 G satisfies the threshold similarity score in response to determining that the similarity score is greater than the threshold similarity score.
- Authentication module 426 may determine unknown user 150 is the known user 338 in response to determining that the similarity score for image data 330 G satisfies the threshold similarity score. Similarly, authentication module 426 may determine that unknown user 150 is not the known user 338 in response to determining that the similarity score for image data 330 G does not satisfy the threshold similarity score.
- authentication module 426 determines a respective similarity score for each of image data 330 to determine whether the unknown user 150 is the known user 338 .
- the respective similarity score indicates a similarity between the corresponding image data of image data 330 and the authentication image data.
- authentication module 426 may determine a respective similarity score for each of image data 330 based on the data indicative of the respective image data 330 and the data indicative of the authentication image data.
- the data indicative of image data 330 includes a respective image template.
- Such an image template may be represented as a vector and may be generated by a trained facial recognition model.
- the vector may include a plurality of element values that each correspond to a respective feature of the user's face.
- the data indicative of the authentication image data may include a vector that includes a plurality of element values that each correspond to a respective feature of the face of unknown user 150 .
- authentication module 426 determines the respective similarity score for each of image data 330 by calculating an angle between the respective vector and the vector representing the authentication image data.
- authentication module 426 may determine the respective similarity score for each of image data 330 by determining a cosine similarity between the respective vector for each of image data 330 and the vector representing the authentication image data.
- authentication module 426 selects image data of image data 330 based on the respective similarity scores for image data 330 to determine whether unknown user 150 is the known user 338 .
- Authentication module 426 selects the single image data of image data 330 with the similarity score indicative of a closest match to the authentication image data.
- authentication module 426 determines the respective similarity scores for image data 330 based on an angle between each vector representing a respective image data of image data 330 and the vector representing the authentication image data and determines that the score indicative of the closest match is the lowest similarity score (e.g., the smaller angle between two vectors the closer the vectors are to one another).
- authentication module 426 determines the respective similarity scores for image data 330 based on a cosine similarity between each vector representing a respective image data of image data 330 and the vector representing the authentication image data and determines that the score indicative of the closest match is the highest similarity score (e.g., the larger cosine value between two vectors the more the vectors are more similar).
- authentication module 426 selects two or more image data of image data 330 based on the respective similarity scores to determine whether unknown user 150 is the known user 338 .
- authentication module 426 determines a composite similarity score for two or more image data 330 .
- authentication module 426 may determine the composite similarity score based on the highest similarity scores for two or more image data 330 or the lowest similarity scores for two or more image data 330 .
- authentication module 426 may determine the composite similarity score based on the average of the respective similarity scores for two or more of image data 330 and may compare the composite similarity score to the threshold similarity score to determine whether unknown user 150 is the known user 338 .
- authentication module 426 may compare each respective similarity score for the two or more image data to the threshold similarity score. In such examples, authentication module 426 may determine that unknown user 150 is the known user 338 in response to determining that a threshold number (e.g., 100%, 80%, 60% etc.) of the selected image data have a similarity score that satisfies the threshold similarity score. For instance, authentication module 426 may determine that the set of selected image data include the three image data of image data 330 with the highest similarity scores, and may determine that unknown user 150 is the known user 338 in response to determining that the similarity score for two of the three selected image data satisfies the threshold similarity score.
- a threshold number e.g., 100%, 80%, 60% etc.
- storage devices 448 may also store one or more models 429 .
- computing device 310 may utilize deep learning models to extract facial landmarks that help in identifying an individual, as described above. These models may be tailored to execute on the dedicated neural engines or processors of the personal devices.
- an image template may generally correspond to a statistical model of one or more features (e.g., biometric features) of a user's face.
- FRM 322 may generate an image template that includes a vector with a plurality of element values (e.g., 50 values, 100 values, 500 values, etc.).
- each element value of the vector corresponds to a feature of the user's face (e.g., distance between the eyes, nose shape, etc.).
- element values of the vector may be generated through a, e.g., non-linear machine learned model trained to generate outputs indicative of a facial identity.
- FRM 322 may apply a trained facial recognition model to the plurality of image data 330 and output an image template (e.g., embedding) for each respective image data 330 as a vector.
- element values of the vector may be generated through a non-linear machine learned model trained to generate outputs indicative of a facial identity.
- enrollment module 424 may apply a trained facial recognition model to the plurality of image data 330 and output an image template (e.g., a vector) for each respective image data 330 .
- an image template may be represented as a vector and may be generated by a trained facial recognition model.
- the vector may include a plurality of element values that each correspond to a respective feature of the user's face (e.g., distance between the eyes, nose shape, etc.).
- the biometric feature information or identifier associated with a known user may include the various enrolled image data and/or features for that user that are stored in enrolled information 428 , which may include image data and/or feature information for the various different poses for image data 330 .
- This biometric information stored in enrolled information 428 may, in some cases, also include a unique identifier of the user, and/or information for a user profile or account that is specific to the user.
- the machine models may be stored locally (e.g., in models 429 of computing device 410 ), or may be stored remotely from device 410 (e.g., on one or more remote servers).
- computing device 310 may send authentication data to THU system 100 of the vehicle. For example, computing device 310 may send data indicative of a successful authentication of unknown user 150 as the known user 338 , thereby indicating authentication of user 150 . In some cases, computing device 310 may send user profile and/or account information that is customized or personalized for user 338 (e.g., authenticated user 150 ) to system 100 . In some cases, computing device 310 may also send a unique identifier of user 338 and/or of computing device 310 to system 100 of the vehicle.
- FIG. 5 is a diagram illustrating an example driver onboarding and authentication process, in accordance with one or more aspects of the present disclosure
- FIG. 6 is a diagram illustrating an example facial enrollment process, in accordance with one or more aspects of the present disclosure.
- Enrolling or onboarding a user 150 may be done on the user's personal device (e.g., mobile computing device 170 of FIG. 1 , computing device 310 of FIG. 3 ), such as at the time of account creation on IHU system 100 .
- This process involves running an enrollment application on the personal device that captures the biometric features of the user's face, using the front facing camera on the device, such as described above, e.g., in reference to FIG. 3 .
- the personal device is registered/paired as a trusted device with the IHU (e.g., IHU system 100 of FIG. 1 ).
- the camera of the personal device e.g., one or more of image capture devices 314 of computing device 310
- NIR near infrared
- RGB red-green-blue
- the personal device is recognized as a trusted device with respect to the IHU, and is assigned a unique identifier by the IHU.
- This trusted device, and its corresponding unique identifier are associated by the IHU with the user (e.g., user 150 ) of the vehicle, and any profile and/or account information for that user.
- the enrollment process may include the following aspects.
- Various personal computing devices or systems such as mobile computing device 170 and/or computing device 310 , may be equipped with front-facing image capture devices 173 or 314 , such as cameras. These cameras may use or provide NIR and/or RGB frames 551 for purposes of face identification, where these frames 551 may be one example of images 333 shown in FIG. 3 .
- the NIR spectrum may, in some cases, provides better adaptability to different lighting conditions.
- the personal device may utilize deep learning models to extract facial landmarks that help in identifying an individual, as described above. These models may be tailored to execute on the dedicated neural engines or processors of the personal devices.
- an image template may generally correspond to a statistical model of one or more features (e.g., biometric features) 553 of a user's face, where features 553 may be one example of feature data 330 .
- computing device 310 may include model information within models 329 that is based on trained image and/or feature data for images taken with an NIR camera and/or an RGB camera.
- computing device 310 may distill the information (e.g., feature data) in models 329 from one color space (RGB space) to another color space (NIR space). As a result, the cameras used within the vehicle of FIG.
- the vehicle may include NIR cameras, but image capture devices 314 may include RGB cameras.
- the cameras used within the vehicle of FIG. 1 may be of the same type as image capture devices 314 included in computing device 310 .
- FRM 322 may generate an image template that includes a vector with a plurality of element values (e.g., 50 values, 100 values, 500 values, etc.).
- each element value of the vector corresponds to a feature of the user's face (e.g., distance between the eyes, nose shape, etc.).
- element values of the vector may be generated through a, e.g., non-linear machine learned model trained to generate outputs indicative of a facial identity.
- FRM 322 may apply a trained facial recognition model to frames 551 and output an image template (e.g., embedding) for respective facial features in features 553 as a vector.
- element values of the vector may be generated through a non-linear machine learned model trained to generate outputs indicative of a facial identity.
- enrollment module 424 may apply a trained facial recognition model to frames 551 and output an image template (e.g., a vector) for respective facial features in features 553 .
- an image template may be represented as a vector and may be generated by a trained facial recognition model.
- the vector may include a plurality of element values that each correspond to a respective feature of the user's face (e.g., distance between the eyes, nose shape, etc.).
- the biometric feature information or identifier 555 associated with a known user may include the various enrolled features for that user that are stored on the trusted device (e.g., in enrolled information 328 ), and the stored information may include feature information for the various different poses.
- This biometric information stored in enrolled information 328 may also include a unique identifier of the user, and/or information for a user profile or account that is specific to the user.
- the machine models may be stored locally (e.g., on mobile computing device 170 , computing device 310 , computing device 410 , THU system), or may be stored remotely from these systems (e.g., on one or more remote servers).
- the trusted mobile computing device may send the enrolled feature data to a vehicle computing system (e.g., vehicle computing system 100 ), which then may be configured to perform authentication of the unknown user in the vehicle by comparing the authentication feature data of the unknown user to the received enrolled feature data of a known user.
- vehicle computing system 100 receives, from the mobile computing device, the feature data (e.g., feature data or identifier 555 ) associated with the at least one image of the face of the previously enrolled user.
- Vehicle computing system 100 may then be configured to this received feature data with the feature data (e.g., features 559 ) associated with the captured image of the face of the user inside the vehicle.
- Vehicle computing system 100 may then determine, based on the comparing, the match between the user of the vehicle and the previously enrolled user, where the authentication data processed by vehicle computing system 100 comprises an indication of the match.
- the computing device may create a gallery of facial image data (e.g., feature data) against different face poses of the same user (e.g., known user 338 ).
- the face pose estimation is comprised of pan and tilt values of the face.
- the poses for which facial features are registered can appear as a pattern on the personal device. These patterns guide the user to move their head capturing different poses of the face, such as shown in FIG. 6 .
- FIG. 6 illustrates a screen display of how this can be accomplished inside an application on the personal device.
- a known user 338 may utilize one or more of image capture devices 314 to view one or more images of user 338 that are output at PSD 312 .
- a representation of these images may be displayed at PSD 312 within a graphical frame 600 that is visible by user 338 .
- Computing device 310 may output instructions to user 338 via PSD 312 , such as shown in FIG. 6 .
- computing device 310 may output instructions to user 338 to move his/her face relative to image capture device 314 in order to position the user's 338 face within frame 600 , and then to gently rotate user's 338 head until all dots shown in frame 600 turn a certain color (e.g., green), or switch from unfilled dots to fully filled dots. As shown in FIG. 6 , nine dots included in frame 600 are currently unfilled, indicating that user 338 should rotate his/her head until all of the dots turn green or switch from an unfilled to a filled state or color (e.g., black). Once user 338 has done so, image capture device 314 will capture and store images for that given pose in enrolled information (e.g., data store) 328 .
- enrolled information e.g., data store
- Computing device 310 may instruct user 338 to strike various different poses when capturing feature data 330 for various different poses, based on different pitch and yaw angles, such as previously described in reference to FIG. 3 .
- Computing device 310 may iterate the above process in reference to FIG. 6 for each of these different poses of user 338 .
- the facial feature data of corresponding features/landmarks, for the various different facial poses, may then be stored in an enrolled galley in enrolled information 328 .
- enrolled information 328 may comprise a secure enclave of the personal device 310 .
- Computing device 310 which is one example of mobile computing device 170 , may be registered with IHU system 100 of the vehicle shown in FIG. 1 , and, upon registration, may be a trusted companion device for the vehicle.
- IHU system 100 may assign a unique identifier to computing device 310 and/or known user 338 , which may be associated with the user's account on IHU system 100 .
- user 150 of the vehicle may have a user's account on IHU system 100 , which includes a personal profile for user 150 , associated with information that may be shown on display 112 of IHU system 100 that is customized or personalized for user 150 , as described previously in reference to FIGS. 1-2 .
- the unique identifier assigned by IHU system 100 to computing device 310 and/or known user 338 may be associated with the user account or profile of user 150 , when user 150 is authenticated as known user 338 of device 310 .
- user 150 may also have a wearable device 107 (e.g., smartwatch).
- mobile computing device 170 may, in some examples, communicate with wearable device 107 to provide the information included in enrolled information 328 for storage on wearable device 107 .
- wearable device 107 may be configured to store the information included in enrolled information 328 .
- wearable device 107 may perform the authentication functionality previously described in reference to FIG. 3 .
- wearable device 107 may be one example of computing device 310 shown in FIG. 3 .
- user 150 of the vehicle may not necessarily have a separate mobile computing device 170 in the vehicle or in proximity to IHU system 100 .
- IHU system 100 may communicate directly with wearable device 107 for authenticating user 150 .
- THU system 100 may communicate with wearable device 107 and/or mobile computing device 170 using a wireless communication protocol (e.g., BLUETOOTH, WIFI, BLE).
- a wireless communication protocol e.g., BLUETOOTH, WIFI, BLE.
- the image frames 557 from the in-car driver facing camera 104 / 111 are used to compute biometric features 559 of the face of user 150 , who may be seated in the driver seat.
- These features 559 are then sent ( 563 ) from THU system 100 to computing device 310 (e.g., mobile computing device 170 , wearable device 107 ), which may occur over a secure channel for, e.g., non-spoofed features, such as described further below.
- Computing device 310 may be a trusted, previously registered device with respect to system 100
- the software on computing device 310 determines if there is a match in the facial features of the current driver 150 and the enrolled, known user 338 . If there is a match, computing device 310 sends IHU system 100 data indicative of the match, and IHU system 100 receives ( 565 ) this data indicative of a match as representing authentication of the registered user, and proceeds to log in the registered user (e.g., user 150 of the vehicle).
- models similar to the enrollment models may be used by IHU system 100 .
- computing device 310 may utilize one or more models 329 (e.g., enrollment models) to identify feature vector data for images of known user 338 taken by image capture devices 314 .
- IHU system 100 may utilize similar models for identifying feature vector data for images of user 150 taken by camera 104 and/or 111 in the vehicle.
- IHU system 100 may, for example, utilize these models to determine the facial features of user 150 based on, e.g., NIR images acquired by camera 104 , 111 .
- These features may, in some cases, be encrypted and sent by IHU system 100 to the trusted device (e.g., computing device 310 , mobile computing device 170 ) for authentication user 150 .
- the trusted device e.g., computing device 310 , mobile computing device 170
- IHU system 100 may establish a secure communication channel with the previously registered trusted device. This channel can be established over, e.g., BLUETOOTH/BLE, or other wireless interfaces.
- the trusted device may, upon receiving the incoming feature data (e.g., feature vector data) from IHU system 100 , may compare such data against the gallery of saved feature vectors, and a match is determined, such as described previously (e.g., in reference to FIG. 3 ). If there is a match, an indication of this match, and/or authentication data for the user (e.g., registered token) is sent to the IHU system 100 over the secure channel, which may be used or otherwise serve as the trigger to log in user 150 into the IHU system 100 .
- feature data e.g., feature vector data
- the trusted device may, upon receiving the incoming feature data (e.g., feature vector data) from IHU system 100 , may compare such data against the gallery of saved feature vectors, and a match is determined, such as described previously (e.g., in reference to FIG. 3 ). If there is a match, an indication of this match, and/or authentication data for the user (e.g., registered token) is sent to the IHU system 100 over
- an intruder with a stolen personal device can attempt to unlock the IHU system 100 using, e.g., a three-dimensional mask of the owner (e.g., user 150 ), or other images (e.g., paper images) of the owner's face.
- the deep learning models used by IHU system and/or computing device 310 may be capable of spotting or otherwise detecting such a spoofing attempt, as part of spoof detection 561 , and abort the authentication procedure. For example, these models may be trained with various different objects and types of objects (e.g., human objects, paper objects, plastic objects).
- the models may include various different feature data (e.g., feature vectors) that include features associated with images of these different types of objects, and the models may be used to identify features of, e.g., a three-dimensional mask or paper image of an owner, in an effort to abort any authentication procedures associated with such spoofing attempts.
- feature vectors e.g., feature vectors
- IHU system 100 may utilize deep learning models and/or inputs from other signals to determine the presence of user 150 , and/or an appropriate or optimal time to capture a facial image of user 150 using cameras 104 / 111 .
- IHU system 100 may utilize a learning model, trained with various image information to store corresponding feature vector information, to determine the gaze of user 150 .
- IHU system 100 may determine to capture a facial image of user 150 when the gaze of user 150 is towards the camera, or when the pose of user 150 is optimal for capturing an increased number of facial feature information for use in authenticating user 150 .
- system 100 may utilize additional signals provided by the vehicle.
- system 100 may use these signals (independently or in conjunction with gaze information gathered from the camera) to determine an appropriate or optimal time to capture a facial image of user 150 using cameras 104 / 111 .
- the cameras used by IHU system 100 may comprise NIR cameras, which may provide improved adaptability to ambient lighting and worn items like sunglasses.
- these systems and devices may use any number of other different types of cameras (e.g., RGB cameras, NIR cameras, etc.).
- the techniques of the present disclosure address the challenge of reliable authentication of a user (e.g., user 150 ) using biometric features computed from frames captured by different cameras (e.g., cameras of THU system 100 and cameras of computing device 310 ), where such comparison and authentication may be invariant or adaptable to various different lighting conditions.
- the cameras of IHU system 100 and computing device 310 may be of the same type or different types, which provides added flexibility.
- the models used in system 100 and/or computing device 310 may be trained using an approach that can generate matchable features for the same individual's face captured using different cameras.
- the disclosed techniques also reliably handle faces at differing distances/poses from the in-car camera of IHU system 100 , due to the robust processes and enrollment feature data stored for different user poses in computing device 310 .
- IHU system 100 and/or computing device 310 to compute biometric features allow the above workflow to work with different personal devices (e.g., personal devices running various different operating systems). Additionally, since the facial features of any user stays on the user's personal device (e.g., trusted device 310 ), there user's private biometric identity and information resides only on the user's personal device, and not on THU system, which may be included in a shared vehicle potentially used by various different individuals.
- the disclosed techniques also enable use cases where a user can log in to a rental/shared/leased car and have the same personalized experience without worrying about having to wipe out any personal information after the user is done using the vehicle.
- no personal information is stored in the IHU system 100 of such a vehicle, but is instead only stored on the user's personal device (e.g., device 310 ).
- another use of the disclosed techniques could be for ride-sharing applications to verify the identity of a registered driver (e.g., user 150 ), and prevent unregistered individuals, who may have stolen or are otherwise carrying the personal device of the registered driver, from being able to accept any ride requests using IHU system 100 .
- the disclosed techniques may confirm the identity or otherwise authenticate user 150 of the vehicle before allowing user 150 to operate the vehicle, in order to prevent unauthorized individuals from operating the vehicle.
- FIG. 7 is a flowchart illustrating example process performed by an example computing system, in accordance with one or more aspects of the present disclosure.
- the process illustrated in FIG. 7 may be performed by mobile computing device 170 ( FIG. 1 ), wearable device 107 ( FIG. 1 ), computing device 310 ( FIG. 3 ), and/or computing system 410 ( FIG. 4 ).
- the process of FIG. 7 will be described in reference to operations performed by computing device 310 .
- Computing device 310 may establish ( 702 ) a connection with a vehicle computing system (e.g., IHU system 100 ) of a vehicle.
- computing device 310 may utilize one or more communication units (e.g., communication units 446 ) to establish this connection.
- computing device 310 may receive ( 704 ), from the vehicle computing system, first feature data associated with at least one image of a face of a user (e.g., user 150 ) of the vehicle.
- the at least one image of the face of the user of the vehicle may captured by an image capture device (e.g., camera 104 , 111 ) connected to at least a portion of the vehicle.
- an image capture device e.g., camera 104 , 111
- Computing device 310 may then determine ( 706 ), based on a comparison between the first feature data and second feature data (e.g., feature data associated with images 330 stored in enrolled information 328 ) associated with at least one image of a face of a previously enrolled user (e.g., user 338 ), a match between the user of the vehicle and the previously enrolled user.
- Computing device 310 may authenticate ( 708 ), based on the match, the user of the vehicle, and send ( 710 ), to the vehicle computing system, authentication data for the user of the vehicle, where the authentication data is indicative of the match.
- FIG. 8 is a flowchart illustrating example process performed by an example vehicle computing system, in accordance with one or more aspects of the present disclosure.
- the process illustrated in FIG. 8 may be performed by vehicle computing system 100 of FIG. 1 and/or computing system 410 ( FIG. 4 ).
- the process of FIG. 8 will be described in reference to operations performed by vehicle computing system 100 .
- Vehicle computing system 100 may establish ( 802 ) a connection with a mobile computing device (e.g., mobile computing device 170 of FIG. 1 , wearable device 107 of FIG. 1 , computing device 310 of FIG. 3 ), where the vehicle computing system includes an infotainment head unit.
- Vehicle computing system 100 may determine ( 804 ) a presence of a user (e.g., user 150 ) inside a vehicle. After determining the presence of the user inside the vehicle, vehicle computing system 100 captures ( 806 ), using an image capture device (e.g., camera 104 and/or 111 ) that is connected to at least a portion of the vehicle, at least one image of a face of the user.
- an image capture device e.g., camera 104 and/or 111
- Vehicle computing system 100 determines ( 808 ) first feature data associated with the at least one image of the face of the user, and receives ( 810 ) authentication data for the user.
- the authentication data indicates a match between the user and a previously enrolled user.
- Vehicle computing system 100 then accesses ( 812 ), based on the authentication data for the user, user account information to log the user into the infotainment head unit.
- Example 1 A method comprising: establishing, by a mobile computing device, a connection with a vehicle computing system of a vehicle; after establishing the connection, receiving, by the mobile computing device and from the vehicle computing system, first feature data associated with at least one image of a face of a user of the vehicle, wherein the at least one image of the face of the user of the vehicle is captured by an image capture device connected to at least a portion of the vehicle; determining, by the mobile computing device and based on a comparison between the first feature data and second feature data associated with at least one image of a face of a previously enrolled user of the mobile computing device, a match between the user of the vehicle and the previously enrolled user; authenticating, by the mobile computing device and based on the match, the user of the vehicle; and sending, by mobile computing device and to the vehicle computing system, authentication data for the user of the vehicle, wherein the authentication data is indicative of the match.
- Example 2 The method of Example 1, wherein the vehicle computing system includes an infotainment head unit, and wherein sending, by the mobile computing device and to the vehicle computing system, the authentication data for the user of the vehicle causes the vehicle computing system to access user account information to log the user of the vehicle into the infotainment head unit.
- Example 3 The method of any of Examples 1-2, wherein the authentication data for the user of the vehicle includes unique identification information associated with at least one of the mobile computing device or the user of the vehicle.
- Example 4 The method of any of Examples 1-3, wherein establishing the connection comprises securely pairing the mobile computing device with the vehicle computing system of the vehicle, wherein the mobile computing device comprises a trusted device that is associated with a user account of the user of the vehicle on the vehicle computing system.
- Example 5 The method of any of Examples 1-4, wherein receiving the feature data associated with the at least one image of the face of the user of the vehicle comprises, by the mobile computing device and from the vehicle computing system, an encrypted copy of the feature data associated with the at least one image of the face of the user of the vehicle.
- Example 6 The method of any of Examples 1-5, wherein: the second feature data associated with the at least one image of the face of the previously enrolled user is associated with at least one feature vector and is included in at least one pose bucket from a plurality of pose buckets; and each pose bucket from the plurality of pose buckets is associated with a respective range of pitch angles of the face of the previously enrolled user and a respective range of yaw angles of the face of the previously enrolled user.
- Example 7 The method of Example 6, further comprising: selecting, by the mobile computing device, from the second feature data, feature data included in a particular pose bucket of the plurality of pose buckets, wherein the particular pose bucket is associated with the first feature data associated with the at least one image of the face of the user of the vehicle, and wherein determining the match between the user of the vehicle and the previously enrolled user is based on the first feature data and the selected feature data associated with the at least one image of the face of the previously enrolled user.
- Example 8 The method of Example 7, further comprising: determining, by the mobile computing device, a similarity score indicating a similarity between the first feature data associated with the at least one image of the face of the user of the vehicle and the selected feature data associated with the at least one image of the face of the previously enrolled user, wherein determining the match between the user of the vehicle and the previously enrolled user occurs responsive to determining that the similarity score satisfies a threshold similarity score.
- Example 9 The method of any of Examples 1-8, wherein the second feature data associated with the at least one image of the face of the previously enrolled user comprises a plurality feature data associated with the at least one image of the face of the previously enrolled user, the method further comprising: determining, by the mobile computing device, based on the first feature data associated with the at least one image of the face of the user of the vehicle and each feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user, a respective similarity score for each feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user, wherein each similarity score indicates a similarity between the first data associated with the at least one image of the face of the user of the vehicle and the respective feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user, wherein determining the match between the user of the vehicle and the previously enrolled user is based on the respective similarity score for the plurality of feature data associated with the
- Example 10 The method of Example 9, further comprising: determining, by the mobile computing device, based on the respective similarity scores, a highest ranked feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user, wherein determining the match between the user of the vehicle and the previously enrolled user is based on the highest ranked feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user.
- Example 11 The method of Example 9, further comprising: determining, by the mobile computing device, based on the similarity scores for two or more feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user, a composite similarity score, wherein determining the match between the user of the vehicle and the previously enrolled user is responsive to determining that the composite similarity score satisfies a threshold similarity score.
- Example 12 The method of Example 9, wherein determining the match between the user of the vehicle and the previously enrolled user is responsive to determining that the respective similarity score for each of two or more feature data associated with the at least one image of the face of the previously enrolled user satisfies a threshold similarity score.
- Example 13 The method of any of Examples 1-12, further comprising: determining, by the mobile computing device and during an enrollment phase, the second feature data of the at least one image of the face of the previously enrolled user using a machine learning model.
- Example 14 The method of any of Examples 1-13, wherein the mobile computing device comprises a wearable computing device.
- Example 15 A method comprising: establishing, by a vehicle computing system of a vehicle, a connection with a mobile computing device, wherein the vehicle computing system includes an infotainment head unit; determining, by the vehicle computing system, a presence of a user inside the vehicle; after determining the presence of the user inside the vehicle, capturing, by the vehicle computing system using an image capture device that is connected to at least a portion of the vehicle, at least one image of a face of the user; determining, by the vehicle computing system, first feature data associated with the at least one image of the face of the user; receiving, by the vehicle computing system, authentication data for the user, wherein the authentication data indicates a match between the user and a previously enrolled user based on a comparison between the first feature data and second feature data associated with at least one image of a face of the previously enrolled user; and accessing, by the vehicle computing system and based on the authentication data for the user, user account information to log the user into the infotainment head unit.
- Example 16 The method of Example 15, further comprising: sending, by the vehicle computing system and to the mobile computing device, the first feature data associated with the at least one image of the face of the user, wherein receiving the authentication data for the user comprises, after sending the first feature data, receiving, by the vehicle computing system and from the mobile computing device, the authentication data for the user.
- Example 17 The method of Example 15, further comprising: receiving, by the vehicle computing system and from the mobile computing device, the second feature data associated with the at least one image of the face of the previously enrolled user; comparing, by the vehicle computing system, the first feature data and the second feature data; and determining, by vehicle computing system based on the comparing, the match between the user and the previously enrolled user, wherein the authentication data comprises an indication of the match.
- Example 18 A system comprising: at least one processor; and at least one computer-readable storage device storing instructions that, when executed by the at least one processor, cause the at least one processor to perform the method of any of Examples 1-17.
- Example 19 A computer-readable storage medium storing instructions that, when executed by at least one processor, cause the at least one processor to perform the method of any of Examples 1-17.
- Computer-readable medium may include computer-readable storage media or mediums, which corresponds to a tangible medium such as data storage media, or communication media including any medium that facilitates transfer of a computer program from one place to another, e.g., according to a communication protocol.
- Computer-readable medium generally may correspond to (1) tangible computer-readable storage media, which is non-transitory or (2) a communication medium such as a signal or carrier wave.
- Data storage media may be any available media that can be accessed by one or more computers or one or more processors to retrieve instructions, code and/or data structures for implementation of the techniques described in this disclosure.
- a computer program product may include a computer-readable medium.
- such computer-readable storage media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage, or other magnetic storage devices, flash memory, or any other storage medium that can be used to store desired program code in the form of instructions or data structures and that can be accessed by a computer.
- any connection is properly termed a computer-readable medium.
- a computer-readable medium For example, if instructions are transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium.
- DSL digital subscriber line
- Disk and disc includes compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and Blu-ray disc, where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable medium.
- processors such as one or more digital signal processors (DSPs), general purpose microprocessors, application specific integrated circuits (ASICs), field programmable logic arrays (FPGAs), or other equivalent integrated or discrete logic circuitry.
- DSPs digital signal processors
- ASICs application specific integrated circuits
- FPGAs field programmable logic arrays
- processors may refer to any of the foregoing structure or any other structure suitable for implementation of the techniques described herein.
- the functionality described herein may be provided within dedicated hardware and/or software modules. Also, the techniques could be fully implemented in one or more circuits or logic elements.
- the techniques of this disclosure may be implemented in a wide variety of devices or apparatuses, including a wireless handset, an integrated circuit (IC) or a set of ICs (e.g., a chip set).
- IC integrated circuit
- a set of ICs e.g., a chip set.
- Various components, modules, or units are described in this disclosure to emphasize functional aspects of devices configured to perform the disclosed techniques, but do not necessarily require realization by different hardware units. Rather, as described above, various units may be combined in a hardware unit or provided by a collection of interoperative hardware units, including one or more processors as described above, in conjunction with suitable software and/or firmware.
Landscapes
- Engineering & Computer Science (AREA)
- Mechanical Engineering (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- General Physics & Mathematics (AREA)
- Computer Security & Cryptography (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Oral & Maxillofacial Surgery (AREA)
- General Engineering & Computer Science (AREA)
- Computer Hardware Design (AREA)
- Mathematical Physics (AREA)
- Automation & Control Theory (AREA)
- Transportation (AREA)
- Computing Systems (AREA)
- Biomedical Technology (AREA)
- Software Systems (AREA)
- Collating Specific Patterns (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
- Vehicles, such as automobiles, motorcycles, aircraft, and watercraft, may include one or more computing systems for performing functions and providing occupants of the vehicles with information, entertainment, assistance, and/or environmental control. For instance, an automobile may include an entertainment system for playing music, videos, or other content, a navigation system for providing information and navigational assistance, a temperature control system for heating or cooling the in-vehicle cabin, a control system for adjusting various components or features of the car, such as a sun roof or window shades, or an “infotainment system” that performs some or all of these aforesaid functions. Modern vehicles are equipped with an infotainment head unit (IHU) having a display device (e.g., presence-sensitive display) and compute engine, which is configured to execute an operating system and one or more applications.
- In general, the present application describes techniques for performing a seamless authentication of a driver of a vehicle using an in-vehicle camera and a trusted mobile computing device that is communicatively coupled to the vehicle. The trusted mobile computing device may initially perform an enrollment process to capture data (e.g., facial feature data associated with a captured image) of a face of a known user in a variety of different poses. The trusted device may assign each group of data of the known user to a respective pose bucket of a group of pose buckets. The trusted device enrolls the data by associating data of the known user with a user account for the known user. The trusted device may subsequently receive authentication data (e.g., computed test data, such as computed facial feature information) of an unknown user who is situated inside of the vehicle, where an authentication image is captured using the in-vehicle camera of the vehicle, and where computed data (e.g., feature data) is obtained from the image and used by the trusted device. The trusted device may authenticate the unknown user by comparing the authentication feature data for the unknown user to enrolled feature data of the known user. Based on the comparison, the trusted device sends a result of the authentication process to the vehicle, which may then perform login operations at the infotainment head unit of the vehicle that are customized for the authenticated user (e.g., based on that user's profile and/or account information). As a result, various of the described techniques enable a seamless and reliable authentication of a driver of a vehicle, while also storing enrollment feature data of authorized users only on the trusted device to protect the security of this user data. In some cases, the trusted mobile computing device may send the enrolled feature data to the vehicle computing system, which then may be configured to perform authentication of the unknown user in the vehicle by comparing the authentication feature data for the unknown user to the received enrolled feature data of the known user.
- In one example, a method includes establishing, by a mobile computing device, a connection with a vehicle computing system of a vehicle, and, after establishing the connection, receiving, by the mobile computing device and from the vehicle computing system, first feature data associated with at least one image of a face of a user of the vehicle, wherein the at least one image of the face of the user of the vehicle is captured by an image capture device connected to at least a portion of the vehicle. The example method further includes determining, by the mobile computing device and based on a comparison between the first feature data and second feature data associated with at least one image of a face of a previously enrolled user of the mobile computing device, a match between the user of the vehicle and the previously enrolled user, authenticating, by the mobile computing device and based on the match, the user of the vehicle and sending, by mobile computing device and to the vehicle computing system, authentication data for the user of the vehicle, wherein the authentication data is indicative of the match.
- In another example, a computer-readable storage medium stores instructions that, when executed, cause at least one processor to: establish a connection with a vehicle computing system of a vehicle; after establishing the connection, receive, from the vehicle computing system, first feature data associated with at least one image of a face of a user of the vehicle, wherein the at least one image of the face of the user of the vehicle is captured by an image capture device connected to at least a portion of the vehicle; determine, based on a comparison between the first feature data and second feature data associated with at least one image of a face of a previously enrolled user of the mobile computing device, a match between the user of the vehicle and the previously enrolled user; authenticate, based on the match, the user of the vehicle; and send, to the vehicle computing system, authentication data for the user of the vehicle, wherein the authentication data is indicative of the match.
- In another example, a mobile computing device includes at least one processor and at least one computer-readable storage device. The at least one computer-readable storage device store instructions that, when executed by the at least one processor, cause the at least one processor to: establish a connection with a vehicle computing system of a vehicle; after establishing the connection, receive, from the vehicle computing system, first feature data associated with at least one image of a face of a user of the vehicle, wherein the at least one image of the face of the user of the vehicle is captured by an image capture device connected to at least a portion of the vehicle; determine, based on a comparison between the first feature data and second feature data associated with at least one image of a face of a previously enrolled user of the mobile computing device, a match between the user of the vehicle and the previously enrolled user; authenticate, based on the match, the user of the vehicle; and send, to the vehicle computing system, authentication data for the user of the vehicle, wherein the authentication data is indicative of the match.
- In another example, a method includes establishing, by a vehicle computing system of a vehicle, a connection with a mobile computing device, wherein the vehicle computing system includes an infotainment head unit, determining, by the vehicle computing system, a presence of a user inside the vehicle, and, after determining the presence of the user inside the vehicle, capturing, by the vehicle computing system using an image capture device that is connected to at least a portion of the vehicle, at least one image of a face of the user. The example method further includes determining, by the vehicle computing system, first feature data associated with the at least one image of the face of the user, receiving, by the vehicle computing system, authentication data for the user, wherein the authentication data indicates a match between the user and a previously enrolled user based on a comparison between the first feature data and second feature data associated with at least one image of a face of the previously enrolled user, and accessing, by the vehicle computing system and based on the authentication data for the user, user account information to log the user into the infotainment head unit.
- In another example, a computer-readable storage medium stores instructions that, when executed, cause at least one processor to: establish a connection with a mobile computing device; determine a presence of a user inside a vehicle; after determining the presence of the user inside the vehicle, capture, using an image capture device that is connected to at least a portion of the vehicle, at least one image of a face of the user; determine first feature data associated with the at least one image of the face of the user; receive authentication data for the user, wherein the authentication data indicates a match between the user and a previously enrolled user based on a comparison between the first feature data and second feature data associated with at least one image of a face of the previously enrolled user; and access, based on the authentication data for the user, user account information to log the user into a infotainment head unit.
- In another example, a vehicle computing system includes at least one processor and at least one computer-readable storage device. The at least one computer-readable storage device store instructions that, when executed by the at least one processor, cause the at least one processor to: establish a connection with a mobile computing device, wherein the vehicle computing system includes an infotainment head unit; determine a presence of a user inside a vehicle; after determining the presence of the user inside the vehicle, capture, using an image capture device that is connected to at least a portion of the vehicle, at least one image of a face of the user; determine first feature data associated with the at least one image of the face of the user; receive authentication data for the user, wherein the authentication data indicates a match between the user and a previously enrolled user based on a comparison between the first feature data and second feature data associated with at least one image of a face of the previously enrolled user; and access, based on the authentication data for the user, user account information to log the user into the infotainment head unit.
- The details of one or more examples are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the disclosure will be apparent from the description and drawings, and from the claims.
-
FIG. 1 is a conceptual diagram illustrating a side view of an interior of a vehicle that includes an example vehicle computing system that is configured to communicate with a mobile computing device for authenticating a user of a vehicle, in accordance with one or more aspects of the present disclosure. -
FIG. 2 is a conceptual diagram illustrating further details of an interior of a vehicle, in accordance with one or more aspects of the present disclosure. -
FIG. 3 is a conceptual diagram illustrating an example computing device that performs facial enrollment operations, in accordance with one or more aspects of the present disclosure. -
FIG. 4 is a block diagram illustrating an example computing system, in accordance with one or more aspects of the present disclosure. -
FIG. 5 is a diagram illustrating an example driver onboarding and authentication process, in accordance with one or more aspects of the present disclosure. -
FIG. 6 is a diagram illustrating an example facial enrollment process, in accordance with one or more aspects of the present disclosure. -
FIG. 7 is a flowchart illustrating example operations performed by an example computing system, in accordance with one or more aspects of the present disclosure. -
FIG. 8 is a flowchart illustrating example operations performed by an example vehicle computing system, in accordance with one or more aspects of the present disclosure. - As noted above, modern vehicles are equipped with an infotainment head unit (IHU) having a display device (e.g., presence-sensitive display) and a compute engine, which is configured to execute an operating system and one or more applications. The IHU enables a user to have a rich, personalized experience while driving, and the applications provided by the IHU may enable users to listen to preferred music, browse emails, or pick favored and/or frequent destinations, to name only a few examples. The user's personalized experience may be achieved by having user-specific profiles stored in per-user accounts on the IHU.
- In some cases, a key fob or an external device (e.g., mobile phone), which is paired with the vehicle at the time an account was initially created, may identify a user and log them in to the IHU. However, the mere presence of key fob or external device does not necessarily achieve the authentication of a particular, known or previously identified user of the vehicle. In certain cases, the IHU may further prompt the user of a known, trusted device to enter a personal passcode or password on the user's device to enable explicit permission for login into the IHU. This approach, however, may potentially negate the user's expectation of a seamless, “get-in-and drive” experience.
- Modern automobiles support different levels of autonomous driving. Some automobiles use in-car cameras to monitor the driver's attentiveness, or to determine safe handoff to and from the autonomous mode. One approach is to leverage in-car cameras to provide face-based identification, similar to the face unlock feature on personal mobile devices. However, this face-based authentication process will typically involve an enrollment step, which may involve a user moving the user's face in a directed pattern in order for the camera to capture facial/biometric features of the user in different poses. Each pose may be associated with a distinct group of facial or other biometric features that are associated with that pose. Performing enrollment directly in a vehicle, using its own camera, may have certain challenges, as the user may not receive any visual or other directions from the vehicle for moving the user's head while at the same looking at the driver-facing camera.
- The present application describes techniques for performing a seamless authentication of a driver of a vehicle using an in-vehicle camera and a trusted mobile computing device that is communicatively coupled to the vehicle. The techniques may provide a reliable authentication mechanism and a seamless user experience, while also ensuring user privacy of facial image information. The trusted mobile computing device may initially perform an enrollment process to capture image data (e.g., computed data such as facial feature data) of a face of a known user in a variety of different poses. As used herein, the term “image data” refers to data (e.g., feature data) that is computed or otherwise determined from a captured image of, e.g., a face of a user. The trusted device may assign each group of image data of the known user to a respective pose bucket of a group of pose buckets. The trusted device enrolls the image data by associating the image data of the known user with a user account for the known user. The trusted device may subsequently receive authentication image data (e.g., test image data, such as computed facial feature data associated with an image) of an unknown user who is situated inside of the vehicle, where the authentication image is captured using the in-vehicle camera of the vehicle, and where computed image data (e.g., feature data) is obtained from the image and used by the trusted device. The trusted device may authenticate the unknown user by comparing the authentication feature data for the unknown user to enrolled feature data of the known user. Based on the comparison, the trusted device sends a result of the authentication process to the vehicle, which may then perform login operations at the IHU that are customized for the authenticated user (e.g., based on that user's profile and/or account information). In some cases, the trusted mobile computing device may send the enrolled feature data to the vehicle computing system, which then may be configured to perform authentication of the unknown user in the vehicle by comparing the authentication feature data for the unknown user to the received enrolled feature data of the known user.
- In some examples, the computing device may determine a pose bucket associated with authentication feature data of the unknown user's face, select feature data of the known user that is included in the same pose bucket as the pose bucket associated with the authentication feature data, and compare the selected feature data to the authentication feature data to determine whether the unknown user is the known user. As another example, the computing device may compare the authentication feature data to each of the groups of enrolled feature data to determine which of the enrolled feature data of the known user are most similar to the authentication feature data for the unknown user. The computing device may determine whether the unknown user is the known user based on the most similar enrolled feature data of the known user, regardless of the pose buckets.
- By enrolling feature data included in several different pose buckets, the trusted device may more accurately perform facial recognition to authenticate an unknown user, such as the current user of a vehicle. For instance, enrolling feature data of a known user that are included in several pose buckets may increase the probability that the pose bucket associated with the authentication feature data (e.g., the pose of the unknown user's face) is similar to the pose buckets that include the enrolled feature data of the known user (e.g., the pose of the known user in the enrolled feature data). Increasing the probability that the pose of the authentication feature data of the unknown user is similar to the pose of one or more enrolled feature data of a known user may reduce the probability of falsely rejecting the unknown user when the unknown user is in fact a known, authorized user. In some instances, increasing the probability that the pose of the authentication feature data of the unknown user is similar to the pose of one or more enrolled feature data of a known user may reduce the probability of falsely accepting the unknown user when the unknown user is not a known, authorized user. In this way, the computing device may more accurately authenticate feature data of unknown users regardless of the pose of the unknown user.
-
FIG. 1 is a conceptual diagram illustrating a side view of an interior of a vehicle (e.g., automobile), which includes an examplevehicle computing system 100.FIG. 1 shows a cross-sectional view of a vehicle interior in addition to components ofvehicle computing system 100.Vehicle computing system 100 is configured to detect and process user input. - The vehicle illustrated in
FIG. 1 may be an automobile, but aspects of the present disclosure may also be applicable to other types of vehicles, including trucks, motorcycles, aircraft, watercraft, trains, or other vehicles. InFIG. 1 , adriver 150 may normally occupyseat 152.Seat 152 of the automobile may be positioned directly behindsteering wheel 154 of the vehicle such that an occupant ofseat 152 may physically controlsteering wheel 154. Theseat 152 is positioned within the vehicle illustrated inFIG. 1 underroof 158.Steering wheel 154 may protrude fromdashboard 156. At least one front passenger seat may be laterally positioned adjacent toseat 152. Other passenger seats may be positioned behindseat 152 or in front ofseat 152. - Also shown in
FIG. 1 is a collection of devices, components, and modules that may each be included invehicle computing system 100.Vehicle computing system 100 may include, but is not limited to, presence-sensitive panel 102 andcamera 104, as well asdisplay 112 andcontrol unit 106. One or more components ofvehicle computing system 100, such as presence-sensitive panel 102 andcamera 104, may be directly and physically accessible to occupants seated in the front driver and front passenger seats of the automobile, and may be located within, near, or oncenter console 101. Such components may be within easy reach of such occupants, and may also or alternatively be positioned in another passenger area of the vehicle, such as a back seat. In some examples, a component may be within easy reach if a vehicle occupant does not need to change positions in his or her seat in order to reach the component with an outstretched arm. Stated another way, for many drivers, for example, the usual positions of the steering wheel, stick shift, and center console may be considered within easy reach of the driver. As further described below, presence-sensitive panel 102 andcamera 104 may function as input devices forvehicle computing system 100. In some examples, one or more components ofvehicle computing system 100 that might not necessarily require physical access by occupants of the vehicle (such as, in some examples,display 112 and control unit 106), may be positioned in or on or integrated intodashboard 156. Such components may be integrated as part of an automobile dashboard and/or console facing or near the occupants of the vehicle. As further described in this disclosure,vehicle computing system 100 may includedisplay 112 that may output a graphical user interface. In some cases, additional cameras may be provided within the vehicle. For example, as shown inFIG. 1 ,steering wheel 154 may include a user-facingcamera 111. In some cases, additional user-facing cameras may be positioned on other elements or components of the vehicle, such as ondashboard 156,roof 158,console 101 orpanel 102, and/ordisplay 112. In some cases, additional cameras may be positioned on a rear-view mirror or the windshield of the vehicle. In general, the in-car cameras (e.g., 104/111) may be mounted or otherwise connected to one or more portions or components of the vehicle. - Seated on
seat 152 isuser 150.User 150 may be a driver, butuser 150 could also be a passenger or other vehicle occupant. Although inFIG. 1 user 150 is shown in a position that may often be considered a front seat (characterized, e.g., bysteering wheel 154 and dashboard 156),user 150 may be seated in another location within the vehicle, including a back seat. - In the example of
FIG. 1 ,user 150 may navigate or operate the vehicle, may interact with one or more components of the vehicle, and/or may provide input at input devices or presence-sensitive panel 102 orcamera 104. InFIG. 1 ,user 150 is shown interacting with presence-sensitive panel 102. - Presence-
sensitive panel 102 may detect one or more taps, gestures, and/or other user inputs at locations of presence-sensitive panel 102. Such taps, gestures, or other inputs may be from one or more fingers ofuser 150, or may be from a stylus or other device used byuser 150. Such input may be on the surface of presence-sensitive panel 102, or within a threshold distance of the surface of presence-sensitive panel 102. In the illustration ofFIG. 1 , the threshold distance may extend above presence-sensitive panel 102, towardsroof 158. - In response to detecting the one or more inputs at locations of presence-
sensitive panel 102, presence-sensitive panel 102 may output toUI module 108 an indication of input detected by presence-sensitive panel 102. In some examples,UI module 108 may determine, based on the indication of input, information about the input. Such information may, for example, indicate one or more lines, characters, or shapes corresponding to the input.UI module 108 may output to one ormore application modules 110 information about the input. In response to the information about the input, one ormore application modules 110 may determine an operation corresponding to the input and/or perform an operation. In some examples, and in response to the information about the input, one ormore application modules 110 may output to display 112 information about the input, the operation, or an operation to be performed. - As described and illustrated, some or all of
vehicle computing system 100 may be housed withindashboard 156, which may in some examples be constructed of plastic, vinyl, rubber, aluminum, steel, or any other suitable material.Control unit 106 may include at least one processor and/or at least one storage device, and may be housed withinhousing 105, which may also be constructed of plastic, vinyl, rubber, aluminum, steel, or any other suitable material. In some examples,housing 105 may also be a rigid case that encloses and otherwise protects one or more electrical components that provide functionality forvehicle computing system 100. In some examples,housing 105 may be affixed, mounted or otherwise integrated with the automobile dashboard or console. -
Control unit 106 may provide an operating environment or platform for one or one more modules, such as a combination of hardware, firmware, and software, as further illustrated inFIG. 4 . For instance,control unit 106 may include one or more processors and storage devices that may execute instructions and store data of one or more modules.Control unit 106 may also be operably coupled to one or more other software and/or hardware components, including presence-sensitive panel 102,camera 104, and display 112 to control, configure, and/or communicate information with the components, to name only a few example operations. -
Display 112 may function as an output device, such as a display device, using any one or more of a liquid crystal display (LCD), dot matrix display, light emitting diode (LED) display, organic light-emitting diode (OLED) display, e-ink, or similar monochrome or color display capable of outputting visible information to a user or vehicle occupant. In some examples,display 112 may also function as an input device, so that it serves as both an input and output device. In such examples,display 112 may include an integrated presence-sensitive input device and a display device. For instance,display 112 may function as a presence-sensitive input device using a presence-sensitive screen, such as a resistive touchscreen, a surface acoustic wave touchscreen, a capacitive touchscreen, a projective capacitance touchscreen, a pressure-sensitive screen, an acoustic pulse recognition touchscreen, or another presence-sensitive screen technology. Based on user input,display 112 may present output to a user. For instance,display 112 may present various user interfaces of applications (e.g., a navigation application) executing atvehicle computing system 100. An occupant of the vehicle, such as a driver, may provide user input to interact with one or more of such applications. -
Vehicle computing system 100 may operate to assist, inform, entertain, or perform other tasks that require user interactions with occupants of a vehicle.Vehicle computing system 100 may, in some examples, be referred to as an infotainment head unit (IHU), an infotainment system, or a subcomponent thereof. For example,vehicle computing system 100 may include one ormore application modules 110 that perform functions or process information, on behalf of one or more occupants of the vehicle. For instance,vehicle computing system 100 may provide a navigation service that provides directions to destinations.Vehicle computing system 100 may also provide an information retrieval service that provides information in response to queries and/or as preemptive assistance or recommendations.Vehicle computing system 100 may also provide vehicle data about the vehicle, or multimedia such as audio or video. Mentioned are only a few examples of the functionality that may be provided byvehicle computing system 100, andvehicle computing system 100 may provide many additional capabilities. In this and other ways,vehicle computing system 100 may improve the driving or riding experience for one or more occupants of the vehicle. - In some examples,
vehicle computing system 100 may be controlled through input detected by presence-sensitive panel 102, through input detected bycamera 104, and/or through input detected by a combination of presence-sensitive panel 102 andcamera 104.Vehicle computing system 100 may also be controlled through input detected by one or more additional input devices (e.g., microphones, physical buttons or switches, or other types of input devices). - Presence-
sensitive panel 102 may, in some examples, function simply as an input device for touch input, provided by user input that may occur directly and physically at presence-sensitive panel 102. For instance, presence-sensitive panel 102 may function as a multi-touch presence-sensitive input device using a presence-sensitive device, such as a resistive touchscreen or touch panel, a surface acoustic wave touchscreen or touch panel, a capacitive touchscreen or touch panel, a projective capacitance touchscreen or touch panel, a pressure-sensitive screen or touch panel, an acoustic pulse recognition touchscreen or touch panel, or another presence-sensitive screen or touch panel technology. In some examples, presence-sensitive panel 102 may detect an object at and/or near, or within range of the presence-sensitive component(s) associated with presence-sensitive panel 102. As one example range, presence-sensitive panel 102 may detect an object, such as a finger or stylus that is within 2 cm or less of presence-sensitive panel 102. Presence-sensitive panel 102 may determine a location (e.g., an (x,y) coordinate) of the presence-sensitive input device at which the object was detected. In another example range, presence-sensitive panel 102 may detect an object 6 inches or less from presence-sensitive panel 102; other ranges are also possible. Presence-sensitive panel 102 may detect a user's finger, stylus, or similar using capacitive, inductive, and/or optical recognition techniques. - In the example illustrated in
FIG. 1 , presence-sensitive panel 102 may be positioned incenter console 101 abovecamera 104, andcenter console 101 may be transparent tocamera 104 so thatcamera 104 may capture images directly above presence-sensitive panel 102 even though presence-sensitive panel 102 physically obscures the lens or field-of-view ofcamera 104. For example,camera 104 may be an infrared camera that captures images by receiving infrared light and presence-sensitive panel 102 may be transparent to infrared light such thatcamera 104 is able to receive the infrared light originating between theroof 158 and presence-sensitive panel 102. In other examples,camera 104 might not be positioned directly under presence-sensitive panel 102, andcamera 104 may be positioned elsewhere within the vehicle. For example, as shown inFIG. 1 ,steering wheel 154 may include a user-facingcamera 111. In some cases, additional user-facing cameras may be positioned on other elements or components of the vehicle, such as ondashboard 156,roof 158,console 101 orpanel 102, and/ordisplay 112. - In some examples, presence-
sensitive panel 102 may function as both an input device and as an output device. In such examples, presence-sensitive panel 102 may include an integrated presence-sensitive input device and a display device, and could be any one or more of a liquid crystal display (LCD), dot matrix display, light emitting diode (LED) display, organic light-emitting diode (OLED) display, e-ink, or similar monochrome or color display capable of outputting visible information to a user or vehicle occupant. In other examples where presence-sensitive panel 102 includes both input device and output device functionality, presence-sensitive panel 102 may be implemented by two separate components: a presence-sensitive input device for receiving input and a display device for providing output. In examples where presence-sensitive panel 102 includes both input device and output device functionality, presence-sensitive panel 102 may still be positioned incenter console 101 abovecamera 104, andcenter console 101 may still be transparent tocamera 104 so thatcamera 104 may capture images directly above presence-sensitive panel 102, even if positioned under presence-sensitive panel 102. -
Camera 104 and/orcamera 111 may be one or more of any appropriate type of image acquisition or capture device, such as a camera or charge-coupled device. In some examples,camera 104 may be one or more infrared cameras with a high field-of-view and shallow depth of focus, and may be a backlit infrared camera oriented to point generally upward within the vehicle, having a particular field-of-view. In other examples,camera 104 may be or may further include one or more other types of cameras or image sensors, which may include one or more other infrared cameras, thermographic cameras, thermal imaging cameras, light-sensitive cameras, range sensors, tomography devices, radar devices, red-green-blue (RGB) cameras, or ultrasonic cameras. In some examples,camera 104 may be any image capture device appropriate for application of computer vision techniques. Depending on the type of sensors or cameras used, the resulting image may include two-dimensional images, three-dimensional volumes, or an image sequence. Pixel values typically correspond to light intensity in one or more spectral bands, but might also be related to various physical measures, such as depth, absorption or reflectance of sonic or electromagnetic waves, or nuclear magnetic resonance. -
Camera 104 may be configured to capture movements of an occupant of the vehicle, such as a driver, as the occupant moves an arm, wrist, hand, stylus, and/or fingers as he or she gestures in, for example, a field-of-view, and may be configured to capture images of the face ofuser 150. - As described above,
vehicle computing system 100 may include user interface (UI)module 108 andapplication modules 110.UI module 108 andapplication modules 110 may perform operations described herein using software, hardware, firmware, or a mixture of both hardware, software, and firmware residing in and executing byvehicle computing system 100 or at one or more other remote computing devices. As such,UI module 108 andapplication modules 110 may be implemented as hardware, software, and/or a combination of hardware and software.Vehicle computing system 100 may executeUI module 108,application modules 110, or one or more other modules as or within a virtual machine executing on underlying hardware.UI module 108 andapplication modules 110 may be implemented in various ways. For example,UI module 108 andapplication modules 110 may be implemented as a downloadable or pre-installed application or “app.” In another example,UI module 108 andapplication modules 110 may be implemented as part of an operating system ofvehicle computing system 100. -
Application modules 110 may include functionality to perform any variety of operations onvehicle computing system 100. For instance,application modules 110 may include a navigation application, weather application, a phone dialer application, an information retrieval application, a multimedia application, a vehicle information application, an email application, a text messing application, instant messaging application, social networking application, weather application, stock market application, emergency alert application, sports application, to name only a few examples. In general,vehicle computing system 100, whether throughapplication modules 110 or otherwise, may be configured to perform operations including those relating to climate control systems (e.g., heating and air conditioning), audio or infotainment systems, seat, window, sunshade, or windshield wipers, cruise control, in-cabin display system, steering wheel controls, headrest, arm rest, side or rear view mirrors, collision sensors. Such operations may be controlled by one ormore application modules 110, or may be controlled by other systems within the vehicle. In some examples, such operations may be limited to non-safety features of the vehicle. In other examples, such operations may encompass one or more features of the vehicle that may be considered safety-related (e.g., turning on a turn-signal, adjusting a mirror, adjusting or fastening/disconnecting a seat belt, adjusting cruise control features, accelerating, braking). - Although shown as operable within
control unit 106 ofvehicle computing system 100, one or more ofapplication modules 110 may be operable by a remote computing device (e.g., mobile computing device 170) that is communicatively coupled tovehicle computing system 100. In such examples, an application module executing at a remote computing device may cause the remote computing device to send the content and intent information using any suitable form of data communication (e.g., wired or wireless network, short-range wireless communication such as Near Field Communication or BLUETOOTH, etc.). In some examples, a remote computing device may be a computing device that is separate from a computing device included invehicle computing system 100. For instance, the remote computing device may be operatively coupled tovehicle computing system 100 by a network. An example of a remote computing device may include, but is not limited to a server, smartphone, tablet computing device, smart watch, and desktop computer. In some examples, a remote computing device may or may not be an integrated component ofvehicle computing system 100. As shown inFIG. 1 , one such example remote device ismobile computing device 170, which may include adisplay device 172 and one or moreimage capture devices 173, and which may execute one ormore applications 174. Examples ofmobile computing device 170 may include, but are not limited to, a mobile phone, a tablet computer, a personal digital assistant (PDA), a laptop computer, a portable gaming device, a portable media player, an e-book reader, a wearable device (e.g., a watch, a wrist-mounted computing device, a head-mounted computing device), or other type of mobile computing device.Mobile computing device 170 may be or include one or more processors.FIG. 1 further illustrates a mobile, wearable device 107 (e.g., smartwatch), which is worn byuser 150, and which may be communicatively coupled (e.g., via one or more wireless connections) tomobile computing device 170 and/orsystem 100.IHU system 100 may communicate withwearable device 107 and/ormobile computing device 170 using a wireless communication protocol (e.g., BLUETOOTH, WIFI, BLUETOOTH Low Energy (BLE)). Whenmobile computing device 170 and/orwearable device 107 is paired withIHU system 100, this device may be recognized as a trusted device with respect toIHU system 100, and is assigned a unique identifier byIHU system 100. This trusted device, and its corresponding unique identifier, are associated byIHU system 100 withuser 150 of the vehicle, and any profile and/or account information foruser 150. -
UI module 108 ofvehicle computing system 100 may receive from presence-sensitive panel 102 one or more indications of user input detected at presence-sensitive panel 102. Generally, each time presence-sensitive panel 102 detects user input at a particular location of presence-sensitive panel 102,UI module 108 may receive an indication of user input or information about the user input from presence-sensitive panel 102.UI module 108 may assemble the information received from presence-sensitive panel 102 into a set of one or more events, such as a sequence of one or more touch events or gesture events. Each gesture event in the sequence may include data or components that represent parameters (e.g., when, where, originating direction) characterizing a presence and/or movement of input at presence-sensitive panel 102. Each gesture event in the sequence may include a location component corresponding to a location of presence-sensitive panel 102, a time component related to when presence-sensitive panel 102 detected user input at the location, and/or an action component related to whether the gesture event corresponds to a lift up or a push down at the location. -
UI module 108 may determine one or more characteristics of the user input based on the sequence of gesture events and include information about these one or more characteristics within each gesture event in the sequence of gesture events. For example,UI module 108 may determine a start location of the user input, an end location of the user input, a density of a portion of the user input, a speed of a portion of the user input, a direction of a portion of the user input, and a curvature of a portion of the user input.UI module 108 may transmit indications of user input from presence-sensitive panel 102 to other modules, such asapplication modules 110.UI module 108 may determine one or more single- or multi-touch gestures provided by a user.UI module 108 may also act as an intermediary between various components ofvehicle computing system 100 to make determinations based on input detected by presence-sensitive panel 102 and generate output presented bydisplay 112. For instance,UI module 108 may receive data from one ormore application modules 110 andcause display 112 to output content, such as a graphical user interface, for display. -
UI module 108 ofvehicle computing system 100 may also receive fromcamera 104 one or more indications of user input detected bycamera 104. Generally, eachtime camera 104 detects a user gesture or movement,UI module 108 may receive an indication of user input or information about the user input fromcamera 104.UI module 108 may assemble the information received fromcamera 104 into a set of one or more events, such as a sequence of movements or gesture events. Each gesture event in the sequence may include data or components that represents parameters (e.g., when, where in three dimensional space, originating direction, direction in three dimensional space, hand or arm orientation or posture) characterizing a presence, gesture, and/or movement captured bycamera 104 within a field-of-view. Each gesture event in the sequence may include a location component corresponding to a three-dimensional location within a field-of-view, a time component related to whencamera 104 detected user input within the three-dimensional space, an action component related to what type of gesture was made, and/or one or more images captured bycamera 104. - Further, in some examples, the arrangement and/or placement of presence-
sensitive panel 102 andcamera 104 within the vehicle may provide an ergonomic and comfortable way for a driver (or other vehicle occupant) to interact withvehicle computing system 100. While presence-sensitive panel 102 andcamera 104 may detect different types of input, the positioning of presence-sensitive panel 102 andcamera 104 in accordance with one or more aspects of this disclosure may be such that input detected by presence-sensitive panel 102 may be perceived by a vehicle occupant to be a natural extension of input detected bycamera 104. Similarly, input detected bycamera 104 may be perceived by a vehicle occupant to be a natural extension of input detected by presence-sensitive panel 102. In other words, such a system may provide a particularly natural or easy user interface for a vehicle occupant to use. In some cases, a vehicle occupant may find interacting withvehicle computing system 100 to be relatively instinctive. - As described in further below, according to various examples, a mobile computing device (e.g.,
mobile computing device 170, wearable device 107) may establish a connection withvehicle computing system 100 of the vehicle illustrated inFIG. 1 . After establishing the connection, the mobile computing device may receive, fromvehicle computing system 100, first feature data associated with at least one image of a face ofuser 150 of the vehicle. The at least one image of the face ofuser 150 of the vehicle is captured by an image capture device (e.g.,camera 104/111) connected to at least a portion of the vehicle. The mobile computing device may determine, based on a comparison between the first feature data and second feature data associated with at least one image of a face of a previously enrolled user of the mobile computing device, a match between the user of the vehicle and the previously enrolled user, as described further below in reference toFIG. 3 . The mobile computing device authenticates, based on the match,user 150 of the vehicle. The mobile computing device the sends, tovehicle computing system 100, authentication data foruser 150 of the vehicle, where the authentication data is indicative of the match. - According to various examples,
vehicle computing system 100 may establish a connection with a mobile computing device (e.g.,mobile computing device 170, wearable device 107), where the vehicle computing system includes an infotainment head unit, and determine a presence of a user inside the vehicle. After determining the presence of the user inside the vehicle,vehicle computing system 100 may capture, using an image capture device (e.g.,camera 104/111) that is connected to at least a portion of the vehicle, at least one image of a face of the user, and determine first feature data associated with the at least one image of the face of the user.Vehicle computing system 100 may receive authentication data for the user, where the authentication data indicates a match between the user and a previously enrolled user based on a comparison between the first feature data and second feature data associated with at least one image of a face of the previously enrolled user.Vehicle computing system 100 may then access, based on the authentication data for the user, user account information to log the user into the infotainment head unit. - In some cases,
vehicle computing system 100 may send, to the mobile computing device, the first feature data associated with the at least one image of the face of the user. After sending the first feature data,vehicle computing system 100 receives, from the mobile computing device, the authentication data for the user. - In other alternate cases, the trusted mobile computing device may send the enrolled feature data to
vehicle computing system 100, which then may be configured to perform authentication of the unknown user in the vehicle by comparing the authentication feature data of the unknown user to the received enrolled feature data of a known user. In these cases,vehicle computing system 100 receives, from the mobile computing device, the second feature data associated with the at least one image of the face of the previously enrolled user, and compares the first feature data and the second feature data.Vehicle computing system 100 then determines, based on the comparing, the match between the user and the previously enrolled user, where the authentication data comprises an indication of the match. - Throughout the disclosure, examples are described where a computing device and/or a computing system analyzes information (e.g., facial image information, etc.) associated with a computing device and a user of a computing device, only if the computing device receives permission from the user of the computing device to analyze the information. For example, in situations discussed below, before a computing device or computing system can collect or may make use of information associated with a user, the user may be provided with an opportunity to provide input to control whether programs or features of the computing device and/or computing system can collect and make use of user information (e.g., information about a user's current location, current speed, etc.), or to dictate whether and/or how to the device and/or system may receive content that may be relevant to the user. In addition, certain data may be treated in one or more ways before it is stored or used by the computing device and/or computing system, so that personally identifiable information is removed. For example, a user's identity may be treated so that no personally identifiable information can be determined about the user, or a user's geographic location may be generalized where location information is obtained (such as to a city, ZIP code, or state level), so that a particular location of a user cannot be determined. Thus, the user may have control over how information is collected about the user and used by the computing device and computing system.
-
FIG. 2 is a conceptual diagram illustrating further details of an interior of a vehicle, in accordance with one or more aspects of the present disclosure. InFIG. 2 , a portion ofsteering wheel 154 fromFIG. 1 is shown, and a finger ofuser 150 is interacting withpanel 102 ofconsole 101. According to the techniques of the present disclosure,camera 104 has acquired facial images ofuser 150, and provided information associated with these images tomobile computing device 170.Mobile computing device 170 has authenticateduser 150 based on a comparison of received facial feature data with stored facial feature data of authorized, known users stored bymobile computing device 170, to identify a match with the stored facial feature data ofuser 150.Mobile computing device 170 sends authentication data foruser 150 tosystem 100.System 100 may send feature data tomobile computing device 170 that comprises actual image information or, in various examples, comprises feature data (e.g., feature vector data) for various features of the images ofuser 150 that are captured byimage capture device 173. - In some examples, this authentication data may comprise a unique identifier associated with
user 150 and/ormobile computing device 170.System 100 may use this received authentication data to access the user profile, user preference, login credentials, and/or account data foruser 150 that is stored in system 100 (e.g., for logginguser 150 into IHU system 100).System 100 may then display personalized or customized data display 112 ofsystem 100 foruser 150 based upon the accessed information (e.g., preferences) that are particular touser 150. This personalized information may includeuser interface elements Cursor 206 may overlap one ofuser interface elements 204. When cursor 206 is over one ofuser interface elements 204, presence-sensitive panel 102 may detect one or more taps or inputs at presence-sensitive panel 102, andsystem 100 may determine, based on this input, that the input corresponds to selection of theuser interface elements 204 overlapped bycursor 206. - In some examples,
display 112 may be a presence-sensitive panel that operates both as an input device and an output device. In such an example,display 112 may detect one or more inputs at or near a location ondisplay 112 wheredisplay 112 presentsuser interface element 204.System 100 may identify, based on the input, a selecteduser interface element 204 corresponding to the input. In response to the input selectinguser interface element 204,system 100 may perform an operation, which may include displaying information or updating a graphical user interface atdisplay 112. In some examples where presence-sensitive panel 102 also acts as a display, computing device 200 may additionally or alternatively display information at presence-sensitive panel 102 or update a graphical user interface displayed at presence-sensitive panel 102. - Display 112 of
IHU system 100 may be configured to display personalized or customized information for any particular user that is authenticated bymobile computing device 170 that is communicatively coupled tosystem 100. Thus, if various different authorized users, includinguser 150, use the vehicle,display 112 may be configured to display different customized information for each user, depending on the setup of the user profile or account for each user that is authenticated bymobile computing device 170. In some cases,system 100 may also customize various other settings of vehicle that are associated with operation of the vehicle and/or IHU system 100 (e.g., temperature settings, radio settings, vehicle environment settings, etc.). -
FIG. 3 is a conceptual diagram illustrating anexample computing device 310 that performs facial enrollment operations, in accordance with one or more aspects of the present disclosure.Computing device 310 may be one example ofmobile computing device 170 shown inFIG. 1 that is configured to save enrolled image data of authorized users. As used herein, the term “image data” refers to data (e.g., feature data) that is computed or otherwise determined from a captured image of, e.g., a face of a user. -
Computing device 310 may be a mobile device, such as a smart phone, a tablet computer, a laptop computer, computerized watch, computerized eyewear, computerized gloves, or any other type of portable computing device. Additional examples ofcomputing device 310 include other mobile and non-mobile devices, such as desktop computers, televisions, personal digital assistants (PDA), portable and non-portable gaming systems, digital media players or micro-consoles, e-book readers, mobile television platforms, automobile navigation and entertainment systems, or any other types of wearable and non-wearable, mobile or non-mobile computing devices. - As shown in
FIG. 3 ,computing device 310 includes a presence-sensitive display (PSD) 312, one or moreimage capture devices 314, facial recognition module (FRM) 322, and enrolledinformation 328.Enrolled information 328 may comprise a data store.FRM 322 may perform operations described using software, hardware, firmware, or a mixture of hardware, software, and firmware residing in and/or executing atcomputing device 310.Computing device 310 may executeFRM 322 with multiple processors or multiple devices.Computing device 310 may executeFRM 322 as a virtual machine executing on underlying hardware.FRM 322 may execute as one or more services of an operating system or computing platform.FRM 322 may execute as one or more executable programs at an application layer of a computing platform. -
PSD 312 ofcomputing device 310 may function as respective input and/or output devices forcomputing device 310.PSD 312 may be implemented using various technologies. For instance,PSD 312 may function as input devices using presence-sensitive input screens, such as resistive touchscreens, surface acoustic wave touchscreens, capacitive touchscreens, projective capacitance touchscreens, pressure sensitive screens, acoustic pulse recognition touchscreens, or another presence-sensitive display technology.PSD 312 may detect input from a user ofcomputing device 310. For example,PSD 312 may detect one or more gestures performed on or within a threshold distance of PSD 312 (e.g., auser touching PSD 312 with a finger or a stylus or moving the finger or stylus within a threshold distance of a surface of PSD 312). -
PSD 312 may also function as output (e.g., display) devices using any one or more display devices, such as liquid crystal displays (LCD), dot matrix displays, light emitting diode (LED) displays, organic light-emitting diode (OLED) displays, e-ink, or similar monochrome or color displays capable of outputting visible information to a user ofcomputing device 310.PSD 312 may output information (e.g., to a user) as a user interface (e.g., a graphical user interface), which may be associated with functionality provided bycomputing device 310. For example,PSD 312 may display various user interfaces related to an application module or other features of computing platforms, operating systems, applications, and/or services executing at or accessible fromcomputing device 310. -
Image capture devices 314 may be one example of image capture devices 173 (FIG. 1 ) and may include one or more cameras, such as digital cameras, still cameras, motion picture cameras, and the like.Image capture devices 314 may include any other devices capable of capturing and storing still or moving images. In some examples,image capture devices 314 may be capable of digitally recording images via an electronic image sensor.Image capture devices 314 may be configured to generate data indicative of an image in response to detecting visible light (e.g., light visible by humans, with wavelengths between approximately 380 nanometers to approximately 740 nanometers) or near infrared (NIR) light (e.g., light adjacent to the visible light spectrum, such as light with wavelengths between approximately 750 nanometers and approximately 3400 nanometers). In some examples,computing device 310 includes animage capture device 314 configured to generate data indicative of 2-dimensional images. In another example,computing device 310 includes a plurality ofimage capture device 314 configured to generate data indicative of 3-dimensional images. In this way, a plurality ofimage capture devices 314 may generate data indicative of 2-dimensional images, 3-dimensional images, or both. - In accordance with techniques of this disclosure,
FRM 322 may perform facial recognition to authenticate a user ofcomputing device 310. In general,FRM 322 may perform an enrollment process (e.g., one time, such as during an initial setup of computing device 310) and periodically execute an authentication process to determine whether an unknown user is, in fact, a known user. During the enrollment process,image capture devices 314 may determine one ormore feature data 330A-330H (collectively, “feature data 330” or, more generally, “image data 330”) of a known user 338 (e.g., a user logged in to a user account associated) andPSD 312 may output a graphical user interface (GUI) that includes one ormore images 333 of the knownuser 338.Known user 338 may also be referred to as an enrolled user. In some examples, each offeature data 330 may be associated with a particular pose of user 338 (e.g.,feature data 330A for a “pose A” ofuser 338,feature data 330B for a “pose B” of user 338).Known user 338 may be able to viewimages 333 within the GUI as data is captured by image capturesdevices 314, and may adjust his or her head and/orcomputing device 310 to enableimage capture devices 314 to captureimages 333 of the face of knownuser 338 in a variety of different poses.Image capture devices 314 may output data associated withimages 333 toFRM 322. As one example,user 338 may faceimage capture devices 314 and press a button (e.g., a physical button or a graphical button displayed by PSD 312) to cause image capturesdevices 314 to captureimages 333. As another example,image capture devices 314 may automatically captureimages 333 in response to the user facingimage capture devices 314. -
FRM 322 analyzes the data received fromimage capture devices 314 and assigns each of computedfeature data 330 to one or more pose buckets 332AA-332EE (collectively, pose buckets 332).Feature data 330 comprises features or characteristics ofimages 333. Each pose bucket 332 is associated with a range of pitch angles (also referred as tilt angle) and yaw angles (also referred to as a pan angle) of the user's face. As used herein, the pitch angle refers to an angle of the user's face relative to a horizontal axis and the yaw angle refers to an angle of the user's face relative to a vertical axis that is perpendicular to the horizontal axis. For example, each of pose buckets 332 may be associated with a respective yaw and pitch range. In the example ofFIG. 3 , the size of each of pose buckets 332 is equal (e.g., 10 degrees). For example, each of pose buckets 332 is associated with a 10-degree range of pitch angles and a 10-degree range of yaw angles. However, in some examples, the size of pose buckets 332 may be different. For example, pose bucket may associated with an 8-degree range of pitch angles (and/or range of yaw angles) and another pose bucket may be associated with a 10-degree range of pitch angles (and/or range of yaw angles). - For purposes of illustration, pose buckets 332 are shown in
FIG. 3 as part of a table 331. As illustrated in the example ofFIG. 3 , the yaw and pitch angles shown in table 331 represent the center of each respective pose bucket 332. For example, the center of pose bucket 332AA is −20 degrees in yaw and 20 degrees in pitch. In other words, the pose bucket 332AA may represent −15 to −25 degrees in yaw, and 15 to 25 degrees in pitch. Similarly, in the example ofFIG. 3 , the center of pose bucket 332AN is 20 degrees in yaw and 20 degrees in pitch, such that pose bucket 332AN represents 15 degrees to 25 degrees in yaw and 15 degrees to 25 degrees in pitch. While table 331 includes 25 pose buckets 332, in some examples, table 331 may include a different number of pose buckets 332. While pose buckets 332 are illustrated as part of a table 331 to aid understanding, pose buckets 332 may not be stored in a table. Pose buckets 332 may be stored in any data structure and may be organized in any manner. -
FRM 322 may determine which of pose buckets 332 is associated with feature data based on characteristics or landmarks of the face of knownuser 338 included in the data forimages 333. For instance,FRM 322 may detect landmarks inimages 333 of the user's face, such as the user's eyes, nose, and mouth, and may determine the yaw and pitch angles of the user's face based on the landmarks. For example,FRM 322 may determine thatfeature data 330A, which is associated with a particular pose (e.g., “POSE A”), should be included in pose bucket 332CC based on the yaw and pitch angles of the user's face infeature data 330A, and the range yaw and pitch angles associated with pose bucket 332CC. For instance,FRM 322 may determine that the yaw angle of the user's face infeature data 330A is 0 degrees and the pitch angle of the known user's face is 0 degrees.FRM 322 may determine that pose bucket 332CC is associated with a range of yaw angles from −5 to 5 degrees and a range of pitch angles from −5 degrees to 5 degrees (e.g., pose bucket 332CC is centered at 0 degrees yaw and 0 degrees pitch). In such examples,FRM 322 may determine that pose bucket 332CC includesfeature data 330A in response to determining that the yaw and pitch angle of the user's face infeature data 330A falls within the range of yaw and pitch angles associated with pose bucket 332CC. - As another example,
FRM 322 may determine the yaw angle of user's face infeature data 330B is 0 degrees (e.g., centered in the left and right directions) and the pitch angle of the user's face is 23 degrees (e.g., the known user is looking up).FRM 322 may determine that pose bucket 332AC is associated with a range of yaw angles from −5 to 5 degrees and a range of pitch angles from 15 degrees to 25 degrees (e.g., pose bucket 332CC is centered at 0 degrees yaw and 20 degrees pitch). In such examples,FRM 322 may determine that pose bucket 332AC includesfeature data 330B in response to determining that the yaw and pitch angle of the user's face infeature data 330B falls within the range of yaw and pitch angles associated with pose bucket 332AC. -
FRM 322 may determinefeature data 330 and determine whether a threshold number of pose buckets 332 include one offeature data 330. For example,FRM 322 may determine a number of pose buckets 332 that includefeature data 330 of the face of the known user. In the example ofFIG. 3 ,FRM 322 determines thatfeature data 330 are included within 19 pose buckets (e.g., 332AB, 332AC, 332BA, 332BB, 332BC, 332BD, 332BE, 332CA, 332CB, 332CC, 332CD, 332CE, 332DB, 332DC, 332DD, 332DE, 332EB, 332EC, and 332ED) of 25 possible pose buckets 332. In some examples,FRM 322 determines whether the number of pose buckets that includefeature data 330 satisfies (e.g., is greater than or equal to) a threshold number (e.g., 15 pose buckets, 17 pose buckets, 19 pose buckets, etc.; or 65% of pose buckets 332, 75% of pose buckets 332, 85% of pose buckets 332, etc.). For example,FRM 322 may determine that the number of pose buckets 332 that includefeature data 330 satisfies the threshold number of pose buckets in response to determining thatfeature data 330 are included within at least 75% of pose buckets 332. Determining the number of pose buckets 332 that includefeature data 330 satisfies the threshold number of pose buckets may indicate thatfeature data 330 represent the face of the known user in enough different poses to more accurately authenticate a user. - Responsive to determining that the number of pose buckets that include
feature data 330 does not satisfy the threshold number of pose buckets,image capture device 314 may capture one ormore images 333 for the enrollment process. For example,FRM 322 may output a graphical user interface instructing knownuser 338 to move his or her head to captureimages 333 of different angles of the face of knownuser 338. - Responsive to determining that the number of pose buckets that include
feature data 330 satisfies the threshold number of pose buckets,FRM 322 may associate featuredata 330 with a user account for knownuser 338. In some examples,feature data 330 may include image templates (also referred to as embeddings) for each respective image. As one example, an image template may generally correspond to a statistical model of one or more features (e.g., biometric features) of a user's face. For example,FRM 322 may generate an image template that includes a vector with a plurality of element values (e.g., 50 values, 100 values, 500 values, etc.). In some examples, each element value of the vector corresponds to a feature of the user's face (e.g., distance between the eyes, nose shape, etc.). Alternatively or additionally, element values of the vector may be generated through a, e.g., non-linear machine learned model trained to generate outputs indicative of a facial identity. For example,FRM 322 may apply a trained facial recognition model to the plurality offeature data 330 and output an image template (e.g., embedding) for eachrespective feature data 330 as a vector. In some examples,FRM 322 associates featuredata 330 with the user account by assigning data to an image template identifier and associating the respective image template identifier with a user account identifier for the user account for knownuser 338. -
FRM 322 may feature data image in enrolledinformation 328, which comprises a data store that includesfeature data 330 associated withimages 333 ofuser 338. In some examples,FRM 322 encryptsfeature data 330 prior to storing it in enrolledinformation 328. In some examples, enrolledinformation 328 may be stored locally oncomputing device 310 such that the information is not transmitted over a network to any other devices (e.g., not transmitted tosystem 100 of the vehicle ofFIG. 1 ). Further,computing device 310 may provide the user with an opportunity to delete one or more portions offeature data 330 stored in enrolled information. -
FRM 322 may perform an authentication process for an unknown user (e.g.,user 150 of the vehicle ofFIG. 1 ) after completing the enrollment process for knownuser 338. In other words,FRM 322 may receive a request to authenticate anunknown user 150 of the vehicle ofFIG. 1 , based on feature data ofuser 150 that is determined fromimages 333 captured by one of cameras in the vehicle (e.g.,camera 104 and/or 111), and which is transmitted byIHU system 100 tocomputing device 310, which may be one example ofmobile computing device 170 ofFIG. 1 .System 100 may sendfeature data 330 tocomputing device 310 that comprises, e.g., feature vector data for various features of the images ofuser 150 that are captured byimage capture device 173. - Responsive to receiving or determining feature data associated with the authentication image,
FRM 322 may determine whetherunknown user 150 is knownuser 338. In some examples,FRM 322 determines whether theunknown user 150 is the knownuser 338 based on a comparison of the feature data indicative of authentication images of the face ofunknown user 150 and thefeature data 330 indicative ofimages 333 of the face of the knownuser 338. For example,FRM 322 may compare the feature data indicative of the authentication image(s) of the face ofunknown user 150 andfeature data 330 that is computed or determined fromimages 333 of the face of the knownuser 338 using a pose-independent (also referred to as pose invariant) technique or a pose-dependent technique. - In some examples, in a pose-dependent technique,
FRM 322 determines whether theunknown user 150 is the knownuser 338 based on the authentication feature data and featuredata 330 associated with the face of the knownuser 338 in a pose closest to the pose of the face of the unknown user in authentication image ofuser 150. In one example,FRM 322 determines a pose bucket of pose buckets 332 associated with authentication feature data ofunknown user 150. For example,FRM 322 may determine the pose bucket associated with feature data ofunknown user 150 based on characteristics or landmarks of the face of the unknown user, in a manner similar to determining the pose buckets associated withfeature data 330. For instance,FRM 322 may determine the yaw angle and the pitch angle of the face in the authentication feature data ofuser 150. Responsive to determining the yaw and pitch angles of the face in the authentication feature data,FRM 322 may determine which of pose buckets 332 include the yaw and pitch angle of the face in the authentication feature data. For instance,FRM 322 may determine the yaw and pitch angles of the face in the feature data are 20 degrees and 0 degrees respectively.FRM 322 may determine that pose bucket 332CD is associated with a range of yaw angles from 15 to 25 degrees yaw and a range of pitch angles from −5 to 5 degrees. In such instances,FRM 322 may determine that the authentication feature data of theunknown user 150 is associated with pose bucket 332CD in response to determining the yaw and pitch angles of the face in the authentication feature data are included in the range of yaw and pitch angles associated with pose bucket 332CD. -
FRM 322 may determine feature data fromfeature data 330 of the face of the knownuser 338 that is included within the pose bucket associated with the feature data of the face of theunknown user 150. In other words,FRM 322 may determine which offeature data 330 has the closest pose to the pose of the authentication feature data ofuser 150. In one example,FRM 322 determines that the feature data ofuser 150 is associated with pose bucket 332CD and selects feature data (e.g., featuredata 330G for a pose G) offeature data 330 that is included within pose bucket 332CD.FRM 322 may determine whetheruser 150 is the knownuser 338 by determining a similarity score for the selectedfeature data 330G, the similarity score indicating a similarity betweenfeature data 330G and the authentication feature data. - Responsive to determining the similarity score for selected
feature data 330G (e.g., the feature data included in the pose bucket 332CD that is associated with one or more images of the unknown user 150), in some examples,FRM 322 determines whether the similarity score forfeature data 330G satisfies (e.g., is greater than or equal to) a threshold similarity score.FRM 322 may determineunknown user 150 is the knownuser 338 in response to determining that the similarity score forfeature data 330G satisfies the threshold similarity score, and may determine thatunknown user 150 is not the known user in response to determining that the similarity score forfeature data 330G does not satisfy the threshold similarity score. - In some examples,
FRM 322 determines whetherunknown user 150 is the knownuser 338 regardless of the pose. In other words, in some examples,FRM 322 utilizes pose invariant techniques to determine whether theunknown user 150 is the knownuser 338. For example,FRM 322 may determine a respective similarity score for each feature data offeature data 330, where the respective similarity score indicates a similarity between the corresponding feature data offeature data 330 and the authentication feature data ofunknown user 150. - In one scenario,
FRM 322 selects feature data offeature data 330 based on the respective similarity scores forfeature data 330 to determine whetherunknown user 150 is the knownuser 338.FRM 322 selects the feature data offeature data 330 with the similarity score indicative of a closest match to the authentication feature data. The score indicative of the closest match may be the lowest similarity score or the highest similarity score. - In some scenarios,
FRM 322 selects two or more feature data offeature data 330 based on the respective similarity scores to determine whetherunknown user 150 is the knownuser 338. In one scenario,FRM 322 determines a composite similarity score for two ormore feature data 330. For instance,FRM 322 may determine the composite similarity score based on the average of the respective similarity scores for two or more offeature data 330 and may compare the composite similarity score to the threshold similarity score to determine whetherunknown user 150 is the knownuser 338. - As another example,
FRM 322 may compare each respective similarity score for the two or more feature data to the threshold similarity score. In such examples,FRM 322 may determine thatunknown user 150 is the knownuser 338 in response to determining that a threshold number (e.g., 100%, 80%, 60% etc.) of the selected feature data have a similarity score that satisfies the threshold similarity score. For instance, if the set of selected feature data include three feature data offeature data 330 with the highest similarity scores, in some examples,FRM 322 determines thatunknown user 150 is the knownuser 338 in response to determining that the similarity score for two of the three selected feature data satisfies the threshold similarity score. - Responsive to determining that
unknown user 150 is the knownuser 338,computing device 310 may send authentication data toIHU system 100 of the vehicle. For example,computing device 310 may send data indicative of a successful authentication ofunknown user 150 as the knownuser 338, thereby indicating authentication ofuser 150. In some cases,computing device 310 may send user profile and/or account information that is customized or personalized for user 338 (e.g., authenticated user 150) tosystem 100. In some cases,computing device 310 may also send a unique identifier ofuser 338 and/or ofcomputing device 310 tosystem 100 of the vehicle. - In some cases,
computing device 310 may also store one ormore models 329. For instance, in some cases,computing device 310 may utilize deep learning models to extract facial landmarks that help in identifying an individual, as described above. These models may be tailored to execute on the dedicated neural engines or processors of the personal devices. As described above, in some examples, an image template may generally correspond to a statistical model of one or more features (e.g., biometric features) of a user's face. For example,FRM 322 may generate an image template that includes a vector with a plurality of element values (e.g., 50 values, 100 values, 500 values, etc.). In some examples, each element value of the vector corresponds to a feature of the user's face (e.g., distance between the eyes, nose shape, etc.). Alternatively or additionally, element values of the vector may be generated through a, e.g., non-linear machine learned model trained to generate outputs indicative of a facial identity. For example,FRM 322 may apply a trained facial recognition model toimages 333 and output an image template (e.g., embedding) forrespective feature data 330 as a vector. Alternatively or additionally, element values of the vector may be generated through a non-linear machine learned model trained to generate outputs indicative of a facial identity. As one example, an enrollment module may apply a trained facial recognition model toimages 333 and output an image template (e.g., a vector) forrespective feature data 330. Such an image template may be represented as a vector and may be generated by a trained facial recognition model. The vector may include a plurality of element values that each correspond to a respective feature of the user's face (e.g., distance between the eyes, nose shape, etc.). - While
computing device 310 is described as enrollingfeature data 330 of a knownuser 338 and authenticating anunknown user 150, in some examples, one or more remote computing devices may perform all or a subset of the functionality described herein. In certain alternate examples,computing device 310 may send the enrolled feature data to a vehicle computing system (e.g., vehicle computing system 100), which then may be configured to perform authentication of an unknown user in the vehicle by comparing the authentication feature data of the unknown user to the received enrolled feature data of the known user. - In some examples, a computing device (e.g.,
computing device 310 or another computing device) may utilize user data associated with a user ofcomputing device 310 only if the computing device receives permission from the user of the computing device to utilize the data. For example, before a computing device or computing system can collect or may make use of information associated with a user, the user may be provided with an opportunity to provide input to control whether programs or features of the computing device and/or computing system can collect and make use of user information. In addition, certain information may be treated in one or more ways before it is stored or used by the computing device and/or computing system, so that personally identifiable information is removed. For example, a user's identity may be treated so that no personally identifiable information can be determined about the user. For instance, the computing device may store feature data and/or an image template for an image without storing the image itself, and may associate the feature data and/or image template with a user identifier that is not associated with any other user information. Thus, the user may have control over how information is collected about the user and used by the computing device and computing system. - In this way, techniques of this disclosure may enable
computing device 310 to capture feature data of a knownuser 338 that are included within several different pose buckets. By capturing and enrolling feature data in several different pose buckets,computing device 310 may increase the number of pose buckets that include feature data of the knownuser 338, which may enablecomputing device 310 may more accurately authenticate feature data of unknown users regardless of the pose bucket of an authentication feature data for the unknown user (e.g., user 150). For instance, increasing the number of pose buckets that include enrolled feature data may increase the probability that the pose bucket associated with the authentication feature data of the unknown user is similar to the pose bucket that includes one or more of the group of enrolled feature data of the knownuser 338, which may reduce the probability of falsely rejecting the unknown user when the unknown user is in fact a known, authorized user, thus potentially improving the user experience. Further, reducing the probability of false rejections may reduce the number of authentication attempts (e.g., facial recognition, finger print recognition, PIN or passcodes, etc.) used for the computing device to enter an increased access mode, which may reduce the amount of processing cycles utilized by the processor and improve battery life. In some instance, the described techniques may reduce the probability of falsely authenticating the unknown user when the unknown user is not a known, authorized user, which may increase security ofcomputing device 310. -
FIG. 4 is a block diagram illustrating anexample computing system 410, in accordance with one or more aspects of the present disclosure.Computing system 410 may, in some cases, be a more detailed example ofcomputing device 310 ofFIG. 3 ,mobile computing device 170 ofFIG. 1 , and/or ofvehicle computing system 100 ofFIG. 1 .FIG. 4 illustrates only one particular example ofcomputing system 410, and many other examples ofcomputing system 410 may be used in other instances and may include a subset of the components included inexample computing system 410 or may include additional components not shown inFIG. 4 . As used herein, the term “image data” refers to data (e.g., feature data) that is computed or otherwise determined from a captured image of, e.g., a face of a user. - As shown in the example of
FIG. 4 ,computing system 410 includesPSD 412, one or moreimage capture devices 414, one ormore processors 430, one ormore input components 442, one ormore output components 444, one ormore communication units 446, and one ormore storage devices 448.Storage devices 448 ofcomputing system 410 includeFRM 422 and enrolledinformation 428. -
Communication channels 449 may interconnect each of thecomponents communication channels 449 may include a system bus, a network connection, one or more inter-process communication data structures, or any other components for communicating data (also referred to as information). -
Image capture devices 414 may include one or more cameras, such as digital cameras, still cameras, motion picture cameras, and the like.Image capture devices 414 may include any other devices capable of capturing and storing still or moving images. In some examples,image capture devices 414 may be capable of digitally recording images via an electronic image sensor.Image capture devices 414 may include one or more devices configured to detect visible light (e.g., visible light cameras), one or more devices configured to detect near-infrared light (e.g., near-infrared cameras), or a combination therein. In some examples,image capture devices 414 may generate image data indicative of 2-dimensional images, data indicative of 3-dimensional images, or a combination therein. In this way, a plurality ofimage capture devices 414 may capture visible light, near-infrared light, or a combination therein, and may generate image data indicative of 2-dimensional images, 3-dimensional images, or both. - One or
more communication units 446 ofcomputing system 410 may communicate with external devices by transmitting and/or receiving data. For example,computing system 410 may use one or more ofcommunication units 446 to transmit and/or receive radio signals on a radio network such as a cellular radio network. In some examples,communication units 446 may transmit and/or receive satellite signals on a satellite network such as a Global Positioning System (GPS) network. Examples ofcommunication units 446 include a network interface card (e.g. such as an Ethernet card), an optical transceiver, a radio frequency transceiver, a GPS receiver, or any other type of device that can send and/or receive information. Other examples ofcommunication units 446 may include short wave radios (e.g., NFC, BLUETOOTH (including BLE)), GPS, 3G, 4G, 5G, and WIFI radios found in mobile devices as well as Universal Serial Bus (USB) controllers and the like. - One or
more input components 442 ofcomputing system 410 may receive input. Examples of input are tactile, audio, kinetic, and optical input, to name only a few examples.Input components 442 ofcomputing system 410 include, in one example, a mouse, keyboard, voice responsive system, video camera, buttons, control pad, microphone or any other type of device for detecting input from a human or machine. In some examples,input component 442 may be a presence-sensitive input component, which may include a presence-sensitive screen, touch-sensitive screen, etc. - One or
more output components 444 ofcomputing system 410 may generate output. Examples of output are tactile, audio, and video output.Output components 444 ofcomputing system 410, in some examples, include a presence-sensitive screen, sound card, video graphics adapter card, speaker, cathode ray tube (CRT) monitor, liquid crystal display (LCD), or any other type of device for generating output to a human or machine. Output components may include display components such as cathode ray tube (CRT) monitor, liquid crystal display (LCD), Light-Emitting Diode (LED) or any other type of device for generating tactile, audio, and/or visual output. - In some examples,
PSD 412 ofcomputing system 410 may include functionality ofinput component 442 and/oroutput components 444. In the example ofFIG. 4 ,PSD 412 may include a presence-sensitive input component 464, such as a presence-sensitive screen or touch-sensitive screen. In some examples, presence-sensitive input component 464 may detect an object at and/or near the presence-sensitive input component. As one example range, presence-sensitive input component 464 may detect an object, such as a finger or stylus that is within two inches or less of presence-sensitive input component 464. Presence-sensitive input component 464 may determine a location (e.g., an (x,y) coordinate) of the presence-sensitive input component at which the object was detected. In another example range, presence-sensitive input component 464 may detect an object two inches or less from presence-sensitive input component 464 and other ranges are also possible. Presence-sensitive input component 464 may determine the location of presence-sensitive input component 464 selected by a user's finger using capacitive, inductive, and/or optical recognition techniques. - In some examples,
PSD 412 may also provide output to a user using tactile, audio, or video stimuli as described with respect tooutput component 444. For instance,PSD 412 may includedisplay component 462 that displays a graphical user interface.Display component 462 may be any type of output component that provides visual output, such as described with respect tooutput components 444. While illustrated as an integrated component ofcomputing system 410,PSD 412 may, in some examples, be an external component that shares a data or information path with other components ofcomputing system 410 for transmitting and/or receiving input and output. For instance,PSD 412 may be a built-in component ofcomputing system 410 located within and physically connected to the external packaging of computing system 410 (e.g., a screen on a mobile phone). In another example,PSD 412 may be an external component ofcomputing system 410 located outside and physically separated from the packaging of computing system 410 (e.g., a monitor, a projector, etc. that shares a wired and/or wireless data path with a tablet computer). In some examples,PSD 412, when located outside of and physically separated from the packaging ofcomputing system 410, may be implemented by two separate components: a presence-sensitive input component 464 for receiving input and adisplay component 462 for providing output. - One or
more storage components 448 withincomputing system 410 may store information for processing during operation of computing system 410 (e.g.,computing system 410 may store data accessed byFRM 422 during execution at computing system 410). In some examples,storage component 448 is a temporary memory, meaning that a primary purpose ofstorage component 448 is not long-term storage.Storage components 448 oncomputing system 410 may be configured for short-term storage of information as volatile memory and therefore not retain stored contents if powered off. Examples of volatile memories include random access memories (RAM), dynamic random-access memories (DRAM), static random-access memories (SRAM), and other forms of volatile memories known in the art. -
Storage components 448, in some examples, also include one or more computer-readable storage media.Storage components 448 in some examples include one or more non-transitory computer-readable storage mediums.Storage components 448 may be configured to store larger amounts of information than typically stored by volatile memory.Storage components 448 may further be configured for long-term storage of information as non-volatile memory space and retain information after power on/off cycles. Examples of non-volatile memories include magnetic hard discs, optical discs, flash memories, or forms of electrically programmable memories (EPROM) or electrically erasable and programmable (EEPROM) memories.Storage components 448 may store program instructions and/or information (e.g., data) associated withFRM 422.Storage components 448 may include a memory configured to store data or other information associated withFRM 422 and enrolledinformation 428. - One or
more processors 430 may implement functionality and/or execute instructions associated withcomputing system 410. Examples ofprocessors 430 include application processors, display controllers, auxiliary processors, one or more sensor hubs, and any other hardware configure to function as a processor, a processing unit, or a processing device.FRM 422 may be operable byprocessors 430 to perform various actions, operations, or functions ofcomputing system 410. - In certain examples, when computing
system 410 comprises an example ofvehicle computing system 100,storage devices 448 may include information and/or modules that are executable byprocessors 430 to implement corresponding functionality of vehicle computing system 100 (e.g., as illustrated and/or described in reference tosystem 100 inFIG. 1 and/orFIG. 5 ). In other examples, such as illustrated inFIG. 4 , wherecomputing system 410 comprises an example ofcomputing device 310 ofFIG. 3 ,processors 430 ofcomputing system 410 may retrieve and execute instructions stored bystorage components 448 that causeprocessors 430 to perform the operations described herein that are attributed toFRM 422. The instructions, when executed byprocessors 430, may causecomputing system 410 to store information withinstorage components 448. -
FRM 422 may include all functionality ofFRM 322 ofcomputing device 310 ofFIG. 3 and may perform similar operations asFRM 322 for performing facial recognition to authenticate a user ofcomputing system 410.FRM 422 may include an enrollment module 424 and anauthentication module 426. - In some examples, enrollment module 424 may perform an enrollment process to associate image data of a known user of
computing system 410 with a user account for the known user. In some examples, enrollment module 424 may perform the enrollment process one time for a given user account, for instance, when setting up a new account. - During the enrollment phase,
image capture devices 414 captures one or more image data 330 (ofFIG. 1 ) of a known user (e.g., a user logged in to a user account associated) ofcomputing system 410 and generates image data indicative of each of the images. Enrollment module 424 ofFRM 422 may receive the image data fromimage capture device 414 and analyze the image data to assign each ofimage data 330 to one or more pose buckets 332 (ofFIG. 1 ). - Enrollment module 424 may determine which of pose buckets 332 is associated with image data based on characteristics or landmarks of the face of the known user included in the image data. For instance, enrollment module 424 may detect landmarks in the image data of the unknown user's face, such as the user's eyes, nose, and mouth, and may determine the yaw and pitch angle of the face based on the landmarks. For example, enrollment module 424 may receive
image data 330A and determine the yaw angle of the user's face inimage data 330A is approximately 0 degrees and the pitch angle of the user's face inimage data 330A is approximately 0 degrees based on the landmarks inimage data 330A. Enrollment module 424 may determine that pose bucket 332CC includes a range of yaw angles from −5 degrees to 5 degrees and a range of pitch angles from −5 degrees to 5 degrees (e.g., pose bucket 332CC is centered on 0 degrees yaw and 0 degrees pitch). In such examples, enrollment module 424 may determine that the yaw and pitch angles of the user's face inimage data 330A are within the range of pitch and yaw angles for pose bucket 332CC, such that enrollment module 424 determines thatimage data 330A should be included within pose bucket 332CC. In some examples, enrollment module 424 may determine the amount of roll of the user's face inimage data 330A. For example, enrollment module 424 may determine which of pose buckets 332 includeimage data 330A based on the yaw angle, pitch angle, and roll of the user's face inimage data 330A. - In some examples, enrollment module 424 determines whether
image data 330A should be included in a plurality of pose buckets 332. For example, enrollment module 424 may includeimage data 330A within a given pose bucket when the yaw and pitch angles ofimage data 330A are within a predefined distance (e.g., a radius of 10 degrees) of the center of the given pose bucket 332. For instance, enrollment module 424 may determine that the yaw and pitch angles of the user's face inimage data 330A are 0 degrees and 0 degrees, respectively. In one instance, the predefined distance may be 10 degrees and pose bucket 332BC may be centered at 0 degrees yaw and 10 degrees pitch, such that enrollment module 424 may determine that the yaw and pitch angles ofimage data 330A lie within the predefined distance of the center of pose bucket 332BC. In such instances, enrollment module 424 may includeimage data 330A in pose bucket 332BC. Similarly, enrollment module 424 may determine that the yaw and pitch angles forimage data 330A are within the predefined distance of the center of pose buckets 332CB, 332CD, and 332DC, and includeimage data 330A in pose buckets 332CB, 332CD, and 332DC in addition to pose buckets 332CC and 332BC. - In some examples, enrollment module 424 receives
image data 330B after receivingimage data 330A. Enrollment module 424 may determine whether to includeimage data 330B in any of pose buckets 332. Enrollment module 424 may determine the yaw angle of the user's face inimage data 330B is approximately 0 degrees and the pitch angle of the user's face inimage data 330B is approximately 19 degrees based on the landmarks inimage data 330B. Enrollment module 424 may determine that pose bucket 332AC includes a range of yaw angles from −5 degrees to 5 degrees and a range of pitch angles from 15 degrees to 25 degrees (e.g., pose bucket 332AC is centered on 0 degrees yaw and 20 degrees pitch). In such examples, enrollment module 424 may determine that the yaw and pitch angles of the user's face inimage data 330B are within the range of pitch and yaw angles for pose bucket 332AC. Enrollment module 424 may determine whether pose bucket 332AC includes one ofimage data 330 and may includeimage data 330B in pose bucket 332AC in response to determining that pose bucket 332AC does not already include image data. - Enrollment module 424 may determine whether to include
image data 330B in any other pose buckets 332. For example, enrollment module 424 may determine whether the yaw and pitch angles forimage data 330B are within the predefined distance of the center of other pose buckets 332. For instance, enrollment module 424 may determine that the predefined distance is 10 degrees and pose bucket 332BC is centered at 0 degrees yaw and 10 degrees pitch, such that enrollment module 424 may determine that the yaw and pitch angles ofimage data 330B (e.g., 0 degrees yaw, 19 degrees pitch) lie within the predefined distance of the center of pose bucket 332BC. Enrollment module 424 may determine whether to includeimage data 330B in pose bucket 332BC in response to determining that the yaw and pitch angles forimage data 330B are within the predefined threshold of the center of pose bucket 332BC. - In some examples, enrollment module 424 determines that pose bucket 332BC already includes
image data 330A and determines whether to replaceimage data 330A withimage data 330B in pose bucket 332BC. In one example, enrollment module 424 may determine whether to replaceimage data 330A withimage data 330B in pose bucket 332BC based on a distance between the center of pose bucket 332BC and the respective yaw and pitch angles forimage data image data 330A are 0 degrees, 0 degrees; the yaw and pitch angles forimage data 330B are 0 degrees, 19 degrees, and the yaw and pitch angles for the center of pose bucket 332BC are 0 degrees, 10 degrees. In such examples, enrollment module 424 may determine thatimage data 330B is closer to the center of pose bucket 332BC thanimage data 330A, and may replaceimage data 330A withimage data 330B within pose bucket 332BC. - In some examples, enrollment module 424 may determine whether to include
image data 330A orimage data 330B within pose bucket 332BC based on the order of receiving theimage data image data 330A should be included within pose bucket 332BC as theimage data 330A was received first. In another example, enrollment module 424 may include the most recent image data within pose bucket 332BC. In these examples, enrollment module 424 may determine thatimage data 330B should be included within pose bucket 332BC. - Enrollment module 424 may receive data
indicative image data 330 fromimage capture devices 414 and determine whether a threshold number of pose buckets 332 include one ofimage data 330. For example, enrollment module 424 may determine a number of pose buckets 332 that includeimage data 330 of the face of the known user. Responsive to determining the number of pose buckets 332, enrollment module 424 determines whether the number of pose buckets that includeimage data 330 satisfies (e.g., is greater than or equal to) a threshold number of pose buckets. For example, enrollment module 424 may determine that the number of pose buckets 332 that includeimage data 330 satisfies the threshold number of pose buckets in response to determining thatimage data 330 are included within at least 75% of pose buckets 332. Responsive to determining that the number of pose buckets that includeimage data 330 does not satisfy the threshold number of pose buckets, enrollment module 424 may causeimage capture devices 414 to capture one or moreadditional image data 330 for the enrollment process. - In some examples, enrollment module 424 may associate data indicative of the
image data 330 with a user account for the known user in response to determining that the number of pose buckets that includeimage data 330 satisfies the threshold number of pose buckets. In some examples,image data 330 may include the images themselves or image templates for each respective image (e.g., computed facial feature data). The image templates may include a vector with a plurality of element values (e.g., 50 values, 100 values, 500 values, etc.). In some examples, each element value of the vector corresponds to a feature of the user's face (e.g., distance between the eyes, nose shape, etc.). Alternatively or additionally, element values of the vector may be generated through a non-linear machine learned model trained to generate outputs indicative of a facial identity. As one example, enrollment module 424 may apply a trained facial recognition model to the plurality ofimage data 330 and output an image template (e.g., a vector) for eachrespective image data 330. In some examples, enrollment module 424 associates the data indicative ofimage data 330 with the user account by assigning each image template an image template identifier and associating the respective image template identifier with a user account identifier for the user account for the known user. - Enrollment module 424 may store the data indicative of image data 330 (e.g., the images themselves or the image templates) to enrolled
information 428. In some examples, enrollment module 424 encrypts the data indicative ofimage data 330 prior to storing the data indicative ofimage data 330.Image data 330 may be stored locally oncomputing system 410 such that the data is not transmitted over a network to any other devices. Further,computing system 410 may provide the user with an opportunity to delete the data indicative ofimage data 330. - In some scenarios,
authentication module 426 performs the authentication process in response to receiving data indicative of an authentication image of a face of an unknown user 150 (ofFIG. 1 ) usingauthentication module 426. The data indicative of the image of the face of the unknown user may include the image itself or an image template representing characteristics of the unknown user's face. - Responsive to receiving the data indicative of the authentication image data,
authentication module 426 may determine whetherunknown user 150 is a knownuser 338.Authentication module 426 may determine whetherunknown user 150 is a knownuser 338 based on the authentication image data ofuser 150 and one or moreenrollment image data 330. In some instances,authentication module 426 determines whetherunknown user 150 is a knownuser 338 using a pose-independent (also referred to as pose invariant) technique or a pose-dependent technique. - In some pose-dependent examples,
authentication module 426 determines whether theunknown user 150 is the knownuser 338 based on the authentication image data and a particular image data ofimage data 330 that includes the face of the known user in a pose closest to the pose of the face ofunknown user 150 in the authentication image. In one example, theparticular image data 330 that includes the face of the knownuser 338 in the pose closest to the pose of the face in the authentication image data may be the particular image ofimage data 330 that is included in the same pose bucket as a pose bucket 332 associated with the authentication image data. -
Authentication module 426 may determine a pose bucket of pose buckets 332 associated with authentication image data ofunknown user 150. For example,authentication module 426 may determine the pose bucket associated with this image data based on characteristics or landmarks of the face of the unknown user. For instance,authentication module 426 may determine the yaw angle and the pitch angle of the face in the authentication image data. Responsive to determining the yaw and pitch angles of the face in the authentication image data,authentication module 426 may determine which of pose buckets 332 include the yaw and pitch angle of the face in the authentication image data, and determine that pose bucket (e.g., pose bucket 332CD) is the pose bucket associated with the authentication image data. In some instances,authentication module 426 may determine a roll of the face ofunknown user 150 in the authentication image data.Authentication module 426 may determine which of pose buckets 332 is associated with the authentication image data based on the yaw angle, the pitch angle, and the roll of the face in the authentication image data. -
Authentication module 426 may determine which ofimage data 330 is included within the pose bucket associated with the authentication image data (e.g., pose bucket 332CD). In an example where pose bucket 332CD is associated with the authentication image data,authentication module 426 may query enrolledinformation 428 and determine thatimage data 330G is included within pose bucket 332CD. Responsive to determining thatimage data 330G is included within the pose bucket associated with the authentication image data,authentication module 426 may determine whetheruser 150 is the knownuser 338 by determining a similarity score for the selectedimage data 330G. In some examples, the similarity score forimage data 330G indicates a similarity betweenimage data 330G and the authentication image data. -
Authentication module 426 may determine a similarity score forimage data 330G based on the data indicative ofimage data 330G and the data indicative of the authentication image data. In some examples, the data indicative ofimage data 330G includes an image template forimage data 330G. Such an image template may be represented as a vector and may be generated by a trained facial recognition model. The vector may include a plurality of element values that each correspond to a respective feature of the user's face (e.g., distance between the eyes, nose shape, etc.). Similarly, the authentication image data may include a vector generated in a similar manner. In some examples,authentication module 426 determines the similarity score by calculating an angle between a vector representingimage data 330G and a vector representing the authentication image data. As another example,authentication module 426 may determine the similarity score forimage data 330G by determining a cosine similarity between a vector representingimage data 330G and a vector representing the authentication image data. - In some examples,
authentication module 426 determines whether the similarity score forimage data 330G satisfies a threshold similarity score. As one example,authentication module 426 determines the similarity score forimage data 330G by determining the angle between the vector representingimage data 330G and the vector representing the authentication image data, and determines that the similarity score forimage data 330G satisfies the threshold similarity score in response to determining that the similarity score is less than the threshold similarity score. As another example,authentication module 426 determines the similarity score forimage data 330G by determining the cosine similarity between the vector representingimage data 330G and the vector representing the authentication image data, and determines that the similarity score forimage data 330G satisfies the threshold similarity score in response to determining that the similarity score is greater than the threshold similarity score. -
Authentication module 426 may determineunknown user 150 is the knownuser 338 in response to determining that the similarity score forimage data 330G satisfies the threshold similarity score. Similarly,authentication module 426 may determine thatunknown user 150 is not the knownuser 338 in response to determining that the similarity score forimage data 330G does not satisfy the threshold similarity score. - In some pose-independent examples,
authentication module 426 determines a respective similarity score for each ofimage data 330 to determine whether theunknown user 150 is the knownuser 338. The respective similarity score indicates a similarity between the corresponding image data ofimage data 330 and the authentication image data. As discussed above,authentication module 426 may determine a respective similarity score for each ofimage data 330 based on the data indicative of therespective image data 330 and the data indicative of the authentication image data. In some examples, the data indicative ofimage data 330 includes a respective image template. Such an image template may be represented as a vector and may be generated by a trained facial recognition model. The vector may include a plurality of element values that each correspond to a respective feature of the user's face. In such examples, the data indicative of the authentication image data may include a vector that includes a plurality of element values that each correspond to a respective feature of the face ofunknown user 150. In some scenarios,authentication module 426 determines the respective similarity score for each ofimage data 330 by calculating an angle between the respective vector and the vector representing the authentication image data. As another example,authentication module 426 may determine the respective similarity score for each ofimage data 330 by determining a cosine similarity between the respective vector for each ofimage data 330 and the vector representing the authentication image data. - In one pose-independent example,
authentication module 426 selects image data ofimage data 330 based on the respective similarity scores forimage data 330 to determine whetherunknown user 150 is the knownuser 338.Authentication module 426 selects the single image data ofimage data 330 with the similarity score indicative of a closest match to the authentication image data. In some examples,authentication module 426 determines the respective similarity scores forimage data 330 based on an angle between each vector representing a respective image data ofimage data 330 and the vector representing the authentication image data and determines that the score indicative of the closest match is the lowest similarity score (e.g., the smaller angle between two vectors the closer the vectors are to one another). In another example,authentication module 426 determines the respective similarity scores forimage data 330 based on a cosine similarity between each vector representing a respective image data ofimage data 330 and the vector representing the authentication image data and determines that the score indicative of the closest match is the highest similarity score (e.g., the larger cosine value between two vectors the more the vectors are more similar). - In some scenarios,
authentication module 426 selects two or more image data ofimage data 330 based on the respective similarity scores to determine whetherunknown user 150 is the knownuser 338. In one scenario,authentication module 426 determines a composite similarity score for two ormore image data 330. In some instances,authentication module 426 may determine the composite similarity score based on the highest similarity scores for two ormore image data 330 or the lowest similarity scores for two ormore image data 330. In one instance,authentication module 426 may determine the composite similarity score based on the average of the respective similarity scores for two or more ofimage data 330 and may compare the composite similarity score to the threshold similarity score to determine whetherunknown user 150 is the knownuser 338. - As another example,
authentication module 426 may compare each respective similarity score for the two or more image data to the threshold similarity score. In such examples,authentication module 426 may determine thatunknown user 150 is the knownuser 338 in response to determining that a threshold number (e.g., 100%, 80%, 60% etc.) of the selected image data have a similarity score that satisfies the threshold similarity score. For instance,authentication module 426 may determine that the set of selected image data include the three image data ofimage data 330 with the highest similarity scores, and may determine thatunknown user 150 is the knownuser 338 in response to determining that the similarity score for two of the three selected image data satisfies the threshold similarity score. - In some cases,
storage devices 448 may also store one ormore models 429. For instance, in some cases,computing device 310 may utilize deep learning models to extract facial landmarks that help in identifying an individual, as described above. These models may be tailored to execute on the dedicated neural engines or processors of the personal devices. As described above, in some examples, an image template may generally correspond to a statistical model of one or more features (e.g., biometric features) of a user's face. For example,FRM 322 may generate an image template that includes a vector with a plurality of element values (e.g., 50 values, 100 values, 500 values, etc.). In some examples, each element value of the vector corresponds to a feature of the user's face (e.g., distance between the eyes, nose shape, etc.). Alternatively or additionally, element values of the vector may be generated through a, e.g., non-linear machine learned model trained to generate outputs indicative of a facial identity. For example,FRM 322 may apply a trained facial recognition model to the plurality ofimage data 330 and output an image template (e.g., embedding) for eachrespective image data 330 as a vector. Alternatively or additionally, element values of the vector may be generated through a non-linear machine learned model trained to generate outputs indicative of a facial identity. As one example, enrollment module 424 may apply a trained facial recognition model to the plurality ofimage data 330 and output an image template (e.g., a vector) for eachrespective image data 330. Such an image template may be represented as a vector and may be generated by a trained facial recognition model. The vector may include a plurality of element values that each correspond to a respective feature of the user's face (e.g., distance between the eyes, nose shape, etc.). - In some cases, the biometric feature information or identifier associated with a known user may include the various enrolled image data and/or features for that user that are stored in enrolled
information 428, which may include image data and/or feature information for the various different poses forimage data 330. This biometric information stored in enrolledinformation 428 may, in some cases, also include a unique identifier of the user, and/or information for a user profile or account that is specific to the user. The machine models may be stored locally (e.g., inmodels 429 of computing device 410), or may be stored remotely from device 410 (e.g., on one or more remote servers). - Responsive to determining that
unknown user 150 is the knownuser 338,computing device 310 may send authentication data toTHU system 100 of the vehicle. For example,computing device 310 may send data indicative of a successful authentication ofunknown user 150 as the knownuser 338, thereby indicating authentication ofuser 150. In some cases,computing device 310 may send user profile and/or account information that is customized or personalized for user 338 (e.g., authenticated user 150) tosystem 100. In some cases,computing device 310 may also send a unique identifier ofuser 338 and/or ofcomputing device 310 tosystem 100 of the vehicle. -
FIG. 5 is a diagram illustrating an example driver onboarding and authentication process, in accordance with one or more aspects of the present disclosure, andFIG. 6 is a diagram illustrating an example facial enrollment process, in accordance with one or more aspects of the present disclosure. Enrolling or onboarding auser 150, such as a driver, may be done on the user's personal device (e.g.,mobile computing device 170 ofFIG. 1 ,computing device 310 ofFIG. 3 ), such as at the time of account creation onIHU system 100. This process involves running an enrollment application on the personal device that captures the biometric features of the user's face, using the front facing camera on the device, such as described above, e.g., in reference toFIG. 3 . These corresponding facial features are stored as feature data on the personal device (e.g., enrolled information 328). The personal device is registered/paired as a trusted device with the IHU (e.g.,IHU system 100 ofFIG. 1 ). The camera of the personal device (e.g., one or more ofimage capture devices 314 of computing device 310) may be any of various different types of cameras, such as a camera capturing near infrared (NIR) frames and/or red-green-blue (RGB) frames. When the personal device is paired with the IHU, the personal device is recognized as a trusted device with respect to the IHU, and is assigned a unique identifier by the IHU. This trusted device, and its corresponding unique identifier, are associated by the IHU with the user (e.g., user 150) of the vehicle, and any profile and/or account information for that user. - In various examples, the enrollment process may include the following aspects. Various personal computing devices or systems, such as
mobile computing device 170 and/orcomputing device 310, may be equipped with front-facingimage capture devices frames 551 may be one example ofimages 333 shown inFIG. 3 . The NIR spectrum may, in some cases, provides better adaptability to different lighting conditions. In some examples, the personal device may utilize deep learning models to extract facial landmarks that help in identifying an individual, as described above. These models may be tailored to execute on the dedicated neural engines or processors of the personal devices. As described above, in some examples, an image template may generally correspond to a statistical model of one or more features (e.g., biometric features) 553 of a user's face, where features 553 may be one example offeature data 330. In some cases,computing device 310 may include model information withinmodels 329 that is based on trained image and/or feature data for images taken with an NIR camera and/or an RGB camera. In some cases,computing device 310 may distill the information (e.g., feature data) inmodels 329 from one color space (RGB space) to another color space (NIR space). As a result, the cameras used within the vehicle ofFIG. 1 do not necessarily have to be of the same type as the image capture devices (e.g., cameras) 314 used incomputing device 310. For example, the vehicle may include NIR cameras, butimage capture devices 314 may include RGB cameras. In various examples, the cameras used within the vehicle ofFIG. 1 may be of the same type asimage capture devices 314 included incomputing device 310. - For example,
FRM 322 may generate an image template that includes a vector with a plurality of element values (e.g., 50 values, 100 values, 500 values, etc.). In some examples, each element value of the vector corresponds to a feature of the user's face (e.g., distance between the eyes, nose shape, etc.). Alternatively or additionally, element values of the vector may be generated through a, e.g., non-linear machine learned model trained to generate outputs indicative of a facial identity. For example,FRM 322 may apply a trained facial recognition model toframes 551 and output an image template (e.g., embedding) for respective facial features infeatures 553 as a vector. Alternatively or additionally, element values of the vector may be generated through a non-linear machine learned model trained to generate outputs indicative of a facial identity. - As one example, enrollment module 424 may apply a trained facial recognition model to
frames 551 and output an image template (e.g., a vector) for respective facial features infeatures 553. Such an image template may be represented as a vector and may be generated by a trained facial recognition model. The vector may include a plurality of element values that each correspond to a respective feature of the user's face (e.g., distance between the eyes, nose shape, etc.). In some cases, the biometric feature information oridentifier 555 associated with a known user may include the various enrolled features for that user that are stored on the trusted device (e.g., in enrolled information 328), and the stored information may include feature information for the various different poses. This biometric information stored in enrolledinformation 328 may also include a unique identifier of the user, and/or information for a user profile or account that is specific to the user. The machine models may be stored locally (e.g., onmobile computing device 170,computing device 310,computing device 410, THU system), or may be stored remotely from these systems (e.g., on one or more remote servers). - In one or more alternate examples to the one illustrated in
FIG. 5 , the trusted mobile computing device may send the enrolled feature data to a vehicle computing system (e.g., vehicle computing system 100), which then may be configured to perform authentication of the unknown user in the vehicle by comparing the authentication feature data of the unknown user to the received enrolled feature data of a known user. In these cases,vehicle computing system 100 receives, from the mobile computing device, the feature data (e.g., feature data or identifier 555) associated with the at least one image of the face of the previously enrolled user.Vehicle computing system 100 may then be configured to this received feature data with the feature data (e.g., features 559) associated with the captured image of the face of the user inside the vehicle.Vehicle computing system 100 may then determine, based on the comparing, the match between the user of the vehicle and the previously enrolled user, where the authentication data processed byvehicle computing system 100 comprises an indication of the match. - As described previously in reference to
FIG. 3 , and as also shown inFIG. 5 , during the enrollment process, the computing device (e.g.,mobile computing device 170, computing device 310) may create a gallery of facial image data (e.g., feature data) against different face poses of the same user (e.g., known user 338). The face pose estimation is comprised of pan and tilt values of the face. The poses for which facial features are registered can appear as a pattern on the personal device. These patterns guide the user to move their head capturing different poses of the face, such as shown inFIG. 6 . -
FIG. 6 illustrates a screen display of how this can be accomplished inside an application on the personal device. A knownuser 338 may utilize one or more ofimage capture devices 314 to view one or more images ofuser 338 that are output atPSD 312. A representation of these images may be displayed atPSD 312 within agraphical frame 600 that is visible byuser 338.Computing device 310 may output instructions touser 338 viaPSD 312, such as shown inFIG. 6 . - For example,
computing device 310 may output instructions touser 338 to move his/her face relative to imagecapture device 314 in order to position the user's 338 face withinframe 600, and then to gently rotate user's 338 head until all dots shown inframe 600 turn a certain color (e.g., green), or switch from unfilled dots to fully filled dots. As shown inFIG. 6 , nine dots included inframe 600 are currently unfilled, indicating thatuser 338 should rotate his/her head until all of the dots turn green or switch from an unfilled to a filled state or color (e.g., black). Onceuser 338 has done so,image capture device 314 will capture and store images for that given pose in enrolled information (e.g., data store) 328.Computing device 310 may instructuser 338 to strike various different poses when capturingfeature data 330 for various different poses, based on different pitch and yaw angles, such as previously described in reference toFIG. 3 .Computing device 310 may iterate the above process in reference toFIG. 6 for each of these different poses ofuser 338. The facial feature data of corresponding features/landmarks, for the various different facial poses, may then be stored in an enrolled galley in enrolledinformation 328. In some cases, enrolledinformation 328 may comprise a secure enclave of thepersonal device 310. -
Computing device 310, which is one example ofmobile computing device 170, may be registered withIHU system 100 of the vehicle shown inFIG. 1 , and, upon registration, may be a trusted companion device for the vehicle.IHU system 100 may assign a unique identifier tocomputing device 310 and/or knownuser 338, which may be associated with the user's account onIHU system 100. For example,user 150 of the vehicle may have a user's account onIHU system 100, which includes a personal profile foruser 150, associated with information that may be shown ondisplay 112 ofIHU system 100 that is customized or personalized foruser 150, as described previously in reference toFIGS. 1-2 . The unique identifier assigned byIHU system 100 tocomputing device 310 and/or knownuser 338 may be associated with the user account or profile ofuser 150, whenuser 150 is authenticated as knownuser 338 ofdevice 310. - In some cases,
user 150 may also have a wearable device 107 (e.g., smartwatch). In these cases,mobile computing device 170 may, in some examples, communicate withwearable device 107 to provide the information included in enrolledinformation 328 for storage onwearable device 107. In these examples,wearable device 107 may be configured to store the information included in enrolledinformation 328. As a result, oncewearable device 107 stores this information,wearable device 107 may perform the authentication functionality previously described in reference toFIG. 3 . For instance,wearable device 107 may be one example ofcomputing device 310 shown inFIG. 3 . In these cases,user 150 of the vehicle may not necessarily have a separatemobile computing device 170 in the vehicle or in proximity toIHU system 100. Instead,IHU system 100 may communicate directly withwearable device 107 for authenticatinguser 150. As noted previously,THU system 100 may communicate withwearable device 107 and/ormobile computing device 170 using a wireless communication protocol (e.g., BLUETOOTH, WIFI, BLE). - Referring again to
FIG. 5 , at the time of authenticatinguser 150 of the vehicle shown inFIG. 1 , the image frames 557 from the in-cardriver facing camera 104/111 (e.g., NIR or RGB camera) are used to computebiometric features 559 of the face ofuser 150, who may be seated in the driver seat. Thesefeatures 559 are then sent (563) fromTHU system 100 to computing device 310 (e.g.,mobile computing device 170, wearable device 107), which may occur over a secure channel for, e.g., non-spoofed features, such as described further below.Computing device 310 may be a trusted, previously registered device with respect tosystem 100 - As described previously, the software on
computing device 310 determines if there is a match in the facial features of thecurrent driver 150 and the enrolled, knownuser 338. If there is a match,computing device 310 sendsIHU system 100 data indicative of the match, andIHU system 100 receives (565) this data indicative of a match as representing authentication of the registered user, and proceeds to log in the registered user (e.g.,user 150 of the vehicle). - Regarding facial feature estimation, models similar to the enrollment models may be used by
IHU system 100. As described previously,computing device 310 may utilize one or more models 329 (e.g., enrollment models) to identify feature vector data for images of knownuser 338 taken byimage capture devices 314.IHU system 100 may utilize similar models for identifying feature vector data for images ofuser 150 taken bycamera 104 and/or 111 in the vehicle.IHU system 100 may, for example, utilize these models to determine the facial features ofuser 150 based on, e.g., NIR images acquired bycamera IHU system 100 to the trusted device (e.g.,computing device 310, mobile computing device 170) forauthentication user 150. IHU system and the trusted device may communicate using various different encryption/decryption techniques to maintain the confidentiality of the feature data that is exchanged between these entities. In some examples,IHU system 100 may establish a secure communication channel with the previously registered trusted device. This channel can be established over, e.g., BLUETOOTH/BLE, or other wireless interfaces. - Regarding authentication on trusted device (e.g., computing device 310), the trusted device may, upon receiving the incoming feature data (e.g., feature vector data) from
IHU system 100, may compare such data against the gallery of saved feature vectors, and a match is determined, such as described previously (e.g., in reference toFIG. 3 ). If there is a match, an indication of this match, and/or authentication data for the user (e.g., registered token) is sent to theIHU system 100 over the secure channel, which may be used or otherwise serve as the trigger to log inuser 150 into theIHU system 100. - In some instances, it is possible that an intruder with a stolen personal device (e.g., computing device 310) can attempt to unlock the
IHU system 100 using, e.g., a three-dimensional mask of the owner (e.g., user 150), or other images (e.g., paper images) of the owner's face. The deep learning models used by IHU system and/orcomputing device 310 may be capable of spotting or otherwise detecting such a spoofing attempt, as part ofspoof detection 561, and abort the authentication procedure. For example, these models may be trained with various different objects and types of objects (e.g., human objects, paper objects, plastic objects). As a result, the models may include various different feature data (e.g., feature vectors) that include features associated with images of these different types of objects, and the models may be used to identify features of, e.g., a three-dimensional mask or paper image of an owner, in an effort to abort any authentication procedures associated with such spoofing attempts. - In addition, in various examples,
IHU system 100 may utilize deep learning models and/or inputs from other signals to determine the presence ofuser 150, and/or an appropriate or optimal time to capture a facial image ofuser 150 usingcameras 104/111. For example,IHU system 100 may utilize a learning model, trained with various image information to store corresponding feature vector information, to determine the gaze ofuser 150.IHU system 100 may determine to capture a facial image ofuser 150 when the gaze ofuser 150 is towards the camera, or when the pose ofuser 150 is optimal for capturing an increased number of facial feature information for use in authenticatinguser 150. In addition,system 100 may utilize additional signals provided by the vehicle. For example, if the vehicle provides signals tosystem 100 indicative of user's 150 seatbelt being fastened, user's 150 hands onsteering wheel 154, or the change of the vehicle's gear from park to drive,system 100 may use these signals (independently or in conjunction with gaze information gathered from the camera) to determine an appropriate or optimal time to capture a facial image ofuser 150 usingcameras 104/111. - As noted above, in many cases, the cameras used by IHU system 100 (e.g.,
cameras 104/111) and/or cameras used by computing device 310 (e.g., image capture devices 314) may comprise NIR cameras, which may provide improved adaptability to ambient lighting and worn items like sunglasses. However, these systems and devices may use any number of other different types of cameras (e.g., RGB cameras, NIR cameras, etc.). - The techniques of the present disclosure address the challenge of reliable authentication of a user (e.g., user 150) using biometric features computed from frames captured by different cameras (e.g., cameras of
THU system 100 and cameras of computing device 310), where such comparison and authentication may be invariant or adaptable to various different lighting conditions. As described previously, the cameras ofIHU system 100 andcomputing device 310 may be of the same type or different types, which provides added flexibility. The models used insystem 100 and/orcomputing device 310 may be trained using an approach that can generate matchable features for the same individual's face captured using different cameras. The disclosed techniques also reliably handle faces at differing distances/poses from the in-car camera ofIHU system 100, due to the robust processes and enrollment feature data stored for different user poses incomputing device 310. - These aspects may be potentially achieved without user intervention by
user 150, offering a seamless “get in and drive” experience foruser 150. The models used byIHU system 100 and/orcomputing device 310 to compute biometric features allow the above workflow to work with different personal devices (e.g., personal devices running various different operating systems). Additionally, since the facial features of any user stays on the user's personal device (e.g., trusted device 310), there user's private biometric identity and information resides only on the user's personal device, and not on THU system, which may be included in a shared vehicle potentially used by various different individuals. - In some examples, the disclosed techniques also enable use cases where a user can log in to a rental/shared/leased car and have the same personalized experience without worrying about having to wipe out any personal information after the user is done using the vehicle. In these examples, no personal information is stored in the
IHU system 100 of such a vehicle, but is instead only stored on the user's personal device (e.g., device 310). - In some other examples, another use of the disclosed techniques could be for ride-sharing applications to verify the identity of a registered driver (e.g., user 150), and prevent unregistered individuals, who may have stolen or are otherwise carrying the personal device of the registered driver, from being able to accept any ride requests using
IHU system 100. In certain other examples, the disclosed techniques may confirm the identity or otherwise authenticateuser 150 of the vehicle before allowinguser 150 to operate the vehicle, in order to prevent unauthorized individuals from operating the vehicle. -
FIG. 7 is a flowchart illustrating example process performed by an example computing system, in accordance with one or more aspects of the present disclosure. For example, the process illustrated inFIG. 7 may be performed by mobile computing device 170 (FIG. 1 ), wearable device 107 (FIG. 1 ), computing device 310 (FIG. 3 ), and/or computing system 410 (FIG. 4 ). For purposes of illustration only, the process ofFIG. 7 will be described in reference to operations performed by computingdevice 310. -
Computing device 310 may establish (702) a connection with a vehicle computing system (e.g., IHU system 100) of a vehicle. For example,computing device 310 may utilize one or more communication units (e.g., communication units 446) to establish this connection. After establishing the connection,computing device 310 may receive (704), from the vehicle computing system, first feature data associated with at least one image of a face of a user (e.g., user 150) of the vehicle. The at least one image of the face of the user of the vehicle may captured by an image capture device (e.g.,camera 104, 111) connected to at least a portion of the vehicle. -
Computing device 310 may then determine (706), based on a comparison between the first feature data and second feature data (e.g., feature data associated withimages 330 stored in enrolled information 328) associated with at least one image of a face of a previously enrolled user (e.g., user 338), a match between the user of the vehicle and the previously enrolled user.Computing device 310 may authenticate (708), based on the match, the user of the vehicle, and send (710), to the vehicle computing system, authentication data for the user of the vehicle, where the authentication data is indicative of the match. -
FIG. 8 is a flowchart illustrating example process performed by an example vehicle computing system, in accordance with one or more aspects of the present disclosure. For example, the process illustrated inFIG. 8 may be performed byvehicle computing system 100 ofFIG. 1 and/or computing system 410 (FIG. 4 ). For purposes of illustration only, the process ofFIG. 8 will be described in reference to operations performed byvehicle computing system 100. -
Vehicle computing system 100 may establish (802) a connection with a mobile computing device (e.g.,mobile computing device 170 ofFIG. 1 ,wearable device 107 ofFIG. 1 ,computing device 310 ofFIG. 3 ), where the vehicle computing system includes an infotainment head unit.Vehicle computing system 100 may determine (804) a presence of a user (e.g., user 150) inside a vehicle. After determining the presence of the user inside the vehicle,vehicle computing system 100 captures (806), using an image capture device (e.g.,camera 104 and/or 111) that is connected to at least a portion of the vehicle, at least one image of a face of the user. -
Vehicle computing system 100 determines (808) first feature data associated with the at least one image of the face of the user, and receives (810) authentication data for the user. The authentication data indicates a match between the user and a previously enrolled user.Vehicle computing system 100 then accesses (812), based on the authentication data for the user, user account information to log the user into the infotainment head unit. - The following Examples are provided for purposes of illustration only.
- Example 1: A method comprising: establishing, by a mobile computing device, a connection with a vehicle computing system of a vehicle; after establishing the connection, receiving, by the mobile computing device and from the vehicle computing system, first feature data associated with at least one image of a face of a user of the vehicle, wherein the at least one image of the face of the user of the vehicle is captured by an image capture device connected to at least a portion of the vehicle; determining, by the mobile computing device and based on a comparison between the first feature data and second feature data associated with at least one image of a face of a previously enrolled user of the mobile computing device, a match between the user of the vehicle and the previously enrolled user; authenticating, by the mobile computing device and based on the match, the user of the vehicle; and sending, by mobile computing device and to the vehicle computing system, authentication data for the user of the vehicle, wherein the authentication data is indicative of the match.
- Example 2: The method of Example 1, wherein the vehicle computing system includes an infotainment head unit, and wherein sending, by the mobile computing device and to the vehicle computing system, the authentication data for the user of the vehicle causes the vehicle computing system to access user account information to log the user of the vehicle into the infotainment head unit.
- Example 3: The method of any of Examples 1-2, wherein the authentication data for the user of the vehicle includes unique identification information associated with at least one of the mobile computing device or the user of the vehicle.
- Example 4: The method of any of Examples 1-3, wherein establishing the connection comprises securely pairing the mobile computing device with the vehicle computing system of the vehicle, wherein the mobile computing device comprises a trusted device that is associated with a user account of the user of the vehicle on the vehicle computing system.
- Example 5: The method of any of Examples 1-4, wherein receiving the feature data associated with the at least one image of the face of the user of the vehicle comprises, by the mobile computing device and from the vehicle computing system, an encrypted copy of the feature data associated with the at least one image of the face of the user of the vehicle.
- Example 6: The method of any of Examples 1-5, wherein: the second feature data associated with the at least one image of the face of the previously enrolled user is associated with at least one feature vector and is included in at least one pose bucket from a plurality of pose buckets; and each pose bucket from the plurality of pose buckets is associated with a respective range of pitch angles of the face of the previously enrolled user and a respective range of yaw angles of the face of the previously enrolled user.
- Example 7: The method of Example 6, further comprising: selecting, by the mobile computing device, from the second feature data, feature data included in a particular pose bucket of the plurality of pose buckets, wherein the particular pose bucket is associated with the first feature data associated with the at least one image of the face of the user of the vehicle, and wherein determining the match between the user of the vehicle and the previously enrolled user is based on the first feature data and the selected feature data associated with the at least one image of the face of the previously enrolled user.
- Example 8: The method of Example 7, further comprising: determining, by the mobile computing device, a similarity score indicating a similarity between the first feature data associated with the at least one image of the face of the user of the vehicle and the selected feature data associated with the at least one image of the face of the previously enrolled user, wherein determining the match between the user of the vehicle and the previously enrolled user occurs responsive to determining that the similarity score satisfies a threshold similarity score.
- Example 9: The method of any of Examples 1-8, wherein the second feature data associated with the at least one image of the face of the previously enrolled user comprises a plurality feature data associated with the at least one image of the face of the previously enrolled user, the method further comprising: determining, by the mobile computing device, based on the first feature data associated with the at least one image of the face of the user of the vehicle and each feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user, a respective similarity score for each feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user, wherein each similarity score indicates a similarity between the first data associated with the at least one image of the face of the user of the vehicle and the respective feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user, wherein determining the match between the user of the vehicle and the previously enrolled user is based on the respective similarity score for the plurality of feature data associated with the at least one image of the face of the previously enrolled user.
- Example 10: The method of Example 9, further comprising: determining, by the mobile computing device, based on the respective similarity scores, a highest ranked feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user, wherein determining the match between the user of the vehicle and the previously enrolled user is based on the highest ranked feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user.
- Example 11: The method of Example 9, further comprising: determining, by the mobile computing device, based on the similarity scores for two or more feature data of the plurality of feature data associated with the at least one image of the face of the previously enrolled user, a composite similarity score, wherein determining the match between the user of the vehicle and the previously enrolled user is responsive to determining that the composite similarity score satisfies a threshold similarity score.
- Example 12: The method of Example 9, wherein determining the match between the user of the vehicle and the previously enrolled user is responsive to determining that the respective similarity score for each of two or more feature data associated with the at least one image of the face of the previously enrolled user satisfies a threshold similarity score.
- Example 13: The method of any of Examples 1-12, further comprising: determining, by the mobile computing device and during an enrollment phase, the second feature data of the at least one image of the face of the previously enrolled user using a machine learning model.
- Example 14: The method of any of Examples 1-13, wherein the mobile computing device comprises a wearable computing device.
- Example 15: A method comprising: establishing, by a vehicle computing system of a vehicle, a connection with a mobile computing device, wherein the vehicle computing system includes an infotainment head unit; determining, by the vehicle computing system, a presence of a user inside the vehicle; after determining the presence of the user inside the vehicle, capturing, by the vehicle computing system using an image capture device that is connected to at least a portion of the vehicle, at least one image of a face of the user; determining, by the vehicle computing system, first feature data associated with the at least one image of the face of the user; receiving, by the vehicle computing system, authentication data for the user, wherein the authentication data indicates a match between the user and a previously enrolled user based on a comparison between the first feature data and second feature data associated with at least one image of a face of the previously enrolled user; and accessing, by the vehicle computing system and based on the authentication data for the user, user account information to log the user into the infotainment head unit.
- Example 16: The method of Example 15, further comprising: sending, by the vehicle computing system and to the mobile computing device, the first feature data associated with the at least one image of the face of the user, wherein receiving the authentication data for the user comprises, after sending the first feature data, receiving, by the vehicle computing system and from the mobile computing device, the authentication data for the user.
- Example 17: The method of Example 15, further comprising: receiving, by the vehicle computing system and from the mobile computing device, the second feature data associated with the at least one image of the face of the previously enrolled user; comparing, by the vehicle computing system, the first feature data and the second feature data; and determining, by vehicle computing system based on the comparing, the match between the user and the previously enrolled user, wherein the authentication data comprises an indication of the match.
- Example 18: A system comprising: at least one processor; and at least one computer-readable storage device storing instructions that, when executed by the at least one processor, cause the at least one processor to perform the method of any of Examples 1-17.
- Example 19: A computer-readable storage medium storing instructions that, when executed by at least one processor, cause the at least one processor to perform the method of any of Examples 1-17.
- In one or more examples, the functions described may be implemented in hardware, hardware and software, hardware and firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over, as one or more instructions or code, a computer-readable medium and executed by a hardware-based processing unit. Computer-readable medium may include computer-readable storage media or mediums, which corresponds to a tangible medium such as data storage media, or communication media including any medium that facilitates transfer of a computer program from one place to another, e.g., according to a communication protocol. In this manner, computer-readable medium generally may correspond to (1) tangible computer-readable storage media, which is non-transitory or (2) a communication medium such as a signal or carrier wave. Data storage media may be any available media that can be accessed by one or more computers or one or more processors to retrieve instructions, code and/or data structures for implementation of the techniques described in this disclosure. A computer program product may include a computer-readable medium.
- By way of example, and not limitation, such computer-readable storage media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage, or other magnetic storage devices, flash memory, or any other storage medium that can be used to store desired program code in the form of instructions or data structures and that can be accessed by a computer. Also, any connection is properly termed a computer-readable medium. For example, if instructions are transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. It should be understood, however, that computer-readable storage mediums and media and data storage media do not include connections, carrier waves, signals, or other transient media, but are instead directed to non-transient, tangible storage media. Disk and disc, as used herein, includes compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and Blu-ray disc, where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable medium.
- Instructions may be executed by one or more processors, such as one or more digital signal processors (DSPs), general purpose microprocessors, application specific integrated circuits (ASICs), field programmable logic arrays (FPGAs), or other equivalent integrated or discrete logic circuitry. Accordingly, the term “processor,” as used herein may refer to any of the foregoing structure or any other structure suitable for implementation of the techniques described herein. In addition, in some aspects, the functionality described herein may be provided within dedicated hardware and/or software modules. Also, the techniques could be fully implemented in one or more circuits or logic elements.
- The techniques of this disclosure may be implemented in a wide variety of devices or apparatuses, including a wireless handset, an integrated circuit (IC) or a set of ICs (e.g., a chip set). Various components, modules, or units are described in this disclosure to emphasize functional aspects of devices configured to perform the disclosed techniques, but do not necessarily require realization by different hardware units. Rather, as described above, various units may be combined in a hardware unit or provided by a collection of interoperative hardware units, including one or more processors as described above, in conjunction with suitable software and/or firmware.
- Various embodiments have been described. These and other embodiments are within the scope of the following claims.
Claims (21)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962862581P | 2019-06-17 | 2019-06-17 | |
PCT/US2019/062035 WO2020256765A1 (en) | 2019-06-17 | 2019-11-18 | Seamless driver authentication using an in-vehicle camera in conjunction with a trusted mobile computing device |
Publications (1)
Publication Number | Publication Date |
---|---|
US20210229673A1 true US20210229673A1 (en) | 2021-07-29 |
Family
ID=68848453
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/764,322 Abandoned US20210229673A1 (en) | 2019-06-17 | 2019-11-12 | Seamless driver authentication using an in-vehicle camera in conjunction with a trusted mobile computing device |
Country Status (6)
Country | Link |
---|---|
US (1) | US20210229673A1 (en) |
EP (1) | EP3771314B1 (en) |
JP (1) | JP7049453B2 (en) |
KR (1) | KR102504746B1 (en) |
CN (1) | CN112399935B (en) |
WO (1) | WO2020256765A1 (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11218522B1 (en) | 2020-08-28 | 2022-01-04 | Tmrw Foundation Ip S. À R.L. | Data processing system and method using hybrid system architecture for image processing tasks |
US20220070236A1 (en) * | 2020-08-28 | 2022-03-03 | Tmrw Foundation Ip S. À R.L. | Graphical representation-based user authentication system and method |
US11386678B2 (en) * | 2018-11-13 | 2022-07-12 | Denso International America, Inc. | Driver authentication for vehicle-sharing fleet |
US20220253549A1 (en) * | 2021-02-08 | 2022-08-11 | Capital One Services, Llc | Methods and systems for automatically preserving a user session on a public access shared computer |
WO2024127415A1 (en) * | 2022-12-13 | 2024-06-20 | Tvs Motor Company Limited | A system and method for communication between a vehicle and a rider accessory device |
US12034785B2 (en) | 2020-08-28 | 2024-07-09 | Tmrw Foundation Ip S.Àr.L. | System and method enabling interactions in virtual environments with virtual presence |
US12095761B2 (en) | 2021-12-02 | 2024-09-17 | Ford Global Technologies, Llc | Enhanced biometric authorization |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20230237130A1 (en) * | 2022-01-27 | 2023-07-27 | Cypress Semiconductor Corporation | Multi-layered authentication and permission methods, systems and apparatuses |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170124385A1 (en) * | 2007-12-31 | 2017-05-04 | Applied Recognition Inc. | Face authentication to mitigate spoofing |
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4675492B2 (en) * | 2001-03-22 | 2011-04-20 | 本田技研工業株式会社 | Personal authentication device using facial images |
KR100571826B1 (en) * | 2003-12-02 | 2006-04-17 | 삼성전자주식회사 | Large volume face recognition appratus and method thereof |
JP4696857B2 (en) * | 2005-11-02 | 2011-06-08 | オムロン株式会社 | Face matching device |
JP5001028B2 (en) * | 2007-03-02 | 2012-08-15 | 株式会社デンソー | Driving environment setting system, in-vehicle device, and program for in-vehicle device |
JP2009284442A (en) * | 2008-05-26 | 2009-12-03 | Fujitsu Ten Ltd | Apparatus and method for person identification |
US20100039224A1 (en) * | 2008-05-26 | 2010-02-18 | Okude Kazuhiro | Biometrics information matching apparatus, biometrics information matching system, biometrics information matching method, person authentication apparatus, and person authentication method |
JP2009286342A (en) * | 2008-05-30 | 2009-12-10 | Fujitsu Ten Ltd | Biological information collation device, biological information collation system, and biological information collation method |
US20150088337A1 (en) * | 2012-02-27 | 2015-03-26 | Clark W. Toohy | Method and system for vehicle personalization |
JP6225460B2 (en) * | 2013-04-08 | 2017-11-08 | オムロン株式会社 | Image processing apparatus, image processing method, control program, and recording medium |
JP6052751B2 (en) * | 2013-07-03 | 2016-12-27 | パナソニックIpマネジメント株式会社 | Object recognition apparatus and object recognition method |
CN103434484B (en) * | 2013-08-20 | 2016-04-27 | 安科智慧城市技术(中国)有限公司 | Vehicle-mounted identification authenticate device, mobile terminal, intelligent vehicle key control system and method |
JP2016055789A (en) * | 2014-09-10 | 2016-04-21 | トヨタ自動車株式会社 | Vehicular authentication apparatus |
KR20160059793A (en) * | 2014-11-19 | 2016-05-27 | 현대자동차주식회사 | Apparatus and Method for Vehicle Controlling of Using User Authentication |
KR20170017111A (en) * | 2015-08-05 | 2017-02-15 | 엘지전자 주식회사 | Mobile terminal and method for controlling the same |
CN105096528B (en) * | 2015-08-05 | 2017-07-11 | 广州云从信息科技有限公司 | A kind of method for detecting fatigue driving and system |
CN105644500B (en) * | 2016-02-05 | 2018-02-23 | 重庆广播电视大学 | Car door opening control method based on wireless security system |
JP2019032694A (en) * | 2017-08-08 | 2019-02-28 | キヤノン株式会社 | Information processing apparatus, system, information processing method, and program |
US11072311B2 (en) * | 2017-09-05 | 2021-07-27 | Future Mobility Corporation Limited | Methods and systems for user recognition and expression for an automobile |
EP3456592A1 (en) * | 2017-09-15 | 2019-03-20 | Nxp B.V. | Automobile-person interaction |
JP6691309B2 (en) * | 2017-10-31 | 2020-04-28 | キヤノンマーケティングジャパン株式会社 | Information processing apparatus, control method thereof, and program |
CN107800708B (en) * | 2017-11-06 | 2022-07-01 | 上海擎感智能科技有限公司 | Vehicle-mounted machine account automatic login method and vehicle-mounted machine device |
US10676067B2 (en) * | 2018-01-05 | 2020-06-09 | Byton Limited | User capture device configuration for a vehicle |
-
2019
- 2019-11-12 US US16/764,322 patent/US20210229673A1/en not_active Abandoned
- 2019-11-18 KR KR1020207018926A patent/KR102504746B1/en active IP Right Grant
- 2019-11-18 CN CN201980007317.9A patent/CN112399935B/en active Active
- 2019-11-18 WO PCT/US2019/062035 patent/WO2020256765A1/en unknown
- 2019-11-18 EP EP19818400.4A patent/EP3771314B1/en active Active
- 2019-11-18 JP JP2020529580A patent/JP7049453B2/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170124385A1 (en) * | 2007-12-31 | 2017-05-04 | Applied Recognition Inc. | Face authentication to mitigate spoofing |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11386678B2 (en) * | 2018-11-13 | 2022-07-12 | Denso International America, Inc. | Driver authentication for vehicle-sharing fleet |
US11218522B1 (en) | 2020-08-28 | 2022-01-04 | Tmrw Foundation Ip S. À R.L. | Data processing system and method using hybrid system architecture for image processing tasks |
US20220070236A1 (en) * | 2020-08-28 | 2022-03-03 | Tmrw Foundation Ip S. À R.L. | Graphical representation-based user authentication system and method |
US12034785B2 (en) | 2020-08-28 | 2024-07-09 | Tmrw Foundation Ip S.Àr.L. | System and method enabling interactions in virtual environments with virtual presence |
US12107907B2 (en) | 2020-08-28 | 2024-10-01 | Tmrw Foundation Ip S.Àr.L. | System and method enabling interactions in virtual environments with virtual presence |
US20220253549A1 (en) * | 2021-02-08 | 2022-08-11 | Capital One Services, Llc | Methods and systems for automatically preserving a user session on a public access shared computer |
US11861041B2 (en) * | 2021-02-08 | 2024-01-02 | Capital One Services, Llc | Methods and systems for automatically preserving a user session on a public access shared computer |
US12095761B2 (en) | 2021-12-02 | 2024-09-17 | Ford Global Technologies, Llc | Enhanced biometric authorization |
WO2024127415A1 (en) * | 2022-12-13 | 2024-06-20 | Tvs Motor Company Limited | A system and method for communication between a vehicle and a rider accessory device |
Also Published As
Publication number | Publication date |
---|---|
KR20200145826A (en) | 2020-12-30 |
EP3771314B1 (en) | 2022-08-03 |
JP2021531521A (en) | 2021-11-18 |
JP7049453B2 (en) | 2022-04-06 |
CN112399935A (en) | 2021-02-23 |
EP3771314A1 (en) | 2021-02-03 |
WO2020256765A1 (en) | 2020-12-24 |
KR102504746B1 (en) | 2023-03-02 |
CN112399935B (en) | 2023-05-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3771314B1 (en) | Seamless driver authentication using an in-vehicle camera in conjunction with a trusted mobile computing device | |
US11072311B2 (en) | Methods and systems for user recognition and expression for an automobile | |
US9530265B2 (en) | Mobile terminal and vehicle control | |
US20200324784A1 (en) | Method and apparatus for intelligent adjustment of driving environment, method and apparatus for driver registration, vehicle, and device | |
KR101854633B1 (en) | Integrated wearable article for interactive vehicle control system | |
KR101659027B1 (en) | Mobile terminal and apparatus for controlling a vehicle | |
CN107757553B (en) | Method, device and system for identifying user and related vehicle | |
US10209832B2 (en) | Detecting user interactions with a computing system of a vehicle | |
CN104691449A (en) | Vehicle control apparatus and method thereof | |
US20210229633A1 (en) | Biometric user authenticating keys for vehicles and methods of use | |
EP3906499B1 (en) | User authentication using pose-based facial recognition | |
US11928895B2 (en) | Electronic device and control method therefor | |
US10471965B2 (en) | Securing guest access to vehicle | |
KR101578741B1 (en) | Mobile terminal and method for controlling the same | |
US11381950B2 (en) | In-vehicle detection of a charge-only connection with a mobile computing device | |
KR20160023755A (en) | Mobile terminal and method for controlling the same | |
KR102220367B1 (en) | Mobile terminal and method for controlling the same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: GOOGLE LLC, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SINGH, HANUMANT PRASAD R;KULAGA, PIOTR;CHU, WEN-SHENG;AND OTHERS;SIGNING DATES FROM 20191112 TO 20191115;REEL/FRAME:052667/0386 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: APPLICATION DISPATCHED FROM PREEXAM, NOT YET DOCKETED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |