Skip to content

Everything you need to add AI-driven ID document capture into your native Android app.

Notifications You must be signed in to change notification settings

BlinkID/capture-android

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Microblink

Capture SDK for Android

The Capture Android SDK gives you the ability to auto-capture high quality images of identity documents in a user-friendly way. The SDK provides you with a rectified image of the document that ensures high success rate in extracting document text or verifying the document validity.

User is guided to avoid glare, blurred images, bad lighting conditions, fingers over the document or too much tilt. The SDK is able to recognize if a document is single sided (i.e. passport) or double sided (i.e. driving license) and prompt the user to scan the back side of the document when needed.

In the results, you can get a cropped, perspective-corrected image of the document, along with the original frame. Those can be processed by your app in any way required. The SDK is lightweight and can be easily integrated into your mobile app and bland in your design.

Table of contents

Quick Start

Quick start with the sample apps

  1. Open Android Studio.
  2. In Quick Start dialog choose Open project
  3. In File dialog select CaptureSample folder.
  4. Wait for the project to load. If Android studio asks you to reload project on startup, select Yes.

Included sample apps:

  • app demonstrates quick and straightforward integration of the Capture SDK by using the provided UX and limited customization options to capture document images and display the results.

  • app-direct-api demonstrates custom integration using the Direct API, where the integrator is responsible for preparing input image stream (or static images) for analysis and building completely custom UX from scratch, based on the image-by-image feedback from the SDK.

SDK integration

Adding Capture SDK dependency

The Capture library is available on Microblink maven repository.

In your project root, add Microblink maven repository to repositories list:

repositories {
    maven { url 'https://maven.microblink.com' }
}

Add Capture as a dependency in module level build.gradle(.kts):

dependencies {
    implementation("com.microblink:capture-ux:1.4.0")
}

Launching the capture and obtaining the results

  1. A valid license key is required to initialize scanning. You can request a free trial license key, after you register, at Microblink Developer Hub. License is bound to application ID of your app, so please make sure you enter the correct application ID when asked.

    Download your licence file and put it in your application's assets folder. Make sure to set the license key before using any other classes from the SDK, otherwise you will get a runtime exception.

    We recommend that you extend Android Application class and set the license in onCreate callback like this:

    public class MyApplication : Application() {
        override fun onCreate() {
            CaptureSDK.setLicenseFile("path/to/license/file/within/assets/dir", this)
        }	
    }
  2. In your activity, define and create ActivityResultLauncher object by using MbCapture contract and define the result callback.

    private val captureLauncher =
        registerForActivityResult(MbCapture()) { captureResult ->
            when (captureResult.status) {
                CaptureResult.Status.DOCUMENT_CAPTURED -> {
                    // do something with the result, you can use analyzerResult
                    captureResult.analyzerResult?.let { analyzerResult ->
                        // use result
                    }
                }
                CaptureResult.Status.CANCELLED -> {
                    // Capture process has been canceled by the user, or because
                    // of any other unexpected error.
                }
                CaptureResult.Status.ERROR_LICENCE_CHECK -> {
                    // Capture process has been canceled because of the licence
                    // check error. This happens if you use licence which has to
                    // be online activated, and activation fails.
                CaptureResult.Status.ERROR_ANALYZER_SETTINGS_UNSUITABLE -> {
                    // Capture process has been canceled because of the AnalyzerSettings
                    // validation error. This error means that the given analyzer settings
                    // are not suitable for capturing the document from the input image
                    // (image resolution is too small to fulfill all requirements from AnalyzerSettings).
                }
            }
         }
  3. Start capture process by calling ActivityResultLauncher.launch:

    // method within MyActivity from previous step
    public fun startScanning() {
        // Start scanning
        captureLauncher.launch(CaptureSettings())
    }

    The results are going to be available in result callback, which was defined in the previous step.

Capture results

After capture is finished, SDK returns object of tipe CaptureResult. You first need to check the CaptureResult.status. If the status is DOCUMENT_CAPTURED, CaptureResult.analyzerResult will be available and you can use it.

You can check the API documentation to see which data fields are available in the AnalyzerResult.

Device requirements

Android version

Capture SDK requires Android API level 21 or newer.

Camera

Camera video preview resolution also matters. In order to perform successful scans, camera preview resolution must be at least 1080p. Note that camera preview resolution is not the same as video recording resolution.

Processor architecture

Capture SDK is distributed with ARMv7 and ARM64 native library binaries.

Capture is a native library, written in C++ and available for multiple platforms. Because of this, Capture cannot work on devices with obscure hardware architectures. We have compiled Capture native code only for the most popular Android ABIs.

If you are combining Capture library with other libraries that contain native code in your application, make sure you match the architectures of all native libraries. For example, if third party library has only ARMv7 version, you must use exactly ARMv7 version of Capture with that library, but not ARM64. Using this architectures will crash your app at initialization step because JVM will try to load all its native dependencies in same preferred architecture and will fail with UnsatisfiedLinkError.

To avoid this issue and ensure that only architectures supported by the Capture library are packaged in the final application, add the following statement to your android/defaultConfig block inside build.gradle.kts:

android {
    ...
    defaultConfig {
        ...
        ndk {
        // Tells Gradle to package the following ABIs into your application
        abiFilters 'armeabi-v7a', 'arm64-v8a'
        }
    }
}

Customizing the look and the UX

Scan overlay theming

Capture SDK

Capture SDK

To customise the scanning overlay, provide your custom style resource via CaptureSettings constructor by defining the style property. You can customise elements labeled on screenshots above by providing the following properties in your style:

exit

  • mb_capture_exitScanDrawable - icon drawable

torch

  • mb_capture_torchOnDrawable - icon drawable that is shown when the torch is enabled
  • mb_capture_torchOffDrawable - icon drawable that is show when the torch is disabled

instructions

  • mb_capture_instructionsTextAppearance - style that will be used as android:textAppearance
  • mb_capture_instructionsBackgroundDrawable - drawable used for background

flashlight warning

  • mb_capture_flashlightWarningTextAppearance - style that will be used as android:textAppearance
  • mb_capture_flashlightWarningBackgroundDrawable - drawable used for background

card icon

  • mb_capture_cardFrontDrawable - icon drawable shown during card flip animation, representing front side of the card
  • mb_capture_cardBackDrawable - icon drawable shown during card flip animation, representing back side of the card

reticle

  • mb_capture_reticleDefaultDrawable - drawable shown when reticle is in neutral state
  • mb_capture_reticleSuccessDrawable - drawable shown when reticle is in success state (scanning was successful)
  • mb_capture_reticleErrorDrawable - drawable shown when reticle is in error state

inner reticle

  • mb_capture_reticleColor - color of the shapes displayed during reticle animations

Introduction dialog and onboarding dialog

Capture SDK

introduction dialog

  • mb_capture_introductionTitleTextAppearance - style that will be used as android:textAppearance for introduction dialog title
  • mb_capture_introductionMessageTextAppearance - style that will be used as android:textAppearance for introduction dialog message
  • mb_capture_introductionBackgroundColor - background color of the introduction dialog
  • mb_capture_introductionImage - drawable displayed in the introduction dialog

onboarding dialog

  • mb_capture_onboardingImageColor - color of the onboarding images
  • mb_capture_onboardingTitleTextAppearance - style that will be used as android:textAppearance for onboarding titles
  • mb_capture_onboardingMessageTextAppearance - style that will be used as android:textAppearance for onboarding messages
  • mb_capture_onboardingButtonTextAppearance - style that will be used as android:textAppearance for onboarding buttons
  • mb_capture_onboardingBackgroundColor - onboarding screens background color
  • mb_capture_onboardingPageIndicatorColor - color of the onboarding page indicator

help button

  • mb_capture_helpButtonDrawable - icon drawable of the help button

help tooltip

  • mb_capture_helpTooltipColor - background color of the help tooltip
  • mb_capture_helpTooltipTextAppearance - style that will be used as android:textAppearance for the help tooltip text

alert dialog

  • mb_capture_alertDialogButtonTextAppearance - style that will be used as android:textAppearance for the alert dialog button
  • mb_capture_alertDialogTitleTextAppearance - style that will be used as android:textAppearance for the alert dialog title
  • mb_capture_alertDialogMessageTextAppearance - style that will be used as android:textAppearance for the alert dialog message
  • mb_capture_alertDialogBackgroundColor - alert dialog background color

You can control the visibility of the introduction dialog by using UxSettings property showIntroductionDialog and onboarding screens by using showOnboardingInfo when defining the CaptureSettings.

There is also an option for controlling the delay of the "Show help?" tooltip that is shown near the help button. The button and tooltip will be shown if the previous option for showing onboarding is true.

To change the default delay length of the tooltip, use UxSettings.showHelpTooltipTimeIntervalMs. Time parameter is set in milliseconds. The default setting of the delay is 12 seconds (12000 milliseconds).

Changing default strings and localization

SDK has built-in support for several languages.

You can modify strings and add your own language. For more information on how localization works in Android, check out the official Android documentation.

Defining your own string resources for UI elements

For the capture screen, you can define your own string resources that will be used instead of predefined ones by using the custom CaptureOverlayStrings while creating the CaptureSettings.

val captureSettings = CaptureSettings(
    strings = CaptureOverlayStrings(
        helpTooltip = R.string.your_help_tooltip_text,
        flashlightWarning = R.string.your_flashlight_warning_message,
        onboardingStrings = OnboardingStrings(
            onboardingSkipButtonText = R.string.your_onboarding_skip_button_text,
            onboardingBackButtonText = R.string.your_onboarding_back_button_text,
            onboardingNextButtonText = R.string.your_onboarding_next_button_text,
            onboardingDoneButtonText = R.string.your_onboarding_done_button_text,
            onboardingTitles = intArrayOf(
                R.string.your_onboarding_title_1,
                R.string.your_onboarding_title_2,
                R.string.your_onboarding_title_3
            ),
            onboardingMessages = intArrayOf(
                R.string.your_onboarding_msg_1,
                R.string.your_onboarding_msg_2,
                R.string.your_onboarding_msg_3
            ),
            introductionDialogTitle = R.string.your_introduction_dialog_title,
            introductionDialogMessage = R.string.your_introduction_dialog_message,
            introductionDoneButtonText = R.string.your_introduction_done_button_text
        ),
        instructionsStrings = InstructionsStrings(
            scanFrontSide = R.string.your_scan_front_side_instructions,
            scanBackSide = R.string.your_scan_back_side_instructions,
            flipDocument = R.string.your_flip_document_instructions,
            rotateDocument = R.string.your_rotate_document_instructions,
            rotateDocumentShort = R.string.your_rotate_document_short_instructions,
            moveFarther = R.string.your_move_farther_instructions,
            moveCloser = R.string.your_move_closer_instructions,
            keepDocumentVisible = R.string.your_keep_document_visible_instructions,
            alignDocument = R.string.your_align_document_instructions,
            increaseLightingIntensity = R.string.your_increase_lighting_instructions,
            decreaseLightingIntensity = R.string.your_decrease_lighting_instructions,
            eliminateBlur = R.string.your_eliminate_blur_instructions,
            eliminateGlare = R.string.your_eliminate_glare_instructions,
        ),
        alertStrings = AlertStrings(
            errorDialogMessageScanningUnavailable = R.string.your_scanning_unavailable_message,
            errorDialogMessageCheckInternet = R.string.your_check_internet_message,
            errorDialogMessageNetworkCommunicationError = R.string.your_network_communication_error_message,
            errorDialogButtonText = R.string.your_error_dialog_button_text
        )
    )
)

Using Capture filter

If you need additional checks on Capture result images, you can use CaptureFilter. This feature is optional.

Capture filter can be used with an external validation tool, such as BlinkID, to confirm that the image is of sufficient quality before ending the scanning process. During this, the scanning session is not stopping or restarting, therefore not causing friction on the end-user side.

Capture filter filters capture results after each successful side capture (accepts or drops captured side). If the captured image is filtered out, the capture process is restarted for the current side and the same side is captured again in the same camera session.

You can set your implementation of the CaptureFilter on the CaptureSettings like this:

val captureSettings = CaptureSettings(
    filterSettings = FilterSettings (
        // your implementation of the CaptureFilter interface
        YourCaptureFilterImplementation()
    )
)

We are providing one specific implementation of the CaptureFilter which uses the BlinkID SDK and accepts document images that are extractable by the BlinkID SDK. Usage of the BlinkIdCaptureFilter is described in the following section.

Using provided BlinkIdCaptureFilter

BlinkIdCaptureFilter implementation uses BlinkID SDK for filtering of capture results. For each successful side capture, this filter runs BlinkID extraction on the captured side image and side capture results are accepted only if the image is extractable by the BlinkID SDK.

To use the BlinkIdCaptureFilter, you first need to add additional dependency in module level build.gradle(.kts):

dependencies {
    // this will also add transitive dependency to the BlinkID SDK
    implementation("com.microblink:capture-filter-blinkid:1.0.0")
}

Because BlinkIdCaptureFiler internally uses BlinkID SDK, you will also need to set the valid license key for the BlinkID SDK. We recommend that you extend Android Application class and set the license in onCreate callback, it can be done in the same place as for the Capture SDK license key:

   public class MyApplication : Application() {
       override fun onCreate() {
           // setting license key for the Capture SDK like before
           CaptureSDK.setLicenseFile("path/to/license/file/within/assets/dir", this)
           // additionally set the license key for the BlinkID SDK
           com.microblink.blinkid.MicroblinkSDK.setLicenseFile("path/to/blinkid/license/file/within/assets/dir", this);
       }
   }

To activate BlinkIdCaptureFilter filter, set it on the CaptureSettings like this:

val captureSettings = CaptureSettings(
    filterSettings = FilterSettings (
        // use provided BlinkID filter implementation
        BlinkIdCaptureFilter(
            // optionally you can set extraction result listener for
            // obtaining the extraction results from the filter
            extractionResultListener = YourExtractionResultListener()
        )
    )
)

Completely custom UX with Direct API (advanced)

When using the Direct API, you are responsible for preparing input image stream (or static images) for analysis and building a completely custom UX from scratch based on the image-by-image feedback from the SDK.

Direct API gives you more flexibility with the cost of a significantly larger integration effort. For example, if you need a camera, you will be responsible for camera management and displaying real-time user guidance.

Adding Capture SDK dependency for Direct API

For Direct API, you need only Capture SDK core library: capture-core, capture-ux is not needed.

In your project root, add Microblink maven repository to the repositories list:

repositories {
    maven { url 'https://maven.microblink.com' }
}

Add capture-core library as a dependency in module level build.gradle(.kts):

dependencies {
    implementation("com.microblink:capture-core:1.4.0")
}

The AnalyzerRunner

For the Direct API integration, use the AnalyzerRunner. It is a singleton object, meaning it is possible to capture a single document at a time.

Like in the default UX, you can configure the AnalyzerRunner with desired AnalyzerSettings. It is allowed to update settings at any time during analysis.

AnalyzerRunner.settings = AnalyzerSettings(
    // set supported options
)

When starting the analysis of the next document, be sure that Analyzer has been reset to the initial state:

AnalyzerRunner.reset()

During analysis and after analysis is done, the current result is available via AnalyzerRunner.result.

After analysis is done, and you don't need the AnalyzerRunner anymore, be sure to terminate it to release the allocated memory for processing:

AnalyzerRunner.terminate()

After terminating, the AnalyzerRunner could be used later again. Just start feeding the frames for the next document.

Analyzing the stream of images

When you have a larger number of images coming from the stream, e.g. camera stream or pre-recorded video stream, use AnalyzerRunner.analyzeStreamImage method.

It is expected that you will call this method multiple times to analyze the single document and all analyzed images are considered for building the final AnalyzerRunner.result.

For each frame, all relevant info for the current status of the analysis and the capture process is returned by FrameAnalysisResultListener as FrameAnalysisResult, which could be used to guide the user through the scanning process and give real-time feedback.

When FrameAnalysisResult.captureState becomes CaptureState.DocumentCaptured, this means that the document has been successfully captured and you can use the AnalyzerRunner.result as a final capture result. To immediately reset the Analyzer to its initial state and avoid further result changes, you can use AnalyzerRunner.detachResult().

Analyzing a few images (usually one or two)

When you have a fixed number of images to analyze, e.g. one (or few) for the front side and another (or few) for the back side of the document, use AnalyzerRunner.analyzeImage, which is optimized for single image analysis.

Make sure that you have set appropriate settings to enable capturing of the document side from the single image:

AnalyzerRunner.settings = AnalyzerSettings(
    // here we have to use single frame strategy, because we have one frame per document side
    captureStrategy = CaptureStrategy.SingleFrame
)

Troubleshooting

Integration difficulties

In case of problems with SDK integration, first make sure that you have followed integration instructions and device requirements. If you're still having problems, please contact us at help.microblink.com.

Resolving conflict on libc++_shared.so

Capture contains native code that depends on the C++ runtime. This runtime is provided by the libc++_shared.so, which needs to be available in your app that is using Capture. However, the same file is also used by various other libraries that contain native components. If you happen to integrate both such library together with Capture in your app, your build will fail with an error similar to this one:

* What went wrong:
2 files found with path 'lib/arm64-v8a/libc++_shared.so' from inputs:
 - /Users/igrce/.gradle/caches/transforms-4/960385a07004d962827b3e095ca2f9ae/transformed/capture-core-xxx/jni/arm64-v8a/libc++_shared.so
 - /Users/igrce/.gradle/caches/transforms-4/0639c19a4f322d7b8c731d9105a5cf64/transformed/blinkid-xxx/jni/arm64-v8a/libc++_shared.so

The error states that multiple different dependencies provide the same file lib/arm64/libc++_shared.so (in this case, Capture and BlinkID).

You can resolve this issue by making sure that the dependency that uses newer version of libc++_shared.so is listed first in your dependency list, and then, simply add the following to your build.gradle:

android {
    packaging {
        jniLibs {
            pickFirsts.add("lib/armeabi-v7a/libc++_shared.so")
            pickFirsts.add("lib/arm64-v8a/libc++_shared.so")
        }
    }
}

IMPORTANT NOTE

The code above will always select the first libc++_shared.so from your dependency list, so make sure that the dependency that uses the latest version of libc++_shared.so is listed first. This is because libc++_shared.so is backward-compatible, but not forward-compatible. This means that, e.g. libBlinkID.so built against libc++_shared.so from NDK r24 will work without problems when you package it together with libc++_shared.so from NDK r26, but will crash when you package it together with libc++_shared.so from NDK r21. This is true for all your native dependencies.

Other problems

If you are having problems like undesired behaviour on specific device(s), crashes inside Capture SDK or anything unmentioned, please contact us at help.microblink.com describing your problem and provide following information:

  • high resolution scan/photo of the item that you are trying to read
  • information about device that you are using - we need exact model name of the device. You can obtain that information with any app like this one
  • please stress out that you are reporting problem related to Android version of Capture SDK

Additional info

Capture SDK size

We recommend that you distribute your app using App Bundle. This will defer apk generation to Google Play, allowing it to generate minimal APK for each specific device that downloads your app, including only required processor architecture support.

Here is the SDK size, calculated for supported ABIs:

ABI APK file size increase APK download size increase
armeabi-v7a 2.6 MB 2.4 MB
arm64-v8a 2.6 MB 2.5 MB

SDK size is calculated as application size increase when Capture SDK is added, with all its dependencies included.

API documentation

You can find the Capture SDK KDoc documentation here.

Contact

For any other questions, feel free to contact us at help.microblink.com.

About

Everything you need to add AI-driven ID document capture into your native Android app.

Resources

Stars

Watchers

Forks

Packages

No packages published