Best Open Source Image Recognition Software 2024

Image Recognition Software

Browse free open source Image Recognition software and projects below. Use the toggles on the left to filter open source Image Recognition software by OS, license, language, programming language, and project status.

1

Tesseract OCR

Open Source OCR Engine

Tesseract is an open source optical character recognition (OCR) engine and command line program. OCR is a technology that recognizes text characters within a digital image. The latest version of Tesseract places a greater focus on line recognition, but it still supports the legacy Tesseract OCR engine, which recognizes character patterns. Tesseract can recognize over 100 languages out of the box and can be trained to recognize others. It supports various output formats, including plain text, HTML, and PDF, and it supports Unicode (UTF-8).
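As a rough illustration of the command line program, the sketch below assembles a `tesseract` invocation from Python and runs it only if the binary is installed. The file names (`scan.png`, `scan_out`) and the helper names are hypothetical; the argument order (image, output base, `-l` language, output config such as `txt` or `pdf`) follows the usual Tesseract CLI usage, but check the documentation for your installed version.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, output_base, lang="eng", output_format="txt"):
    """Build the argument list for one `tesseract` command-line call.

    Assumed CLI shape: tesseract <image> <outputbase> -l <lang> <configfile>
    """
    return ["tesseract", image_path, output_base, "-l", lang, output_format]


def run_ocr(image_path, output_base, lang="eng", output_format="txt"):
    """Run Tesseract if the binary is on PATH and return the command used."""
    command = build_tesseract_command(image_path, output_base, lang, output_format)
    if shutil.which("tesseract") is None:
        raise FileNotFoundError("tesseract binary not found on PATH")
    subprocess.run(command, check=True)
    return command


if __name__ == "__main__":
    # Only prints the command; nothing is executed here.
    print(build_tesseract_command("scan.png", "scan_out", lang="eng", output_format="pdf"))
```

Keeping command construction separate from execution makes it easy to inspect or log the exact call before running it.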

Downloads: 925 This Week

Last Update: 2024-01-25
See Project
2

LabelImg

Graphical image annotation tool and label object bounding boxes

LabelImg is a graphical image annotation tool. It is written in Python and uses Qt for its graphical interface. Annotations are saved as XML files in PASCAL VOC format, the format used by ImageNet. It also supports the YOLO and CreateML formats. On Linux, Ubuntu, and Mac it requires at least Python 2.6 and has been tested with PyQt 4.8; however, Python 3 or above and PyQt5 are strongly recommended. Virtualenv can avoid a lot of the Qt/Python version issues. To annotate, build and launch using the instructions, click 'Change default saved annotation folder' in Menu/File, click 'Open Dir', then click 'Create RectBox'. Click and release the left mouse button to select a region to annotate with a rect box. You can drag the rect box with the right mouse button to copy or move it. The annotation will be saved to the folder you specify. You can refer to the hotkeys to speed up your workflow.
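To make the two annotation formats concrete, here is a minimal sketch that reads bounding boxes from a PASCAL VOC style XML annotation and converts them to YOLO style rows (class index plus normalized center, width, and height). The sample XML, the class list, and the function name `voc_to_yolo` are illustrative assumptions, not output produced by LabelImg itself.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation in the PASCAL VOC layout (one object, 200x100 image).
SAMPLE_VOC_XML = """<annotation>
  <filename>example.jpg</filename>
  <size><width>200</width><height>100</height><depth>3</depth></size>
  <object>
    <name>cat</name>
    <bndbox><xmin>50</xmin><ymin>20</ymin><xmax>150</xmax><ymax>80</ymax></bndbox>
  </object>
</annotation>"""


def voc_to_yolo(xml_text, class_names):
    """Convert VOC boxes to (class_id, x_center, y_center, width, height) rows.

    All four box values are normalized by the image size, as in YOLO labels.
    """
    root = ET.fromstring(xml_text)
    img_w = float(root.find("size/width").text)
    img_h = float(root.find("size/height").text)
    rows = []
    for obj in root.iter("object"):
        class_id = class_names.index(obj.find("name").text)
        box = obj.find("bndbox")
        xmin, ymin, xmax, ymax = (
            float(box.find(tag).text) for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        rows.append((
            class_id,
            (xmin + xmax) / 2 / img_w,   # normalized box center, x
            (ymin + ymax) / 2 / img_h,   # normalized box center, y
            (xmax - xmin) / img_w,       # normalized box width
            (ymax - ymin) / img_h,       # normalized box height
        ))
    return rows


if __name__ == "__main__":
    print(voc_to_yolo(SAMPLE_VOC_XML, ["cat", "dog"]))  # [(0, 0.5, 0.5, 0.5, 0.6)]
```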

Downloads: 260 This Week

Last Update: 2021-05-05
See Project
3

DeepFaceLab

The leading software for creating deepfakes

DeepFaceLab is currently the world's leading software for creating deepfakes, with over 95% of deepfake videos created with DeepFaceLab. DeepFaceLab is an open-source deepfake system that enables users to swap the faces on images and on video. It offers an imperative and easy-to-use pipeline that even those without a comprehensive understanding of the deep learning framework or model implementation can use; and yet also provides a flexible and loose coupling structure for those who want to strengthen their own pipeline with other features without having to write complicated boilerplate code. DeepFaceLab can achieve results with high fidelity that are indiscernible by mainstream forgery detection approaches. Apart from seamlessly swapping faces, it can also de-age faces, replace the entire head, and even manipulate speech (though this will require some skill in video editing).

Downloads: 259 This Week

Last Update: 2023-09-07
See Project
4

labelme Image Polygonal Annotation

Image polygonal annotation with Python

Labelme is a graphical image annotation tool. It is written in Python and uses Qt for its graphical interface. It supports image annotation for polygons, rectangles, circles, lines, and points; image flag annotation for classification and cleaning; video annotation; GUI customization (predefined labels and flags, auto-saving, label validation, etc.); exporting VOC-format datasets for semantic and instance segmentation; and exporting COCO-format datasets for instance segmentation. The first time you run labelme, it creates a config file at ~/.labelmerc. You can edit this file, and the changes will be applied the next time you launch labelme. If you would prefer to use a config file from another location, you can specify it with the --config flag.

Downloads: 65 This Week

Last Update: 2024-01-06
See Project
5

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle

PaddleOCR offers exceptional, multilingual, and practical Optical Character Recognition (OCR) tools that help users train better models and put them into practice. Built on PaddlePaddle, PaddleOCR is an ultra-lightweight OCR system with multilingual recognition, digit recognition, vertical text recognition, and long text recognition. It features a PPOCR series of high-quality pre-trained models, which includes ultra-lightweight ppocr_mobile series models, general ppocr_server series models, and ultra-lightweight compressed ppocr_mobile_slim series models. PaddleOCR is easy to install and use on Windows, Linux, macOS, and other systems.

Downloads: 34 This Week

Last Update: 2024-03-29
See Project
6

Computer Vision Annotation Tool (CVAT)

Interactive video and image annotation tool for computer vision

Computer Vision Annotation Tool (CVAT) is a free and open source, interactive online tool for annotating videos and images for Computer Vision algorithms. It offers many powerful features, including automatic annotation using deep learning models, interpolation of bounding boxes between key frames, LDAP and more. It is being used by its own professional data annotation team to annotate millions of objects with different properties. The UX and UI were also specially developed by the team for computer vision tasks. CVAT supports several annotation formats. Format selection can be done after clicking on the Upload annotation and Dump annotation buttons.
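The idea of interpolating bounding boxes between key frames can be sketched with plain linear interpolation. This is only an illustration under that assumption: CVAT's own interpolation logic is not described here and may differ, and the function name and the `(x_top_left, y_top_left, x_bottom_right, y_bottom_right)` box layout are hypothetical.

```python
def interpolate_box(keyframes, frame):
    """Linearly interpolate a bounding box for `frame`.

    keyframes maps a frame index to a box (xtl, ytl, xbr, ybr) that an
    annotator drew by hand. Frames outside the annotated range reuse the
    nearest key frame's box.
    """
    indices = sorted(keyframes)
    if frame <= indices[0]:
        return keyframes[indices[0]]
    if frame >= indices[-1]:
        return keyframes[indices[-1]]
    # Find the pair of key frames that brackets the requested frame.
    for start, end in zip(indices, indices[1:]):
        if start <= frame <= end:
            t = (frame - start) / (end - start)  # 0.0 at start, 1.0 at end
            return tuple(
                a + t * (b - a) for a, b in zip(keyframes[start], keyframes[end])
            )


if __name__ == "__main__":
    boxes = {0: (0, 0, 10, 10), 10: (20, 10, 30, 20)}
    print(interpolate_box(boxes, 5))  # (10.0, 5.0, 20.0, 15.0)
```

With this approach an annotator only draws boxes on a few key frames, and the frames in between are filled in automatically.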

Downloads: 26 This Week

Last Update: 4 days ago
See Project
7

html2canvas

A JavaScript HTML screenshot renderer

html2canvas is a JavaScript HTML renderer. The script provides the tools to take screenshots of webpages directly in the browser. The screenshot is built from the DOM, so it may not be 100% accurate to the real representation: it is not an actual screenshot, but an image reconstructed from the information available on the page. The script renders the page as a canvas image by reading the DOM and the styles applied to its elements. It doesn't require any rendering from the server, because the image is created in the user's browser. However, as it depends heavily on the browser, the library is not meant to be used in Node.js. It can't circumvent any browser content policy restrictions, and rendering cross-origin content requires a proxy to bring the content to the same origin.

Downloads: 17 This Week

Last Update: 2023-09-07
See Project
8

Tesseract.js

A pure JavaScript multilingual OCR

Tesseract.js is a pure JavaScript port of the popular Tesseract OCR engine. The library supports more than 100 languages, automatic text orientation and script detection, and a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser or on a server with Node.js. It is a JavaScript library that gets words in almost any language out of images. The main Tesseract.js functions (e.g. recognize, detect) take an image parameter, which should be something image-like. What counts as "image-like" differs depending on whether the library is run from the browser or through Node.js.

Downloads: 15 This Week

Last Update: 2024-02-26
See Project
9

openalpr

Automatic license plate recognition library

Deploy license plate and vehicle recognition with Rekor’s OpenALPR suite of solutions designed to provide invaluable vehicle intelligence which enhances business capabilities, automates tasks, and increases overall community safety! Rekor’s OpenALPR suite of solutions utilizes artificial intelligence and machine learning to greatly surpass legacy OCR solutions. Now, in real-time, users can receive a vehicle's plate number, make, model, color, and direction of travel. Rekor’s OpenALPR suite of solutions allows law enforcement and homeowners to protect their communities, while businesses can boost customer loyalty by receiving alerts the moment a plate of interest is detected. Rekor’s OpenALPR suite of solutions is a force multiplier. Rekor Scout™ upgrades nearly any IP, traffic, or security camera to give you an immediate edge, while Rekor CarCheck analyzes vehicle images and returns valuable data for countless business use-cases.

Downloads: 11 This Week

Last Update: 2021-06-08
See Project
10

Convert-Screenshot-To-Text

Note: by default, three languages are selected. If you only need to recognize English, please select only English. No installation is required; it is ready to use as soon as you open it. I have made a major upgrade to CSTT this time, including support for all Tesseract-supported languages, improved OCR accuracy, multiple recognition modes, keyboard shortcuts for canvas movement and zooming, and user-adjustable OCR settings. If you like it, please support me. Author: A_A. Email: A_A_kent_leung@hotmail.com. Donation: Buy Me a Coffee (https://www.buymeacoffee.com/AAkent), Patreon (patreon.com/A_A_KENT), PayPal (https://www.paypal.com/paypalme/AAKENT).

1 Review

Downloads: 719 This Week

Last Update: 2023-04-05
See Project
11

Face Alignment

2D and 3D face alignment library built using PyTorch

Detect facial landmarks from Python using the world's most accurate face alignment network, capable of detecting points in both 2D and 3D coordinates. Built using FAN's state-of-the-art deep learning-based face alignment method. For numerical evaluations, it is highly recommended to use the Lua version, which uses models identical to the ones evaluated in the paper. More models will be added soon. By default, the package uses the SFD face detector. However, users can alternatively use dlib, BlazeFace, or pre-existing ground truth bounding boxes. While not required, for optimal performance (especially for the detector) it is highly recommended to run the code on a CUDA-enabled GPU. The work is presented here as a black box; if you want to know more about the intricacies of the method, please check the original paper, either on arXiv or on my webpage.

Downloads: 5 This Week

Last Update: 2023-08-16
See Project
12

ncnn

High-performance neural network inference framework for mobile

ncnn is a high-performance neural network inference computing framework designed specifically for mobile platforms. It brings artificial intelligence right to your fingertips with no third-party dependencies, and runs faster than all other known open source frameworks on mobile phone CPUs. ncnn allows developers to easily deploy deep learning models to mobile platforms and create intelligent apps. It is cross-platform and supports most commonly used CNN networks, including classical CNNs (VGG, AlexNet, GoogleNet, Inception), face detection (MTCNN, RetinaFace), segmentation (FCN, PSPNet, UNet, YOLACT), and more. ncnn is currently used in a number of Tencent applications, namely QQ, Qzone, WeChat, and Pitu.

Downloads: 5 This Week

Last Update: 2024-04-10
See Project
13

Jimp

An image processing library written entirely in JavaScript for Node

An image processing library for Node written entirely in JavaScript, with zero native dependencies. If you're using this library with TypeScript, the method of importing differs slightly from JavaScript: instead of using require, you must import it with the ES6 default import syntax. If you're using a web bundler (webpack, rollup, parcel), you can benefit from the module build of jimp, which lets your bundler understand your code better and exclude things you aren't using. If you're using webpack, you can set process.browser to true and your build of jimp will exclude certain parts, making it load faster. The static Jimp.read method takes the path to a file, a URL, dimensions, a Jimp instance, or a buffer, and returns a Promise. In some cases, you need to pass additional parameters with an image's URL.

Downloads: 3 This Week

Last Update: 2024-02-23
See Project
14

Mozilla JPEG Encoder Project

Improved JPEG encoder

MozJPEG improves JPEG compression efficiency, achieving higher visual quality and smaller file sizes at the same time. It is compatible with the JPEG standard and the vast majority of the world's deployed JPEG decoders. MozJPEG is compatible with the libjpeg API and ABI and is intended to be a drop-in replacement for libjpeg. MozJPEG is a strict superset of libjpeg-turbo's functionality. All of MozJPEG's improvements can be disabled at run time, in which case it behaves exactly like libjpeg-turbo. MozJPEG is meant to be used as a library in graphics programs and image processing tools. We include a demo cjpeg command-line tool, but it's not intended for serious use. We encourage authors of graphics programs to use libjpeg's C API and link with the MozJPEG library instead. Features include progressive encoding with "jpegrescan" optimization, which can be applied to any JPEG file (with jpegtran) to losslessly reduce file size.

Downloads: 3 This Week

Last Update: 2022-08-15
See Project
15

NSFWJS

Client-side indecent content checking powered by TensorFlow.js

NSFWJS is a simple JavaScript library that can quickly and quite accurately identify NSFW images, all in the client's browser. It is powered by TensorFlow.js and the NSFW detection model, and delivers around 90% accuracy, a figure that keeps improving. NSFWJS classifies images with percentages under five categories: drawing and neutral, which are both safe for work; sexy, which includes sexually explicit images; and hentai and porn, which are pornographic drawings and images. NSFWJS offers a 'browserified' version and an NSFW filter web extension that filters out NSFW images from your browser, and it also has a separate React Native app.

Downloads: 2 This Week

Last Update: 2024-03-06
See Project
16

ARKit + CoreLocation

Combines the high accuracy of AR with the scale of GPS data

ARKit uses camera and motion data to map out the local world as you move around. CoreLocation uses Wi-Fi and GPS data to determine your global location, with a low degree of accuracy. ARKit + CoreLocation combines the high accuracy of AR with the scale of GPS data. The potential for combining these technologies is huge, with applications across many different areas. The library allows items to be placed within the AR world using real-world coordinates. It also offers dramatically improved location accuracy by combining recent location data points with knowledge about movement through the AR world. The improved location accuracy is currently in an “experimental” phase, but could be the most important component. The library and demo come with a number of additional configuration features. It’s all fully documented, so be sure to have a look around.
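Placing an item in a local AR scene from real-world coordinates comes down to turning a latitude/longitude pair into a metric offset from the user's position. The sketch below shows one simple way to do that, an equirectangular approximation that is reasonable over short distances. It is an illustrative assumption written in Python, not the library's own Swift implementation, and the function name and Earth radius constant are chosen for the example.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius in meters (approximation)


def geo_to_local_offset(origin_lat, origin_lon, target_lat, target_lon):
    """Return (east, north) in meters from the origin to the target.

    Uses an equirectangular approximation: fine for nearby points,
    increasingly inaccurate over long distances or near the poles.
    """
    lat0 = math.radians(origin_lat)
    d_lat = math.radians(target_lat - origin_lat)
    d_lon = math.radians(target_lon - origin_lon)
    east = EARTH_RADIUS_M * d_lon * math.cos(lat0)
    north = EARTH_RADIUS_M * d_lat
    return east, north


if __name__ == "__main__":
    # A point 0.001 degrees north of the origin is roughly 111 m away.
    print(geo_to_local_offset(0.0, 0.0, 0.001, 0.0))
```

The resulting east and north offsets could then be mapped onto the horizontal axes of the AR scene once its heading relative to true north is known.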

Downloads: 1 This Week

Last Update: 2021-11-01
See Project
17

Image Crop Picker

iOS/Android image picker with support for camera, video, etc.

Image Crop Picker is an iOS/Android image picker with support for camera, video, configurable compression, multiple images, and cropping. The module creates temporary images, which are cleaned up automatically at some later point. If you want to force cleanup, you can use clean to remove all temporary files, or cleanSingle(path) to remove a single temporary file. Some of these types may not be available on all iOS versions.

Downloads: 1 This Week

Last Update: 2024-01-22
See Project
18

libfacedetection

Library for face detection in images

This is an open source library for CNN-based face detection in images. The CNN model has been converted to static variables in C source files. The source code does not depend on any other libraries; all you need is a C++ compiler. You can compile the source code under Windows, Linux, ARM, and any platform with a C++ compiler. SIMD instructions are used to speed up detection: you can enable AVX2 if you use an Intel CPU, or NEON for ARM. The model file is also provided in the directory ./models/. The files examples/detect-image.cpp and examples/detect-camera.cpp show how to use the library. The model was trained with libfacedetection.train. You can copy the files in the directory src/ into your project and compile them like the other files in your project. The source code is written in standard C/C++ and should compile on any platform that supports C/C++.

Downloads: 1 This Week

Last Update: 2021-09-24
See Project
19

pixelmatch

The smallest, simplest JavaScript pixel-level image comparison library

The smallest, simplest, and fastest JavaScript pixel-level image comparison library, originally created to compare screenshots in tests. It features accurate anti-aliased pixel detection and perceptual color difference metrics. Inspired by Resemble.js and Blink-diff. Unlike these libraries, pixelmatch is around 150 lines of code, has no dependencies, and works on raw typed arrays of image data, so it's blazing fast and can be used in any environment (Node or browsers). It compares two images, writes the output diff, and returns the number of mismatched pixels.
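The compare, write diff, and count workflow can be illustrated with a deliberately naive sketch over flat RGBA byte arrays. It flags a pixel when any channel differs by more than a threshold; it does not reproduce pixelmatch's perceptual color metric or its anti-aliasing detection, and the function name, parameters, and diff colors are assumptions made for the example.

```python
def count_mismatched_pixels(img1, img2, width, height, threshold=0):
    """Compare two flat RGBA buffers and return (mismatch_count, diff_buffer).

    Each buffer holds width * height * 4 bytes. Mismatched pixels are
    painted red in the diff buffer; matching pixels are painted white.
    """
    expected = width * height * 4
    if len(img1) != expected or len(img2) != expected:
        raise ValueError("image buffers do not match the given dimensions")

    diff = bytearray(expected)
    mismatched = 0
    for i in range(0, expected, 4):
        # Largest per-channel difference for this pixel (R, G, B, A).
        delta = max(abs(img1[i + c] - img2[i + c]) for c in range(4))
        if delta > threshold:
            mismatched += 1
            diff[i:i + 4] = bytes((255, 0, 0, 255))
        else:
            diff[i:i + 4] = bytes((255, 255, 255, 255))
    return mismatched, diff


if __name__ == "__main__":
    a = bytes([0, 0, 0, 255, 10, 10, 10, 255])   # 2x1 image
    b = bytes([0, 0, 0, 255, 200, 10, 10, 255])  # second pixel differs
    count, _ = count_mismatched_pixels(a, b, width=2, height=1)
    print(count)  # 1
```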

Downloads: 1 This Week

Last Update: 2022-04-25
See Project
20

scikit-image

Image processing in Python

scikit-image is a collection of algorithms for image processing. It is available free of charge and free of restriction. We pride ourselves on high-quality, peer-reviewed code, written by an active community of volunteers. scikit-image builds on scipy.ndimage to provide a versatile set of image processing routines in Python. This library is developed by its community, and contributions are most welcome! Read about our mission, vision, and values and how we govern the project. Major proposals to the project are documented in SKIPs. The scikit-image community consists of anyone using or working with the project in any way. A community member can become a contributor by interacting directly with the project in concrete ways.

Downloads: 1 This Week

Last Update: 6 days ago
See Project
21

Image To Text tools

ITTT is a free tool designed to scan and extract text from images.

Image To Text Tools is a 100% free, user-friendly tool designed to scan images and extract the text they contain into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.

Downloads: 13 This Week

Last Update: 2024-02-21
See Project
22

Img2Txt

Img2Txt - Extract Text From Images using AI

Important: if you are sharing this program, please include the official download link. What is Img2Txt? Img2Txt is a Python-based application, packaged using PyInstaller, that uses pytesseract, an AI-powered optical character recognition (OCR) library, to extract text from images and convert it into plain text. The application features a simple, modern, user-friendly interface created using customtkinter, allowing users to easily process images and obtain the text within them. Support me at https://www.buymeacoffee.com/zsynctic; it will motivate me to create more projects. Support: for any questions or issues, please open an issue on the Img2Txt GitHub repository. Warning: when running Img2Txt.exe, a blue window might pop up. To run the application, press More info and then Run anyway. © zSynctic

1 Review

Downloads: 4 This Week

Last Update: 2023-08-15
See Project
23

DeepImageTranslator

DeepImageTranslator: a deep-learning utility for image translation

Created by: Run Zhou Ye, En Zhou Ye, and En Hui Ye. DeepImageTranslator is a free, user-friendly tool for image translation using deep learning, with applications in CT image analysis. Citation: please cite this software as: Ye RZ, Noll C, Richard G, Lepage M, Turcotte ÉE, Carpentier AC. DeepImageTranslator: a free, user-friendly graphical interface for image translation using deep-learning and its applications in 3D CT image analysis. SLAS Technology. 2022 Feb 1;27(1):76-84. https://doi.org/10.1016/j.slast.2021.10.014

Downloads: 3 This Week

Last Update: 2022-11-16
See Project
24

Exadel CompreFace

Leading free and open-source face recognition system

Exadel CompreFace is a free and open-source face recognition GitHub project. Essentially, it is a Docker-based application that can be used as a standalone server or deployed in the cloud. You don’t need prior machine learning skills to set up and use CompreFace. The system provides a REST API for face recognition, face verification, face detection, face mask detection, landmark detection, and age and gender recognition. The solution also features a role management system that allows you to easily control who has access to your Face Recognition Services. CompreFace is delivered as a docker-compose config and supports different models that work on CPU and GPU. Our solution is based on state-of-the-art methods and libraries like FaceNet and InsightFace. Official website: https://exadel.com/solutions/compreface/ GitHub link: https://github.com/exadel-inc/CompreFace

Downloads: 3 This Week

Last Update: 2023-08-22
See Project
25

AiHound

AI-powered image classification for nudity and documents / ID cards

AI Hound is designed to run from a USB pen drive or any other kind of removable, writable media. The program checks all Office documents, images, and videos against various image categories. Currently it can recognize nudity/porn and scanned or photographed documents, ID cards, and credit cards. I am working on a model that also recognizes various types of drugs in images.

Downloads: 2 This Week

Last Update: 2023-04-20
See Project