Category: Artificial Intelligence
Comma AI
[openpilot] is open source software built to improve upon the existing driver assistance in most new cars on the road today.
TensorFlow ImageScaleUp
I started a TensorFlow notebook [TF_ImageScaleUp] to experiment with AI to deblur images.

Deep Convolutional Generative Adversarial Network
Learn PyTorch for deep learning in a day. Literally.
TensorFlow Developer Certificate in 2022: Zero to Mastery – Code Repository
Seamless textures from a single image: style transfer and SGAN
Whisper Speech Recognition
[Whisper] is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
[AI model from OpenAI automatically recognizes speech and translates it to English]
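A minimal sketch of the transcription and translation tasks described above, assuming the open-source `openai-whisper` Python package and ffmpeg are installed. The `base` model size, the file name `talk.mp3`, and the helper names `format_timestamp`, `transcribe` and `print_segments` are placeholders chosen for illustration, not anything taken from the linked posts.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as HH:MM:SS.mmm."""
    millis = int(round(seconds * 1000))
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{millis:03d}"


def transcribe(path: str, model_name: str = "base", translate: bool = False) -> dict:
    """Run Whisper on an audio file and return its result dictionary.

    With translate=True the model is asked to output English text
    instead of a transcript in the spoken language.
    """
    # Imported lazily so format_timestamp works without the package installed.
    import whisper

    model = whisper.load_model(model_name)
    task = "translate" if translate else "transcribe"
    return model.transcribe(path, task=task)


def print_segments(result: dict) -> None:
    """Print each recognized segment with its start and end time."""
    for seg in result["segments"]:
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        print(f"[{start} -> {end}] {seg['text'].strip()}")


# Example usage (hypothetical file name):
#   result = transcribe("talk.mp3")
#   print("Detected language:", result["language"])
#   print_segments(result)
```

The first call downloads the model weights, so expect it to take noticeably longer than later runs.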
I tried to build a ML Text to Image App with Stable Diffusion in 15 Minutes

I commented out the following lines in requirements.txt:
#torch==1.12.1+cu113
#torchaudio==0.12.1+cu113
#torchvision==0.13.1+cu113
Run the following commands from the Conda terminal:
conda install pytorch==1.12.0 torchvision==0.13.0 torchaudio==0.12.0 cudatoolkit=11.3 -c pytorch
py -m pip install --upgrade pip
pip install -r requirements.txt
pip install customtkinter diffusers transformers ftfy
python app.py
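The manual edit to requirements.txt can also be scripted. This is a rough sketch using only the Python standard library; the function name `comment_out` and the default package set are my own choices for illustration, and it assumes one requirement per line.

```python
import re
from typing import Iterable


def comment_out(requirements_text: str,
                packages: Iterable[str] = ("torch", "torchaudio", "torchvision")) -> str:
    """Return the requirements text with the given packages commented out."""
    wanted = {p.lower() for p in packages}
    out = []
    for line in requirements_text.splitlines():
        # The package name is whatever comes before the first version
        # operator, extras bracket, environment marker, or space.
        name = re.split(r"[=<>!~\[; ]", line.strip(), maxsplit=1)[0].lower()
        if name in wanted:
            line = "#" + line
        out.append(line)
    return "\n".join(out) + ("\n" if out else "")


# Example usage, rewriting the file in place:
#   from pathlib import Path
#   path = Path("requirements.txt")
#   path.write_text(comment_out(path.read_text()))
```

Lines that are already commented out are left alone, because their first token starts with `#` and so never matches a package name.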

TensorFlow DataSets
[TensorFlow Datasets] (TFDS) provides a collection of ready-to-use datasets for use with TensorFlow, Jax, and other Machine Learning frameworks. It handles downloading and preparing the data deterministically and constructing a tf.data.Dataset (or np.array).
The [TensorFlow Datasets Catalog] has numerous datasets available through the API.
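As a small sketch of what loading a catalog dataset looks like, assuming the `tensorflow-datasets` package is installed: the `mnist` dataset name, the 80/20 split, and the helper names `split_spec` and `load_train_val` are illustrative choices, not something prescribed by TFDS.

```python
def split_spec(split: str, start_pct: int, end_pct: int) -> str:
    """Build a TFDS split slice string such as 'train[0%:80%]'."""
    if not 0 <= start_pct < end_pct <= 100:
        raise ValueError("expected 0 <= start_pct < end_pct <= 100")
    return f"{split}[{start_pct}%:{end_pct}%]"


def load_train_val(name: str = "mnist", train_pct: int = 80):
    """Load a dataset and carve a validation set out of its train split."""
    # Imported lazily so split_spec can be used without TFDS installed.
    import tensorflow_datasets as tfds

    (train_ds, val_ds), info = tfds.load(
        name,
        split=[
            split_spec("train", 0, train_pct),
            split_spec("train", train_pct, 100),
        ],
        as_supervised=True,  # yield (image, label) tuples instead of dicts
        with_info=True,      # also return the dataset metadata
    )
    return train_ds, val_ds, info


# Example usage:
#   train_ds, val_ds, info = load_train_val()
#   print(info.features)
#   for image, label in train_ds.take(1):
#       print(image.shape, label.numpy())
```

The first call downloads and prepares the data; later calls reuse the local copy.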
Best A.I. Image Upscaler? Top 7 Software Compared!
Finding the Compute Compatibility Score for your GPU
ImageNet
[ImageNet] is an image database organized according to the WordNet hierarchy (currently only the nouns), in which each node of the hierarchy is depicted by hundreds and thousands of images. The project has been instrumental in advancing computer vision and deep learning research. The data is available for free to researchers for non-commercial use.
TensorFlow Hub
[TensorFlow Hub] is a repository of trained machine learning models ready for fine-tuning and deployable anywhere. Reuse trained models like BERT and Faster R-CNN with just a few lines of code.
The [tfhub.dev] portal lets you search and discover hundreds of trained, ready-to-deploy machine learning models in one place.
13 sick OpenAI Codex use cases I’ve seen so far
Dall-e 2 is stunning!
Sign Language Detection using ACTION RECOGNITION with Python | LSTM Deep Learning Model
TensorFlow Developer Certificate in 2022: Zero to Mastery
Stunning new AI “could be conscious” – with Elon Musk.
Deep Fake Images and Video with SimSwap
Tom’s robot double sells out
Latest SELFIE SEGMENTATION with 100+ FPS on CPU – MediaPipe and OpenCV Python
Real Time AI Face Landmark Detection in 20 Minutes with Tensorflow.JS and React
This is the best tutorial on using face landmark detection that I’ve seen so far.
Build the site with:
npm run build
Building AI-Powered Virtual Collaboration and Content Creation Solutions with NVIDIA Maxine
[Alex Qi] Eye contact (coming soon): Simulates eye contact by estimating and aligning gaze with the camera.
Eye tracking is scheduled for release in Q2/Q3. [Maxine Early Access]
Look at:
Face landmark tracking
Face tracking Sample
Descript
[Descript] automatically transcribes and overdubs your audio recordings for quick editing and removing mistakes from your presentations.
[Feature Request] to add webcam support.