This is the multi-page printable view of this section. Click here to print.

Return to the regular view of this page.

Research

1: Reinforcement Learning
2: PhD How To
3: Digital Twin News
4: Generative Machine Learning GAN
5: Awesome Research and Academic Writing Assistant Tools
6: Awesome Science Blogs and Forums
7: Paper and Article Search Tools
8: Awesome Conferences Tools
9: Machine Learning Teaching
10: Machine Learning News and Blog
11: Hardware for Machine Learning
12: Awesome List of Dataset
13: Sign Language Recognition
14: Time Series Forecasting
15: Soft Sensor
16: Historical Colorization
17: Interesting Machine Learning Papers
18: Machine Learning Tools
19: Awesome List of Machine Learning Visualization
20: Machine Learning in Sports
21: Machine Learning for Image Processing
22: Machine Learning for Health
23: Machine Learning in Agriculture
24: Machine Learning for Earth Science
25: Machine Learning SOTA and Model Zoo
26: Machine Learning Metrics
27: Awesome NLP Projects
28: Awesome Latex Apps
29: Awesome Scientific Journal Tools
30: Safety Helmet Detection
31: Emotion Detection with Machine Learning
32: Face Mask Detection with Machine Learning
33: CT-Scan for Covid Classification using Machine Learning
34: Object Detection
35: Face Expression and Detection
36: Awesome Google Colab Notebooks
37: Arxiv, Paper Preprint, and Curated Paper Sites
38: Robotic Simulator
39: Machine Learning for Sport Pose Analysis
40: Machine Learning for Satellite Images
41: NLP Models
42: NLP for Bahasa Indonesia
43: NLP with GPT
44: Digital Twin
45: Awesome Jupyter Notebooks
46: NCBI Papers with Code

Research

1 - Reinforcement Learning

Reinforcement Learning

Reinfocement Learning Course

Algorithm

reinforcement learning algorithms This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO.

AI4Finance-Foundation/FinRL: FinRL: The first open-source project for financial reinforcement learning. Please star. 🔥

2 - PhD How To

PhD How To

3 - Digital Twin News

Digital Twin News

Digital Twin Policy

Digital Twin News

Digital Twin Tools

Digital Twin DIY

IoT Digital Twin with Roblox and The Things Network

Digital Twin on Github

Digital Twin Research Group

Computational Modelling Group: Knowledge graphs
Digital Twin | Open Access Publishing Platform | Digital Twin Fast publication and open peer review for all research related to digital twin technologies.
Interreg - EMR Digital Twin Academy Project
- Technology Sheets - EMR DTA
Home - Digital Twin Research
Digital Twin
AU Centre for Digital Twins
EAISI Digital Twin Lab
Scientific machine learning and data-driven model reduction for a Predictive Digital Twin
Digital Twinning | TNO
Doctoral project: Digital twin developments within Volvo CE | lnu.se
IBM Digital Twin Exchange - Overview - Indonesia | IBM

4 - Generative Machine Learning GAN

Generative Machine Learning GAN

GAN

The dataset was web-scraped for an original 20k samples, then a custom MRCNN model was trained for image segmentation and cropping before being fed into the 128 DCGAN, trained on local hardware, 1660

Collections of GANs : Pytorch implementation of unsupervised GANs.

Generative Model Course

GAN Course

GAN Project/Paper

NVlabs/stylegan3: Official PyTorch implementation of StyleGAN3
AI Demos | NVIDIA Research GAN
HyperStyle StyleGAN Inversion
Pollinations.AI

Style GAN

Paper

GAN Image Superresolution

GAN

Making Anime Faces With StyleGAN · Gwern.net

Style-GAN

Research

An enhanced 3D model and generative adversarial network for automated generation of horizontal building mask images and cloudless aerial photographs - ScienceDirect

Research

Realless Generative webs with blinking eyes

Music Generation

Video Generation

Hyperchroma A music player that creates real-time music videos (hyperchroma.app)

Colorization

Pix2pix vs CycleGAN
- Cycle-GAN can work in an ‘unpaired’ manner and various architectural differences. Unpaired image translation is much harder, as you demand the model to learn objects of different shapes, size, angle, texture, location in different scenes and settings on top of the actual task (coloring in this case). Requires more data and you don’t have fine control on the learning. Formulating the coloring problem as a paired task makes more sense as you simply decrease the complexity of the problem without increasing data collection/annotation work.
- The whole point about using CycleGAN is that it can learn in unpaired situations. And it works well in the context of style transfer tasks where the changes are really bold and less nuanced. But, in the context of image colorization, the changes are really subtle and also there are way more options to choose colors than changing a horse to zebra. The other thing is that learning to change a colored image to black and white is much easier for the model than learning to colorize it which can lead to a bad learning procedure.
- The most prominent differences is that CycleGAN helps when you have unpaired images and you want to go from one class to the other (Horse to Zebra for example) but in the Pix2Pix paper, the images that you get after the inference, are the input images but with some new features (black&white to colorized or day time to night time of a scene). In pix2pix, a conditional GAN (one generator and one discriminator) is used with some supervision from L1 loss. In CycleGAN, you need two generators and two discriminators to do the task: one pair for going from class A to B and one pair for going from class B to A. Also you need Cycle Consistency Loss to make sure that the models learn to undo the changes they make.
pix2pixhd is pix2pix in higher resolution
pix2pix tutorial and example:

Machine Learning: Image Generator

GAN

Gradio demo for GFP-GAN: Towards Real-World Blind Face Restoration with Generative Facial Prior

Generation

saharmor/dalle-playground: A playground to generate images from any text prompt using DALL-E Mini and based on OpenAI’s DALL-E https://openai.com/blog/dall-e/

Lightboard

5 - Awesome Research and Academic Writing Assistant Tools

Awesome Research and Academic Writing Assistant Tools

Chrome Extension and Script

Multitools

Lazy Scholar - Chrome Web Store

Arxiv

PubMed and Google Scholar

Code

Find Code for Research Papers - CatalyzeX - Chrome Web Store

Jupyter Notebook

Citation Manager

Paper Finder

MS Word Add-ons

Citation Manager

Zotero
Mendeley

Grammar

Grammarly
Writely
LanguageTools
WriteBetter
Power Thesaurus

Research

Keenious
Data Finder

Reference Tools

Anystyle : parse academic references
Citation Finder : parse academic references
- Citation Finder (Old Version)
Makebib

Article/Paper Summarization

allenai/scholarphi: An interactive PDF reader.: paper reading with explanation and definiton
- example: https://s2-reader.apps.allenai.org/?file=https://arxiv.org/pdf/2009.12303v4.pdf
- https://s2-reader.apps.allenai.org/?file=%arxivfilepdf
TL;DR Now: paper searching and summarization

Article/Paper Graph and Relationship

Connected Papers : Explore connected papers in a visual graph
Inciteful : Inciteful builds a network of academic papers based on a topic of your choice and then analyzes the network to help you find the most relevant literature.
scite: Smart citations for better research
Paperscape
Paperscape Graph
PubPeer - Search publications and join the conversation.
ResearchHub | Open Science Community
Stateoftheart AI
Open Research Knowledge Graph
Publish or Perish (Desktop App)

Article/Paper Reading

Ten Simple Rules for Reading a Scientific Paper

Article/Paper Exploration

Litmaps

Article/Paper Reading Tools

Web Annotation

Research Open Data

List of Researcher and Academia Tools

Research/Academia Forums

Research Code
DeepAI: The front page of A.I. | DeepAI AI Forum

Testing

Generic Paper Search

digital twin - Google Scholar ⭐⭐ (features: time filter, two-variable sorting)
digital twin | Semantic Scholar ⭐⭐⭐ (features: multi-variable filter, multi-variable sorting)
SciSpace | Discover research findings in their entirety ⭐⭐ (features: multi-variable filter, multi-variable sorting)
digital twin in Publications - Dimensions ⭐⭐⭐ (features: multi-variable filter, multi-variable sorting)
Zenodo ⭐⭐ (features: multi-variable filter, multi-variable sorting)

Paper Graph Search

digital twin | Connected Papers Search ⭐⭐⭐ (features: auto-generated citation graph, prior-derivative work)
Search for Papers | Inciteful.xyz ⭐⭐⭐ (features: auto-generated citation graph, similar paper, top-related paper, top authors, top journals)
Elicit | The AI Research Assistant ⭐⭐ (features: writing assistant with language models)
Search - ORKG ⭐⭐⭐ (features: comparison paper, paper search, )
Open Knowledge Maps - A visual interface to the world’s scientific knowledge

Paper Graph Writing Assistant

Litmaps ⭐⭐ features: auto-generated citation graph, seed)
Research Rabbit ⭐⭐⭐
Paperscape ⭐⭐
Iris.ai - Your Science Assistant

6 - Awesome Science Blogs and Forums

Awesome Science Blogs and Forums

7 - Paper and Article Search Tools

Paper and Article Search Tools

Article Search Flowchart

Guide to Finding Articles/Books - Google Docs

Article/Paper Search Tools

Thesis Search

Paper Search Engine (Sci-hub)

sci-hub.ee or sci-hub.st or sci-hub.se or sci-hub.do or sci-hub.ai
sci-hub at hns
scimag or scimag or scimag
telegram
Love Science,Love Sci-Hub! – The latest Sci-Hub working domain

Article

8 - Awesome Conferences Tools

Awesome Conferences Tools

AI Journals

9 - Machine Learning Teaching

Machine Learning Teaching

Teaching Deep Learning

Note by P Migdal

Machine Learning : Visual Coding

💡 : Kid’s machine learning tutorial

Machine Learning on Spreadsheet

Magicsheets: Machine Learning in your spreadsheet
Prediction Labs
mljar-supervised · PyPI Automated Machine Learning Python package that works with tabular data
Magicsheets: Machine Learning in your spreadsheet

Machine Learning Visualization

Experiments with Google

Machine Learning

The First Rule of Machine Learning: Start without Machine Learning
Machine Learning: The High Interest Credit Card of Technical Debt – Google Research So little of success in ML comes from the sexy algorithms and so much just comes from ensuring a bunch of boring details get properly saved in the right place. After months learning about machine learning for time series forecasting, several chapters in a book on deep learning techniques for time series analysis and forecasting, the author kindly pointed out that there are no papers published up to that point that prove deep learning (neural networks) can perform better than classical statistics. Career lesson: Ask a lot of questions early in a project’s life. If you’re working on something that uses machine learning, ask what system it’s replacing, and make sure that someone (or you) runs it manually before spending the time to automate.

Rules of Machine Learning: | ML Universal Guides | Google Developers

Machine Learning

10 - Machine Learning News and Blog

Machine Learning News and Blog

Machine Learning News Update

Articles about Machine Learning

Review about ML

11 - Hardware for Machine Learning

Hardware for Machine Learning

Hardware for Deep Learning

TPU - GPU

Turning TPU into GPU (mean: it compiles Pytorch to work on a TPU). PyTorchXLA converts the TPUv3-8 hardware into a GPU so you can use it with PyTorch as a normal GPU. TPUv3-8 which is part of free access from Google Colab can give a computation power that is equivalent to 8 V100 Tesla GPU and possibly 6 3090RTX GPU. info is here. TPUs are ~5x as expensive as GPUs ($1.46/hr for a Nvidia Tesla P100 GPU vs $8.00/hr for a Google TPU v3 vs $4.50/hr for the TPUv2 with “on-demand” access on GCP).
We recommend CPUs for their versatility and for their large memory capacity. GPUs are a great alternative to CPUs when you want to speed up a variety of data science workflows, and TPUs are best when you specifically want to train a machine learning model as fast as you possibly can. In Google Colab, CPU types vary according to variability (Intel Xeon, Intel Skylake, Intel Broadwell, or Intel Haswell CPUs). GPUs were NVIDIA P100 with Intel Xeon 2GHz (2 core) CPU and 13GB RAM. TPUs were TPUv3 (8 core) with Intel Xeon 2GHz (4 core) CPU and 16GB RAM).

Free TPU

TensorFlow Research Cloud - Free TPU : Accelerate your cutting-edge machine learning research with free Cloud TPUs.

AI/ML Cloud Computing

Machine Learning

SpotML - Cut ML Training Costs by up to 70%

AI Platform

Cloud-CV/EvalAI: Evaluating state of the art in AI
nidhaloff/igel at v0.4.0 machine learning tool that allows you to train, test, and use models without writing code

12 - Awesome List of Dataset

Awesome List of Dataset

Dataset

Art Dataset

Art Data — Artnome

Dataset

Drug Dataset

DrugBank Online | Database for Drug and Drug Target Info
- Muler, PizzaMyHeart/muler: A search engine for drug information built with Flask.

Dataset Zoo

Deeplite/deeplite-torch-zoo Pytorch

Dataset

Recommended Data Repositories | Scientific Data

Dataset

Dataset Search

Dataset

Dataset Tools

Scale AI: The Data Platform for AI : High quality training and validation data for AI applications
Aquarium | Data Management For ML : ML data management platform
Labelbox: The leading training data platform for data labeling : Save time by creating and managing your training data, people, and processes in a single place

Cell Tower Dataset

Twitter Dataset

Hedonometer

Dataset

EleutherAI EleutherAI is a grassroots AI research group aimed at democratizing and open sourcing AI research.
The Pile : The Pile is a 825 GiB diverse, open source language modelling data set that consists of 22 smaller, high-quality datasets combined together.

13 - Sign Language Recognition

Sign Language Recognition

List of Project

Video-based

with NLP

Sign Language Web

Fingerspelling with Machine Learning

Sign Language Tutor

ssaamm/sign-language-tutor: The fun way to learn sign language

Sign Language Vocalization

ardamavi/Vocalize-Sign-Language: Vocalization sign language with deep learning.

Inverse ( … to Sign Language)

Image-based

Bahasa Isyarat Indonesia

Bahasa Isyarat Indonesia (BISINDO)
- dianggap sebagai bahasa yang bisa mewakili budaya tuli Indonesia
Sistem Isyarat Bahasa Indonesia (SIBI)
- adopsi dari American Sign Language (ASL)
- dipakai di SLB
Klobility - BISINDO dan SIBI: Apa Bedanya?

14 - Time Series Forecasting

Time Series Forecasting

LSTM for Time Series Forecasting

Univariate LSTM Models : one observation time-series data, predict the next value in the sequence
Multivariate LSTM Models : two or more observation time-series data, predict the next value in the sequence
- Multiple Input Series : two or more parallel input time series and an output time series that is dependent on the input time series
- Multiple Parallel Series : multiple parallel time series and a value must be predicted for each
Univariate Multi-Step LSTM Models : one observation time-series data, predict the multi step value in the sequence prediction.
Multivariate Multi-Step LSTM Models : two or more observation time-series data, predict the multi step value in the sequence prediction.
- Multiple Input Multi-Step Output.
- Multiple Parallel Input and Multi-Step Output.

Machine Learning for Multivariate Input

Statistical Method for Multivariate Input

Machine Learning for Univariate Input

Time Series Forecasting with the Long Short-Term Memory Network in Python : LSTM
rishikksh20/LSTM-Time-Series-Analysis: Using LSTM network for time series forecasting Dataset: Sunspot Zurich, Model: LSTM
sagarmk/Forecasting-on-Air-pollution-with-RNN-LSTM: Time Series Forecasting using LSTM in Keras. Dataset: Air Pollution, Model: LSTM
pushpendughosh/Stock-market-forecasting: Forecasting directional movements of stock prices for intraday trading using LSTM and random forest Dataset: Stock Market, Model: LSTM, RF
deshpandenu/Time-Series-Forecasting-of-Amazon-Stock-Prices-using-Neural-Networks-LSTM-and-GAN-: Project analyzes Amazon Stock data using Python. Feature Extraction is performed and ARIMA and Fourier series models are made. LSTM is used with multiple features to predict stock prices and then sentimental analysis is performed using news and reddit sentiments. GANs are used to predict stock data too where Amazon data is taken from an API as Generator and CNNs are used as discriminator. Dataset: Amazon Stock Model: LSTM with addition
demmojo/lstm-electric-load-forecast: Electric load forecast using Long-Short-Term-Memory (LSTM) recurrent neural network Dataset: Electric Consumption Model: LSTM
Yifeng-He/Electric-Power-Hourly-Load-Forecasting-using-Recurrent-Neural-Networks: This project aims to predict the hourly electricity load in Toronto based on the loads of previous 23 hours using LSTM recurrent neural network. Dataset: Electricity Consumption Model: LSTM
Yongyao/enso-forcasting: Improving the forecasting accuracy of ENSO through deep learning Dataset: ENSO El Nino, Model: LSTM
EsmeYi/time-series-forcasting: Using K-NN, SVM, Bayes, LSTM, and multi-variable LSTM models on time series forecasting Dataset: Sensor, Model: LSTM
CynthiaKoopman/Forecasting-Solar-Energy: Forecasting Solar Power: Analysis of using a LSTM Neural Network Dataset: Solar power, Model: LSTM
3springs/attentive-neural-processes: implementing “recurrent attentive neural processes” to forecast power usage (w. LSTM baseline, MCDropout) Dataset: English power consumption, Model: ANP-RNN “Recurrent Attentive Neural Process for Sequential Data”, ANP: Attentive Neural Processes, NP: Neural Processes, LSTM
Housiadas/forecasting-energy-consumption-LSTM: Development of a machine learning application for IoT platform to predict electric energy consumption in smart building environment in real time., Dataset: Kaggle energy consuption, Model: LSTM, Seq2Seq

Statistical Method for Univariate Input

Time Series Forecasting — ARIMA, LSTM, Prophet with Python | by Caner Dabakoglu | Medium : LSTM, ARIMA, Prophet
pyaf/load_forecasting: Load forcasting on Delhi area electric power load using ARIMA, RNN, LSTM and GRU models Dataset: Electricity, Model: Feed forward Neural Network FFNN, Simple Moving Average SMA, Weighted Moving Average WMA, Simple Exponential Smoothing SES, Holts Winters HW, Autoregressive Integrated Moving Average ARIMA, Recurrent Neural Networks RNN, Long Short Term Memory cells LSTM, Gated Recurrent Unit cells GRU, Type: Univariate
jiegzhan/time-series-forecasting-rnn-tensorflow: Time series forecasting Dataset: Daily Temperature, Model: LSTM
zhangxu0307/time_series_forecasting_pytorch: time series forecasting using pytorch，including ANN,RNN,LSTM,GRU and TSR-RNN，experimental code Dataset: Pollution, Solar Energy, Traffic data etec. Model MLP,RNN,LSTM,GRU, ARIMA, SVR, RF and TSR-RNN
rakshita95/DeepLearning-time-series: LSTM for time series forecasting Dataset: ?? Model: ARIMA, VAR, LSTM
mborysiak/Time-Series-Forecasting-with-ARIMA-and-LSTM Dataset: Olypic, LeBron, Zika, Model: ARIMA dan LSTM
stxupengyu/load-forecasting-algorithms: 使用多种算法（线性回归、随机森林、支持向量机、BP神经网络、GRU、LSTM）进行电力系统负荷预测/电力预测。通过一个简单的例子。A variety of algorithms (linear regression, random forest, support vector machine, BP neural network, GRU, LSTM) are used for power system load forecasting / power forecasting. Dataset: Power usage, Model: linear regression, random forest, support vector machine, BP neural network, GRU, LSTM
Abhishekmamidi123/Time-Series-Forecasting: Rainfall analysis of Maharashtra - Season/Month wise forecasting. Different methods have been used. The main goal of this project is to increase the performance of forecasted results during rainy seasons. Dataset: precipitation, Model: ARIMA, LSTM, FNN(Feed forward Neural Networks), TLNN(Time lagged Neural Networks), SANN(Seasonal Artificial Neural Networks

Jupyter Notebook Examples

Univariate ARIMA

import statsmodels

Univariate LSTM

import keras

Multivariate VAR

(Note: VAR should only for Stationary process - Wikipedia)

Multivariate LSTM

Prophet and Kats from Facebook

Note on Multivariate and Univariate

Software

Gekko Timeseries and Modeling Software - Timeseries handling, and solving of large-scale economic models

Other Time Series

Time Series Forecasting with Regression and LSTM | Paperspace Blog
Kats | Kats One stop shop for time series analysis in Python
chlubba/catch22: catch-22: CAnonical Time-series CHaracteristics
blue-yonder/tsfresh: Automatic extraction of relevant features from time series:

Precipitation Forecasting

Nowcasting, Upsampling, Interpolation, Super resolution
hydrogo/rainnet: RainNet: a convolutional neural network for radar-based precipitation nowcasting
1706.03458 Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model
- sxjscience/HKO-7: Source code of paper “NIPS2017 Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model”

Deep Learning for Forecasting

top open source deep learning for time series forecasting frameworks.

Gluon This framework by Amazon remains one of the top DL based time series forecasting frameworks on GitHub. However, there are some down sides including lock-in to MXNet (a rather obscure architecture). The repository also doesn’t seem to be quick at adding new research.
Flow Forecast This is an upcoming PyTorch based deep learning for time series forecasting framework. The repository features a lot of recent models out of research conferences along with an easy to use deployment API. The repository is one of the few repos to have new models, coverage tests, and interpretability metrics.
sktime dl This is another time series forecasting repository. Unfortunately it looks like particularly recent activity has diminished on it.
PyTorch-TS Another framework, written in PyTorch, this repository focuses more on probabilistic models. The repository isn’t that active (last commit was in November).

eBook Forecasting

Timeseries Forecasting

Timeseries Forecasting Book

Timeseries Forecasting Reading

Timeseries RNN

Spice.ai

15 - Soft Sensor

Soft Sensor

What is Soft Sensors

Soft-sensors: predictive models for sensor characteristic are called soft sensors Soft-sensors: software+sensor

Soft-sensor Categories

model-driven
- First Principle Models (FPM)
- extended Kalman Filter
data-driven
- Principle Component Analysis + regression model, Partial Least Squares
- Artificial Neural Networks
- Neuro-Fuzzy Systems
- Support Vector Machines

Soft-sensor Application

on-line prediction
- prediction of process variables which can be determined either at low sampling rates
- prediction of process variables which can be determined through off-line analysis only
- (statistical or soft computing supervised learning approaches)
process monitoring
- detection of the state of the process, usually by human
- observation and interpretation of the process state (based on univariate statistics) and experience of the operator
process fault detection
- detection of the state of the process

FPM

First Principle Models describe the physical and chemical background of the process.
These models are developed primarily for the planning and design of the processing plants, and therefore usually focus on the description of the ideal steady-states of the processes
based on established laws of physics
does not make assumptions such as empirical model and fitting parameters
using experimental data

Data-driven data-driven models are based on the data measured within the processing plants, and thus describe the real process conditions, they are, compared to the model-driven Soft Sensors, more reality related and describe the true conditions of the process in a better way. Nevertheless

The most commonly applied multivariate analysis tools are principal component analysis (PCA) for fault detection and projection of latent structures (PLS) for the prediction of key quality parameters at end of batch.

First-principle models may be the answer, using experimental data instead of statistical methods to estimate model parameters. They are not as quick and easy to build, but they have many advantages. In terms of simulation, first-principle models provide extrapolation in addition to the interpolation provided by data-driven models. But they also can be used for monitoring, control and optimization.

Soft-Sensor Modelling

vigorfif/Soft-Sensor-Modelling: Soft sensor modelling using multiple machine learning algorithms Dataset: SRU from Fortuna, Model: NN-BP, LSTM, RNN
hkaneko1985/adaptive_soft_sensors: Adaptive Soft Sensors Dataset: Debutanizer from Fortuna, Model: MWPLSm MWSVR, JITPLS, JITSVR and LWPLS
- hkaneko1985/lwpls: Locally-Weighted Partial Least Squares (LWPLS) Dataset: Debutanizer from Fortuna, Model: LWPLS

Others

rogeredc/soft_sensor: soft_sensor_for_chemical_process LSTM
alexalex222/Batch-Process-Soft-Sensor: Gaussian process regression algorithms for nonlinear system state prediciton Matlab, Model: GPR, NN, SVR
yaole0720/VTR-based-Soft-Sensor: Variable Time Reconstruction based modeling framework for soft sensor development Matlab, Model: VTR
chahat99/Soft-Sensor-with-ANN: Soft sensor using Artificial Neural Networks Matlab, NN
aysunrhn/Adaptive-Soft-Sensor-Design at 4a1d72787e3d08ccb93af11ca24cfda76a9d0f96 Matlab

Dataset

Reference

Ebook

curiousily/Deep-Learning-For-Hackers: Machine Learning tutorials with TensorFlow 2 and Keras in Python (Jupyter notebooks included) - (LSTMs, Hyperameter tuning, Data preprocessing, Bias-variance tradeoff, Anomaly Detection, Autoencoders, Time Series Forecasting, Object Detection, Sentiment Analysis, Intent Recognition with BERT)

16 - Historical Colorization

Historical Colorization

Old Historical Image/Video

Home - Beeldbank Nederlands Instituut voor Militaire Historie

Colorization Tools

Image DeOldify at Google Colab https://colab.research.google.com/github/jantic/DeOldify/blob/master/ImageColorizerColab.ipynb
Video Deldify at Google Colab: https://colab.research.google.com/github/jantic/DeOldify/blob/master/VideoColorizerColab.ipynb
GitHub - skzv/colorize-photos: Colorize all the photos in a directory

17 - Interesting Machine Learning Papers

Interesting Machine Learning Papers

Neural Representation

MPC Deep Learning Control

Deep Implicit Layers

18 - Machine Learning Tools

Machine Learning Tools

Machine Learning Toolbox

Ludwig is a toolbox that allows to train and evaluate deep learning models without the need to write code
a machine learning tool that allows to train, test and use models without writing code
PyCaret is an open source, low-code machine learning library in Python that allows you to go from preparing your data to deploying your model within minutes in your choice of notebook environment.
WEKA The workbench for machine learning
igel A machine learning tool that allows you to train/fit, test and use models without writing code
sktime A unified toolbox for machine learning with time series
FiftyOne is an open source machine learning tool created by Voxel51 that helps you get closer to your data and ML models. With FiftyOne, you can rapidly experiment with your datasets, enabling you to search, sort, filter, visualize, analyze, and improve your datasets without excess wrangling or writing custom scripts.

Machine Learning Deployment

Machine Learning Versioning Control

Replicate AI versioning control for AI
Comet ML versioning control for ML

Data Studio

Data Studio Google

Machine Learning Ops

Machine Learning Toolbox

Machine Learning Toolbox
LabML Neural Networks This is a collection of simple PyTorch implementations of neural networks and related algorithms.

Machine Learning

alan-turing-institute/MLJ.jl at mlnews Julia Machine Learning Library
Rudrabha/Wav2Lip at mlnews Voice Wave to Lip Movement
Best AI Paper 2020

Machine Learning Tools

aimhubio/aim: Aim — a super-easy way to record, search and compare 1000s of ML training runs
- AimStack - Dev tools for AI engineers.

Replicate – Version control for machine learning

Machine Learning Steps

Machine Learning Field Guide
- Importing Data
- Data Cleaning
- Visualisation
- Modelling
- Production

Machine Learning

Annotation Tool

Machine Learning

Machine Learning Labeling

19 - Awesome List of Machine Learning Visualization

Awesome List of Machine Learning Visualization

Machine Learning Visualization

Distill Pub : Visualizing AI algorithms
Open AI Microscope : understanding ML models
CNN Explainer : interactive CNN explanation
TrustMLVis TrustMLVis Browser, A Visual Survey in Enhancing Trust in Machine Learning (ML) Models with Visualization License: CC-BY
What if Tools : Visually probe the behavior of trained machine learning models, with minimal coding.
Visualizing a neural network
lutzroeder/netron: Visualizer for neural network, deep learning, and machine learning models
TensorSpace.js
ML and NN Visualization
CNN Explain
CNN Explain
Flat Explainer
Explained
CS231n Explanatin

20 - Machine Learning in Sports

Machine Learning in Sports

Baseball

Soccer

Soccer Data Set

21 - Machine Learning for Image Processing

Machine Learning for Image Processing

Image to Art

Art Line : https://github.com/vijishmadhavan/ArtLine
- Self-Attention (https://arxiv.org/abs/1805.08318). Generator is pre trained UNET with spectral normalization and self-attention. Something that I got from Jason Antic’s DeOldify(https://github.com/jantic/DeOldify), this made a huge difference, all of a sudden I started getting proper details around the facial features.
- Progressive Resizing (https://arxiv.org/abs/1710.10196),(https://arxiv.org/pdf/1707.02921.pdf,(https://arxiv.org/pdf/1707.02921.pdf)). Progressive resizing takes this idea of gradually increasing the image size. In this project, the image size was gradually increased and learning rates were adjusted. Thanks to fast.ai for introducing me to Progressive resizing, this helps the model to generalize better as it sees many more different images.
- Generator Loss : Perceptual Loss/Feature Loss based on VGG16. (https://arxiv.org/pdf/1603.08155.pdf).
Stylized Neural Painters : simulated paint process for a photo, Github repo, Colab
Toonify convert photo to 3d cartoon

Image Reconstruction and Upscaler

DeOldify: A Deep Learning based project for colorizing and restoring old images (and video!)
Github PifuHD: 3D Human Reconstruction
Image Super Resolution
Waifu : Image, GIF and Video enlarger/upscaler(super-resolution) achieved with Waifu2x, SRMD, RealSR, Anime4K and ACNet.
Reconstructing 3D from Photos

Image Cloaking

Fawkes : Prevent AI for identifying photo

Computer Vision Data Manipulation

cvdata (MIT) : Tools for creating and manipulating computer vision datasets: resize images, rename files, annotation format conversion, image format conversion, Split dataset into training, validation, and test subsets and much more.

ML-based Image Processor

Hotpot AI ; Design Assistant, starting from image processing, device mockups, to social media posts

22 - Machine Learning for Health

Machine Learning for Health

AI in Healths in Indonesia

Niko
Healhtech
Madeena
bravehealth.tech, rafi amjadrasyid
gigi.id
farmasee.id
ctscope
alinamed
Lamesia.com

Deep Learning for Medical Image Segmentation

How to Choose a Neural Net Architecture for Medical Image Segmentation

Healthcare Apps

Awesome Healthcare

23 - Machine Learning in Agriculture

Machine Learning in Agriculture

Disease Detection

Leaf Disease Detection

Crop Simulator and Simulation

DSSAT

Plant Phenomics

Deep Plant Phenomics

24 - Machine Learning for Earth Science

Machine Learning for Earth Science

AI for Earth Science

Climate

Climate indices (BSD-3-Clause) : Python implementations of various climate index algorithms which provide a geographical and temporal picture of the severity of precipitation and temperature anomalies useful for climate monitoring and research.
MIT Climate CoLab : A collaborative online community centered around a series of annual contests that seek out promising ideas for fighting climate change.
Climate Action Challenge : global design competition calling on the creative community to submit bold, innovative solutions to combat the impacts of climate change.
Improving Weather Prediction with CNN: J. A. Weyn, D. R. Durran, and R. Caruana, “Improving data-driven global weather prediction using deep convolutional neural networks on a cubed sphere,”Journal of Advances in Modeling Earth Systems, vol. 12, no. 9, Sep. 2020,ISSN: 1942-2466.doi:10.1029/2020ms002109.[Online]. Available:http://dx.doi.org/10.1029/2020MS002109. Code: https://github.com/jweyn/DLWP-CS

25 - Machine Learning SOTA and Model Zoo

Machine Learning SOTA and Model Zoo

List of SOTA

State of the Art AI
Gradio Hub
- Sketch Recognition
- Painting Generator (StyleGAN)
- Pose Estimation
- Colorize Photos (DeOldify)
- Identifying Skin Cancer
- Emotion Classification
- MobileNet vs. InceptionNet
Papers with Code SOTA
AI Hub by Google

Model Zoo

Model Zoo Machine Learning Playground
Cloud Blobcity : GitHub Data Science projects repository, executable in 1 click
CKnowledge Find portable workflows, reusable artifacts and automation actions for deep tech (AI, ML, quantum, systems…), ML architecture comparison
Gradio
Gradio Spaces
black0017/MedicalZooPytorch: A pytorch-based deep learning framework for multi-modal 2D/3D medical image segmentation
CatalyzeX : Discover AI models & code to catalyze your projects

Report about AI Progress

Specific SOTA

Machine Learning Review

Best deep CNN architectures and their principles: from AlexNet to EfficientNet | AI Summer

26 - Machine Learning Metrics

Machine Learning Metrics

Accuracy and Loss

Loss (not in %) can be seen as a distance between the true values of the problem and the values predicted by the model. Greater the loss is, more huge is the errors you made on the data.
Loss is often used in the training process to find the “best” parameter values for your model (e.g. weights in neural network). It is what you try to optimize in the training by updating weights.
Accuracy (in %) can be seen as the number of error you made on the data.
Accuracy is more from an applied perspective. Once you find the optimized parameters above, you use this metrics to evaluate how accurate your model’s prediction is compared to the true data.
That means :
- a low accuracy and huge loss means you made huge errors on a lot of data
- a low accuracy but low loss means you made little errors on a lot of data
- a great accuracy with low loss means you made low errors on a few data (best case)

Prediction

Condition Positive (P) : the number of real positive cases in the data
Condition Negative (N) : the number of real negative cases in the data
True Positive or Hit
True Negative or Correct Rejection
False Positive or False Alarm or Type I error
False Negative or Miss or Type II error

Confusion Matrix

TP TN FP FN

Accuracy

Accuracy (ACC) = (Σ True positive + Σ True negative)/Σ Total population
Accuracy = (TP + TN)/(TP + TN + FP + FN)
Accuracy is sensitive to class imbalance

Precision or Positive Predictive Value (PPV)

Precision measures how accurate is the predictions.
Precision is the percentage of the predictions are correct.
Precision measures the “false positive rate” or the ratio of true object detections to the total number of objects that the classifier predicted.
Precision: how many selected items are relevant.
Precision, a.k.a. positive predicted value, is given as the ratio of true positive (TP) and the total number of predicted positives.
Precision = Σ True positive/Σ Predicted condition positive

Recall or Sensitivity or True Positive Rate or Probablity of Detection

Recall measures how good the algorithm find all the positives.
Recall measures the “false negative rate” or the ratio of true object detections to the total number of objects in the data set.
Recall: how many relevant items are selected.
Recall : the ratio of TP and total of ground truth positives.
Recall = Σ True positive/Σ Condition positive

F1 Score

Harmonic mean of Precision and Recall
Because it is difficult to compare two models with low precision and high recall or vice versa.
So to make them comparable, F-Score is used.
F1 score = 2 · (Precision · Recall)/(Precision + Recall)

Matthews Correlation Coefficient (MCC) or Phi Coefficient

MCC is used in machine learning as a measure of the quality of binary (two-class) classifications
MCC takes into account all four values in the confusion matrix, and a high value (close to 1) means that both classes are predicted well, even if one class is disproportionately under- (or over-) represented.

Intersection over Union (IoU)

IoU is used for detection algorithm
The IoU is given by the ratio of the area of intersection and area of union of the predicted bounding box and ground truth bounding box.
- an IoU of 0 means that there is no overlap between the boxes
- an IoU of 1 means that the union of the boxes is the same as their overlap indicating that they are completely overlapping
The IoU would be used to determine if a predicted bounding box (BB) is TP, FP or FN. The TN is not evaluated as each image is assumed to have an object in it.
Traditionally, we define a prediction to be a TP if the IoU is > 0.5, then:
- True Positive (if IoU > 0.5)
- False Positive (if IoU < 0.5 or Duplicated Bounding Box)
- False Negative, when our object detection model missed the target (if there is no detection at all or when the predicted BB has an IoU > 0.5 but has the wrong classification)
mean Average Precision (mAP) score is calculated by taking the mean AP over all classes and/or over all IoU thresholds. Mean average precision (mAP) is used to determine the accuracy of a set of object detections from a model when compared to ground-truth object annotations of a dataset. Note:
What is Mean Average Precision (mAP) in Object Detection?
Mean Average Precision (mAP) Explained | Paperspace Blog
mAP (mean Average Precision) might confuse you! | by Shivy Yohanandan | Towards Data Science

Reference

Multiclass Metrics

Accuracy, Precision, Error

Accuracy is closeness of the measurements to a specific value
- More commonly, it is a description of systematic errors, a measure of statistical bias; low accuracy causes a difference between a result and a “true” value. ISO calls this trueness.
- the accuracy of a measurement system is the degree of closeness of measurements of a quantity to that quantity’s true value
- bias is the amount of inaccuracy
Precision is the closeness of the measurements to each other
- Precision is a description of random errors, a measure of statistical variability.
- The precision of a measurement system, related to reproducibility and repeatability, is the degree to which repeated measurements under unchanged conditions show the same results.
- variability is the amount of imprecision

Accuracy has two definitions:

More commonly, it is a description of systematic errors, a measure of statistical bias; low accuracy causes a difference between a result and a “true” value. ISO calls this trueness.
Alternatively, ISO defines accuracy as describing a combination of both types of observational error above (random and systematic), so high accuracy requires both high precision and high trueness.

Learn more:

POCD

Machine Learning Overfitting Handling

27 - Awesome NLP Projects

Awesome NLP Projects

Learn NLP

NLP Benchmark

NLP Projects

Language NLP

Speech and Language Processing

Summarizer NLP

gregdurrett/berkeley-doc-summarizer: The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploits syntactic information to compress it, and uses coreference constraints to ensure clarity.

NLP

Text Generation

28 - Awesome Latex Apps

Awesome Latex Apps

Learn Latex

Write LaTeX without LaTeX Editor

Online Paper (with Latex) Editors

Swiftlatex : WASM Latex editor
Overleaf
Manuscript.io
Authorea

Online Math (Latex) Editor

Codecogs Equation Editor : online Latex equation editor
LaTex Equation Editor : the best online Latex equation editor
Online Latex Equation Editor - Sciweavers
HostMath - Online LaTeX formula editor and browser-based math equation editor
Math Writer

Calculator Latex

Image to Latex Converter

Sketch to Latex and Diagram

MyScript Webdemo

Offline Latex Editor

LatexDraw : LaTeXDraw is a graphical drawing editor for LaTeX. LaTeXDraw can be used to 1) generate PSTricks code; 2) directly create PDF or PS pictures.
Texmaker : LaTex editor
MiKTex : LaTex engine
KLatex Formula
SwiftLatex : WASM Latex processing

Latex Code Checker

Latex and LanguageTool for VSCode Github Repo

Word to Latex

Pandoc (or using PanWriter)
MS Word Latex Converter, guide and limitation
Latexinword
Texword
Online Word to Latex Converter

Graph to Latex

Latex to HTML

TeX4ht - TeX Users Group

Latex Math

MathDeck

Latex , Markdown, and HTML

Pseudocode in Latex

Pseudocode
- Latex example

29 - Awesome Scientific Journal Tools

Awesome Scientific Journal Tools

Impact Factor

Impact Factor for Specific Publishers

Article Processing Charge (APC) for Open Access Paper

APC and IF relationship

Journal Search

Tandfonline Open Journal Search
OpenDOAR OpenDOAR is the quality-assured, global Directory of Open Access Repositories.
Journal Finder: Journal, Publisher, Index, SJR Rank, Q, Country, Acceptance Rate, Time To Decicion, APC.
Scimago Journal & Country Rank
Find journals | Elsevier® JournalFinder
Internet Archive Scholar Scholar Archive

Journal Keyword Alert and RSS

Journal Publisher List

Journal Notes

Publikasi Jurnal

Journal Article Tools

30 - Safety Helmet Detection

Safety Helmet and Plate Detection

Safety Helmet Detection Github Repos

njvisionpower/Safety-Helmet-Wearing-Dataset: safety helmet dataset, segmented by Yolo
wujixiu/helmet-detection : SSD-MobileNet, Faster-RCNN, TinyYOLO etc. Paper
BlcaKHat/yolov3-Helmet-Detection : YOLOv3, image
iamdsc/automatic-helmet-detection : realtime, SSD-MobileNet
Angericky/safety-helmet-detection-in-real-time-video: realtime, YOLO
AyazSaiyed/Helmet-Detection-: image, YOLO
rafiuddinkhan/Yolo-Training-GoogleColab: realtime, YOLO
YaphetS-X/CenterNet-MobileNetV3: realtime, CenterNet-MobileNet
PeterH0323/Smart_Construction: realtime, YOLOv5
mohanrajmit/Safety_Detection: realtime, YOLO
Object-Detection_HelmetDetection: image (video?), Contex RCNN
BlcaKHat/yolov3-Helmet-Detection : YOLO
AnshulSood11/PPE-Detection-YOLO-Deep_SORT: YOLO, deepsort
iamdsc/automatic-helmet-detection
abhijeet-talaulikar/Automatic-Helmet-Detection: Designing a system that can automatically detect the absence of helmet on motorcyclists by processing live video streams from a surveillance camera.
abhishekbhutra1/Motorcyclist-Detection: Detection of Motorcyclist without helmet
RajaSudalai/Motorcyclist_Helmet_Detection: Automatic traffic violation detection of Motorcyclist not wearing Helmet - Instance Segmentation using Mask R-CNN
ashish2050/Smart-Traffic-Surveillance-System-: a model built using YOLOv4 and Darknet to detect helmet and licence plate of a motorcyclist
SachinHR/Automatic-number-plate-recognition-for-motorcyclists-riding-without-helmet.: Automatic number plate recognition for motorcyclists riding without helmet with OpenCV.
Tejaswini1399/Detection-of-Motorcyclists-without-helmet
PardeshiSourabh/ObjectDetectionHaar: Object Detection using Haar-like features to enforce helmet check for motorcyclists and improve rider safety

License Plate Detection Github Repos

openalpr/openalpr: OpenALPR is an open-source Automatic License Plate Recognition library written in C++ with bindings in C#, Java, Node.js, Go, and Python. The library analyzes images and video streams to identify license plates.
xuexingyu24/License_Plate_Detection_Pytorch: This is a two-stage lightweight and robust license plate recognition in MTCNN and LPRNet using Pytorch. MTCNN is a very well-known real-time detection model primarily designed for human face recognition. It is modified for license plate detection. LPRNet, another real-time end-to-end DNN, is utilized for subsequent recognition.
Dharun/Tensorflow-License-Plate-Detection : ssd-mobilenet + tasseract OCR
sergiomsilva/alpr-unconstrained, paper
TheophileBuy/LicensePlateRecognition : Yolo+Edge Detection+CNN
qjadud1994/CRNN-Keras : CNN (for character recognition) + RNN (for number sequence detection), image
lyl8213/Plate_Recognition-LPRnet, paper : CNN + CTC, image
sirius-ai/LPRNet_Pytorch, LPRNet implementation in Pytorch
Tensorflow Plate Reader
Dharun/Tensorflow-License-Plate-Detection: The project developed using TensorFlow to recognize the License Plate from a car and to detect the charcters from it.

Ideas

Real-Time Helmet Detection: video-based, handheld based.
upgrade wujixiu/helmet-detection : SSD-MobileNet, Faster-RCNN, TinyYOLO etc. Paper
upgrade iamdsc/automatic-helmet-detection : realtime, SSD-MobileNet
upgrade rafiuddinkhan/Yolo-Training-GoogleColab: realtime, YOLO
upgrade YaphetS-X/CenterNet-MobileNetV3: realtime, CenterNet-MobileNet
upgrade PeterH0323/Smart_Construction: realtime, YOLOv5
upgrade mohanrajmit/Safety_Detection: realtime, YOLO
combined with plate identification?

Safety Helmet Detection Paper

Safety Helmet and Plate Detection Papers

31 - Emotion Detection with Machine Learning

Emotion Detection with Machine Learning

Deep learning for robust feature generation in audiovisual emotion recognition, pdf, Yelin Kim : DBN
Learning deep features for image emotion classification, Ming Chen : CNN
Multimodal emotion recognition using deep learning architectures, pdf,Hiranmayi Ranganathan : CDBN
Deep learning for emotion recognition on small datasets using transfer learning pdf
Learning multi-level deep representations for image emotion classification
Joint Image Emotion Classification and Distribution Learning via Deep Convolutional Neural Network.,Jufeng Yang : Multitask CNN
Real-time Facial Emotion Classification Using Deep Learning, pdf, Emre Dandıl : CNN + Viola-Jones algorithm for face detection
Deep learning approaches for facial emotion recognition: A case study on FER-2013, pdf, Panagiotis Giannopoulos : AlexNet GooLeNet
A brief review of facial emotion recognition based on visual information, pdf, Byoung Chul Ko : review of FER methods, emotion classification in both spatial and temporal, there are to categories: (a) CNN, (b) CNN and LSTM, list of datesets.
A review on deep learning algorithms for speech and facial emotion recognition, pdf, Charlyn Pushpa Latha : review of (speech and facial) methods, algorithm categories: (a) DBM (b) DNN (c) CNN (c) SAE (f) others
Facial emotion recognition in real time, Dan Duncan : CNN with running average, VGGS network with a face-detector provided by OpenCV (Haar Cascade),
Facial Emotion Recognition Using Hybrid Features, pdf, Abdulrahman Alreshidi : Haar Cascade + Neighboring Difference Features (NDF)
Deep learning model for facial emotion recognition
Facial emotion recognition using deep convolutional networks
Labeling images with facial emotion and the potential for pediatric healthcare, pdf, Haik Kalantarian : scalable aggregation of emotive frames from children with autism
Facial emotion recognition using deep convolutional neural network
Facial emotion recognition in real-time and static images
Real-time Algorithms for Facial Emotion Recognition: A Comparison of Different Approaches
Facial emotion recognition using deep learning: review and insights pdf
Facial emotion recognition from videos using deep convolutional neural networks

Github

atulapra/Emotion-detection : haar cascade +CNN
MauryaRitesh/Facial-Expression-Detection : haar cascade + ?
kaushikjadhav01/Deep-Surveillance-Monitor-Facial-Emotion-Age-Gender-Recognition-System : haar cascade + VGGNet/Resnet
MauryaRitesh/Facial-Expression-Detection-V2
Facial-Emotion-Detection This work showcases two independent methods for recognizing emotions from faces. The first method using representational autoencoder units, a fairly original idea, to classify an image among one of the seven different emotions. The second method uses a 8-layer convolutional neural network which has an original and unique design, and was developed from scratch.
juan-csv/Face_info: face recognition, and facial attributes detection (age, gender, emotion and race)
weblineindia/AIML-Human-Attributes-Detection-with-Facial-Feature-Extraction This is a Human Attributes Detection program with facial features extraction. It detects facial coordinates using FaceNet model and uses MXNet facial attribute extraction model for extracting 40 types of facial attributes. This solution also detects Emotion, Age and Gender along with facial attributes.
berksudan/Real-Time-Emotion-Detection Real time emotion detection from facial expression using both machine learning and deep learning techniques.
susantabiswas/realtime-facial-emotion-analyzer : CNN
m-elkhou/Facial_Expression_Detection : Automatic Micro-Expression Recognition (AMER) This project was about providing an Android application that can help people take charge of their own emotional health by capturing their micro expressions such as happiness, sadness, anger, disgust, surprise, fear, and neutral. Paper
serengil/deepface : A Lightweight Deep Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Framework for Python
mayanktolani19/RealTimeEmotionDetection: A flask based web app which utilizes your webcam to capture real time video and display your facial emotion.

Ideas

Real Time Facial Feature Extraction : Emotional + others, video based, handheld based.
upgrade juan-csv/Face_info approach
upgrade weblineindia/AIML-Human-Attributes-Detection-with-Facial-Feature-Extraction approach
upgrade m-elkhou/Facial_Expression_Detection

Library

Jeeliz Github: open source, web app
- for Drowsiness detection, by Abhilash26 aka Dinodroid: Be sure to don’t fall asleep when driving thanks to this webapp! You can try it here: dont-drive-drowsy.glitch.me, view the source code or a demo video
- for Expressions reader, by Abhilash26 aka Dinodroid: detects 5 high level expressions (happiness, fear, anger, surprise, sadness) from the morph coefficients given by this lib, and display them as smileys. You can try it here: emotion-reader.glitch.me or browse the source code
- Realtime Facial Emotion Analyzer

32 - Face Mask Detection with Machine Learning

Face Mask Detection with Machine Learning

Github

AIZOOTech/FaceMaskDetection : SSD, self model
chandrikadeb7/Face-Mask-Detection: MobileNetv2
Spidy20/face_mask_detection : Faster RCNN
mk-gurucharan/Face-Mask-Detection, Medium : Haar Cascade (OpenCV) + CNN
NVIDIA-AI-IOT/face-mask-detection, article : NVIDIA DetectNet_v2 (based on ResNet-18), on Jetson Devices
PureHing/face-mask-detection-tf2 : SSD (based on Mobilenet and RFB)
rfribeiro/mask-detector : Haar Cascade (OpenCV) + MobileNetv2
rohanrao619/Social_Distancing_with_AI : Yolov3 for object detection, Dual Shot Face Detector (DSFD) (better than Haar Cascade) for face detection, ResNet50 for face classification
datarootsio/face-mask-detection : RetinaFace (RetinaNetMobileNetV1) for face detection, MobileNetV1 for face classification
Qengineering/Face-Mask-Detection-Raspberry-Pi-64-bits : Linzaer for face detection, Paddle Lite for face classification, on Raspberry Pi
adityap27/face-mask-detector: Yolo v2, v3, v4
Rahul24-06/COVID-19-Authorized-Entry-using-Face-Mask-Detection: ResNet18 on Jetson Nano
matlab-deep-learning/COVID19-Face-Mask-Detection-using-deep-learning

Dataset

Paper

A hybrid deep transfer learning model with machine learning methods for face mask detection in the era of the COVID-19 pandemic, Loey et. al. : Resnet(+DeepTree, SVN, Ensemble)
Fighting against COVID-19: A novel deep learning model based on YOLO-v2 with ResNet-50 for medical face mask detection, Loey et. al. : Resnet (+Yolo v2)
Face Mask Detection using Transfer Learning of InceptionV3 G. Jignesh Chowdary, et al. : InceptionV3
A Deep Learning Based Assistive System to Classify COVID-19 Face Mask for Human Safety with YOLOv3, Md. Rafiuzzaman Bhuiyan et. al : Yolo v3
Comparative Study of Deep Learning Methods in Detection Face Mask Utilization PDF, Ivan Muhammad Siegfried: MobileNetV2 vs ResNet50V2 vs Xception
Covid-19 Face Mask Detection Using TensorFlow, Keras and OpenCV, Arjya Das et. al.: self CNN
MACHINE LEARNING (CONVOLUTIONAL NEURAL NETWORKS) FOR FACE MASK DETECTION IN IMAGE AND VIDEO, Ramot Lubis : MobileNet
RetinaMask: A Face Mask detector, Mingjie Jiang: RetinaFaceMask
Real Time Multi-Scale Facial Mask Detection and Classification Using Deep Transfer Learning Techniques, Kumar Addagarla : Yolo v3 vs Resnet (+NASNetMobile)
Real-Time Facemask Recognition with Alarm System using Deep Learning, Sammy V. Militante : VGG-16, Raspberry Pi
Mask Detection Using Framework Tensorflow and Pre-Trained CNN Model Based on Raspberry Pi pdf, Acep Ansor: MobileNet, Raspberry Pi
An Application of Mask Detector For Prevent Covid-19 in Public Services Area pdf, Henderi: ???, Sipeed Maix
Face Mask Detector, Akhyar Ahmed: MobileNet vs Resnet vs Exception
A FACEMASK DETECTOR USING MACHINE LEARNING AND IMAGE PROCESSING TECHNIQUES., Amrit Kumar Bhadani: MobileNetV2
Detecting masked faces in the wild with lle-cnns, pdf, S Ge: LLE CNN
Identifying Facemask-Wearing Condition UsingImage Super-Resolution with Classification Networkto Prevent COVID-19, Bosheng Qin : SRCNet

Ideas

Face Mask with Face Presentation Attack Detection (in this case: mask with part of face), with lighting and distance effect analysis on detection, working on handheld devices, video based
upgrade mk-gurucharan/Face-Mask-Detection
upgrade rfribeiro/mask-detector : Haar Cascade (OpenCV) + MobileNetv2
upgrade rohanrao619/Social_Distancing_with_AI : Yolov3 for object detection, Dual Shot Face Detector (DSFD) (better than Haar Cascade) for face detection, ResNet50 for face classification
upgrade datarootsio/face-mask-detection : RetinaFace (RetinaNetMobileNetV1) for face detection, MobileNetV1 for face classification
upgrade Rahul24-06/COVID-19-Authorized-Entry-using-Face-Mask-Detection: ResNet18 on Jetson Nano

Project in Progress (by Rozi)

Deep Learning for Face Detection in Real Time
Face Detection : SSD ResNet10 dan MTCNN
Mask Classification : CNN with MobileNetV2 dan VGG16Net
PC and Android Deployment
Variation :
- distance
- lighting
- mask variation (+face attack)
Metric for Performance Analysis :
- Accuracy, Precision, Recall, F1 for image analysis
- mAP@0.5 (Mean Average Precision) for image analysis
- FPS for video analysis
Reference:
- Face detection benchmark, Medium
- Real-Time Multi-Scale Face Detector on Embedded Devices

Object Detection

Object Detection is Object Localization and Object Classification
Model for Object Detection: Fast R-CNN, Faster R-CNN, Histogram of Oriented Gradients (HOG), Region-based Convolutional Neural Networks (R-CNN), Region-based Fully Convolutional Network (R-FCN), Single Shot Detector (SSD), Spatial Pyramid Pooling (SPP-net), YOLO (You Only Look Once)

YOLO

Redmond developed YOLO v1, YOLO v2, YOLO v3, but YOLO v4 and YOLO v5 were developed by others
Yolo at Darknet, Github Repo
How to Perform Object Detection With YOLOv3 in Keras
Real-time Object Detection with YOLO, YOLOv2 and now YOLOv3
YOLO object detection with OpenCV
What is YOLO Object Detection?
Introduction to Yolo
High-performance multiple object tracking based on YOLOv4, Deep SORT, and optical flow

TinyYolo for Mobile App

Yolo for custom object

SSD

Widerface

WiderFace and Comparison

Model Zoo

33 - CT-Scan for Covid Classification using Machine Learning

CT-Scan for Covid-19 Classification using Machine Learning

Dataset

UCSD-AI4H/COVID-CT
- Paper
ieee8023/covid-chestxray-dataset
Kaggle
Dataset: CT Scan for Covid Classification
NCOV China
haydengunraj/COVIDNet-CT: COVID-Net Open Source Initiative - Models and Data for COVID-19 Detection in Chest CT
- DAtaset : three categories (Covid, Control, Pneunomia)
COVID-19 CT Lung and Infection Segmentation Dataset | Zenodo
Artificial intelligence for the detection of COVID-19 pneumonia on chest CT using multinational datasets | Nature Communications
- Clara Deploy AI COVID-19 Classification | NVIDIA NGC: two categories (Covid, Normal)
LIDC-IDRI - The Cancer Imaging Archive (TCIA) Public Access - Cancer Imaging Archive Wiki

Notes

Github

AlexTS1980/COVID-CT-Mask-Net : Segmentation and Classification, category (COVID, pneumonia, normal), Mask R-CNN.
bkong999/COVNet
- Artificial Intelligence Distinguishes COVID-19 from Community Acquired Pneumonia on Chest CT
- Using Artificial Intelligence to Detect COVID-19 and Community-acquired Pneumonia Based on Pulmonary CT: Evaluation of the Diagnostic Accuracy
JordanMicahBennett/SMART-CT-SCAN_BASED-COVID19_VIRUS_DETECTOR, historical project related CT scan usage for Covid Classification
Covid-Net, complete covid-net project (Covid-Net, CovidNet-S, CovidNet-CT, COVIDNet-CXR)
- COVID-Net: a tailored deep convolutional neural network design for detection of COVID-19 cases from chest X-ray images
- COVIDNet-S: Towards computer-aided severity assessment via training and validation of deep neural networks for geographic extent and opacity extent scoring of chest X-rays for SARS-CoV-2 lung disease severity
- COVIDNet-CT: A Tailored Deep Convolutional Neural Network Design for Detection of COVID-19 Cases from Chest CT Images, GitHub Repo haydengunraj/COVIDNet-CT : COVIDNet-CT
- zeeshannisar/COVID-19 : CovidNet with different architecture pretrained
- iliasprc/COVIDNet : Pytorch Implementation of CovidNET
Jeremykhon, complete list of project
kaushikjadhav01/COVID-19-Detection-Flask-App-based-on-Chest-X-rays-and-CT-Scans
JunMa11/COVID-19-CT-Seg-Benchmark CT Scan Segmentation
rekalantar/covid19_detector
aniruddh-1/COVID19_Pneumonia_detection
rohilrg/COVID19-xray-classifier
KiLJ4EdeN/DeepCOVID
sydney0zq/covid-19-detection
Paperswithcode
paper/harmony-search-and-otsu-based-system-for

Paper

A Deep Learning System to Screen Novel Coronavirus Disease 2019 Pneumonia
- Deep learning system to screen coronavirus disease 2019 pneumonia - similar with above paper
Imaging and clinical features of patients with 2019 novel coronavirus SARS-CoV-2, Xi Xu et al.
Chest CT findings in patients with coronavirus disease 2019 (COVID-19): a comprehensive review, Jinkui Li et al.
CT in coronavirus disease 2019 (COVID-19): a systematic review of chest CT findings in 4410 adult patients, Vineeta Ojha et al.
iCTCF: an integrative resource of chest computed tomography images and clinical features of patients with COVID-19 pneumonia, Wanshan Ning et al. Project Site, Europe PMC Open Access Research Square
Computed Tomography (CT) Imaging Features of Patients with COVID-19: Systematic Review and Meta-Analysis, Ephrem Awulachew et al. NCBI Open Access
Open resource of clinical data from patients with pneumonia for the prediction of COVID-19 outcomes via deep learning, Wanshan Ning et al., Nature Open Access
Fast automated detection of COVID-19 from medical images using convolutional neural networks, Shuang Liang et al. Research Square
A Fully Automated Deep Learning-based Network For Detecting COVID-19 from a New And Large Lung CT Scan Dataset, Mohammad Rahimzadeh et al. Medrxiv Dataset Code
Lightweight Model For The Prediction of COVID-19 Through The Detection And Segmentation of Lesions in Chest CT Scans
Detection and Segmentation of Lesion Areas in Chest CT Scans For The Prediction of COVID-19
COVID-CT-Mask-Net: Prediction of COVID-19 From CT Scans Using Regional Features
COVID-CT-Dataset: A CT Scan Dataset about COVID-19 GitHub Repo
Sample-Efficient Deep Learning for COVID-19 Diagnosis Based on CT Scans
Benchmarking Deep Learning Models and Automated Model Design for COVID-19 Detection with Chest CT Scans GitHub
A Novel and Reliable Deep Learning Web-Based Tool to Detect COVID-19 Infection from Chest CT-Scan, Arxiv GitHub
Radiologist-Level COVID-19 Detection Using CT Scans with Detail-Oriented Capsule Networks GitHub

34 - Object Detection

Object Detection

Object detection in an hour - Stray Robots Blog

Object Detection and Segmentation

In-Browser Pose Identification

Rooftop Detection Machine Learning

Hand Detection and Hand Tracking

Browser based Face/Pose Identification

Handsfree.js
- MIDIBlocks/handsfree: Quickly integrate face, hand, and/or pose tracking to your frontend projects in a snap ✨👌
esimov/pigo: Fast face detection, pupil/eyes localization and facial landmark points detection library in pure Go. Pupil Localization
- Pupillary Distance Measurement
Mediapipe Iris megabyte model to predict 2D eye, eyebrow and iris geometry from monocular video captured by a front-facing camera on a smartphone in real time.
- mediapipe/mediapipe/models at master · google/mediapipe
- Google AI Blog: MediaPipe Iris: Real-time Iris Tracking & Depth Estimation

In-browser Object Detection

YOLO

There are a few different algorithms for object detection and they can be split into two groups:

Algorithms based on classification – they work in two stages. In the first step, we’re selecting from the image interesting regions. Then we’re classifying those regions using convolutional neural networks. This solution could be very slow because we have to run prediction for every selected region. Most known example of this type of algorithms is the Region-based convolutional neural network (RCNN) and their cousins Fast-RCNN and Faster-RCNN.
Algorithms based on regression – instead of selecting interesting parts of an image, we’re predicting classes and bounding boxes for the whole image in one run of the algorithm. Most known example of this type of algorithms is YOLO (You only look once) commonly used for real-time object detection.

YOLO metrics:

35 - Face Expression and Detection

Face Expression and Detection

Jeeliz Github: open source, web app
- for Drowsiness detection, by Abhilash26 aka Dinodroid: Be sure to don’t fall asleep when driving thanks to this webapp! You can try it here: dont-drive-drowsy.glitch.me, view the source code or a demo video
- for Expressions reader, by Abhilash26 aka Dinodroid: detects 5 high level expressions (happiness, fear, anger, surprise, sadness) from the morph coefficients given by this lib, and display them as smileys. You can try it here: emotion-reader.glitch.me or browse the source code
- Realtime Facial Emotion Analyzer

36 - Awesome Google Colab Notebooks

Awesome Google Colab Notebooks

Computer Vision Google Colab Notebooks

Google DayDream Produce dream-alike imagery link
Big GAN Produce photorealistic images link
Style Transfer Transfer style of an image to another link
DeOldify Colorization of Videos link
3D Ken Burns Effect 3D depth video of a photo link
First Order Motion model Transfers facial movements from video footage to an image link
WiKi Art Create Images using Art works just like Art in AI link
StyleGAN 2 Generate Facial Human Images using improved GAN link
Edge Detection Use open cv to detect edges of the various images and videos link
Learn to paint Teach machines to paint like human painters link
TwinGan Unsupervised image translation for Human portraits link
Face Swap GAN Takes 2 videos and swap faces link
Traffic Counting Counts the traffic on roads link
FaceBook detectron2 For object detection, covering bounding box and object instance segmentation outputs link
DeepFaceLab Generate High quality DeepFake Videos link
3D Feature Visualization Produce feature visualizations on 3D mesh surfaces link
Lucid Modelzoo Visualize neural networks link
YOLO V3 Object detection using Yolo link
Clothe image classification Classify clothing images link
Pneumonia detection Detect pneumonia from medical x-ray images link
Breast cancer detection Detect breast cancer from pre-trained data sets link
Imaging - Amaretto Amaretto is the software toolbox for network Biology link
Waifu 2x Upscale photo and video quality link
Twitter Trends Get a list of trending news on twitter link
Mask R-CNN Object detection using Mask R-CNN link
PiFu HD 3D Human Digitalisation link
Sudoku Solver Solve sudoku using SatNet link
GPT2 Adventure GPT2 based game link
Atari Games Train an agent to play Atari Games link
Cartoon GAN Cartoonize your photos link
Hall of Faces Detect faces in any photo link
Chrome Dino Game Bot that plays chrome dino game link
Plant Classification A plant seedling classifier link
Cascade R-CNN Object detection using Cascade RCNN link
Pedestrian Tracking Pedestrian Tracking with YOLOv3 and DeepSORT link
Segment Cars & Streets Real Time Semantic Segmentation with LightNet++ link
LipGAN Generate lip motion link
Deeplab V3 Semantic Segmentation using Torchvision Deeplab v3 link
OpenPose Pose Detection with OpenPose link
DensePose Pose Detection with DensePose link
Text Detection Text Detection in a scene with PixelLink link
Face Tracker Track and Segment Persons with SiamMask link
Image Upsampler Upsample Images and generate HQ resolution with Pulse link
Super Slow motion Make any video Super SlowMo link
Replace Background Replace Image background with Background Matting link
3D photo in-painting 3D Photography using Context-aware Layered Depth Inpainting link
Super Resolution Image Super Resolution Prediction link
3D object detection LiDAR point cloud based 3D object detection link
Earth Engine Interactive mapping using earth engine API and folium link
OpenVINO OpenVINO Edge developement and deployment on Google Colab link
Autonomous Vehicle A self-driving car that manuvers itself on a two-lane track. link
Image Colorization Instance aware image colorization link
Real Image Editing In-Domain GAN Inversion for Real Image Editing link

Data Science

Jupyter Notebook

37 - Arxiv, Paper Preprint, and Curated Paper Sites

Arxiv, Paper Preprint, and Curated Paper Sites

Arxiv Tools

Arxiv Vanity convert arxiv-pdf to html
Arxiv Sanity web interface for browsing, search and filtering recent arxiv submissions
Arxiv Times ML news from Arxiv
Daily Arxiv check arxiv articles based on date
Arxiv Search Engine for AI articles
Scirate: Top arXiv Papers

Curated Papers

42 Papers Trending papers in AI and Computer Science
Papers with Code
Deep AI org AI News and AI Curated Papers
Deep Learn AI News and AI Curated Papers
Best AI Paper 2020
reddit r/machinelearning: what are you reading

Paper Search

38 - Robotic Simulator

Open Source Robotic Simulator

Webots Open Source Robot Simulator
V-REP
Gazebo
ARGoS
OpenRave
Simspark
Drake : Drake aims to simulate even very complex dynamics of robots (e.g. including friction, contact, aerodynamics, …), but always with an emphasis on exposing the structure in the governing equations (sparsity, analytical gradients, polynomial structure, uncertainty quantification, …) and making this information available for advanced planning, control, and analysis algorithms. Drake provides an interface to Python to enable rapid-prototyping of new algorithms, and also aims to provide solid open-source implementations for many state-of-the-art algorithms.

39 - Machine Learning for Sport Pose Analysis

Machine Learning for Sport Pose Analysis

Pose Estimation

Pose Estimation in Jetson Device Blog

Sport Pose Analysis

Pose Estimation

Methods

HRNet
OpenPose
HigherHRNet
Smiple Baselines
Alphapose
Densepose
Personlab

Datasets

COCO (Common Objects in Context)
- Benchmark; Images from Flickr
MPII Human Pose (body_25)
- 25k images, 40k people, 401 human activities, extracted from YouTube videos
Leeds Sports Pose
- 2k images of mostly Sports from Flickr
Frames Labeled in Cinema (FLIC)
- 5003 Images from movies labeled by Amazon Mechanical Turk
FLIC Plus by Jon Tompson
Human3.6M
- 3D Single Person
HumanEva
- 7 videos with 3D body poses, 4 subjects, 6 common actions
SURREAL
- 6m frames of Synthetic Humans
Panoptic
Basketball Pose Analysis

40 - Machine Learning for Satellite Images

Machine Learning for Satellite Images

Notes

Indonesia Landcover Maps

Kementerian LHK

Open Source GIS Tools

History of Open Source GIS

SASPlanet

SASPlanet at SASGIS SASPlanet FAQ SASPlanet unable to load map or here The most needed links:

Program (latest official release, test builds and archive of previous versions): http://sasgis.org/download/
Attention! Servers with maps can be updated after the release, so the maps are strongly recommended to update anyway!
Nightly build, separate from archive versions: http://sasgis.org/programs/sasplanet/nightly.php
Program - all releases, beta versions and nightly builds (mirror): https://bitbucket.org/ sas_team / sas.planet.bin / downloads
NEW merged map set: https://github.com/sasgis/sas.maps/archive/master.zip Very short instruction:
unpack the program somewhere, for example, c: \ SASPlanet, but not in c: \ Program Files and not in c: \ Program Files (x86)
unpack the maps into the Maps directory (since they are newer than in the archive with the program). You should get SASPlanet \ Maps \ sas.maps \ and SASPlanet \ Maps \ plus.maps \.

41 - NLP Models

NLP Models

List of Excellent NLP Pretrained Models

List Pretrained Models NLP:

Multi-Purpose NLP Models
- ULMFiT
- Transformer
- Google’s BERT
- Transformer-XL
- OpenAI’s GPT-2
Word Embeddings
- ELMo
- Flair
Other Pretrained Models
- StanfordNLP
Reimplementation of NLP models in Jupyter

NLP Course

CS224n : Natural Language Processing with Deep Learning

42 - NLP for Bahasa Indonesia

NLP for Bahasa Indonesia

Indonesian NLP Dataset

43 - NLP with GPT

NLP with GPT

GPT-2

GPT-3

GPT Contentyze GPT-3 for writing assistant

GPT Models

EleutherAI Huge open source Language Models (i.e like GPT-3)
- EleutherAI/gpt-neo: An implementation of model parallel GPT2& GPT3-like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library.
Huggingface : On a mission to solve NLP, provide many NLP models.
- transformer : text generation using transformer GPT2

GPT Notes

GPT

OpenAI’s API Now Available with No Waitlist

GPT Alternatives

NLP

44 - Digital Twin

Digital Twin in Vehicles

DT in Electric Vehicles

Digital Twin Simulation

HASH - Complex Systems Simulation

Digital Twin Video

Digital Twin Project

Digital twin Porsche

45 - Awesome Jupyter Notebooks

Awesome Jupyter Notebooks

Related links: 🔗 note/Awesome Jupyter Notebooks
🔗 app/Jupyter Notebook Apps

Important contribution instructions: If you add new content, please ensure that for any notebook you link to, the link is to the rendered version using nbviewer, rather than the raw file. Simply paste the notebook URL in the nbviewer box and copy the resulting URL of the rendered version. This will make it much easier for visitors to be able to immediately access the new content.

Note that Matt Davis has conveniently written a set of bookmarklets and extensions to make it a one-click affair to load a Notebook URL into your browser of choice, directly opening into nbviewer.

Entire books or other large collections of notebooks on a topic
Scientific computing and data analysis with the SciPy Stack
General Python Programming
Notebooks in languages other than Python
- Julia
- Haskell
- Ruby
- Perl
- F#
- C#
- Javascript
Miscellaneous topics about doing various things with the Notebook itself
Reproducible academic publications
Other publications using the Notebook
Data-driven journalism
Whimsical notebooks
Videos of IPython being used in the wild
Accessing an IBM quantum computer via notebooks
Software Architecture

Entire books or other large collections of notebooks on a topic

Introductory Tutorials

First things first, how to run code in the notebook. There is also a general collection of notebooks from IPython. Another useful one from this collection is an explanation of our rich display system.
A great matplotlib tutorial, part of the fantastic Lectures on Scientific Computing with Python by J.R. Johansson.
The code of the IPython mini-book by C. Rossant, introducing IPython, NumPy, SciPy, Pandas and matplotlib for interactive computing and data visualization.
Python Tutorial by Rajath Kumar M P
Growing Neat Software Architecture from Jupyter Notebooks, a primer by Guillaume Chevalier on how to build clean software using notebooks.

Programming and Computer Science

Automata and Computability using Jupyter, an entire course, based on forthcoming book published by Taylor and Francis; book title: “Automata and Computability: Programmer’s Perspective”, by Ganesh Gopalakrishnan, Professor, School of Computing, University of Utah, Salt Lake City. [in English, has Youtube videos]
Introduction to Programming (using Python), an entire introductory Python course written by Eric Matthes. This post explains the educational context in an Alaskan high school where Eric is a teacher.
Numeric Computing is Fun A series of notebooks created to help educate aspiring computer programmers and data scientists of all ages with no previous programming experience.
Python for Developers, a complete book on Python programming by Ricardo Duarte. Note the book also exists in Portuguese, website translated into English
CS1001.py - Extended Introduction to Computer Science. Recitations from Tel-Aviv University introductory course to computer science, assembled as IPython notebooks by Yoav Ram.
Exploratory Computing with Python, a set of 15 Notebooks that cover exploratory computing, data analysis, and visualization. No prior programming knowledge required. Each Notebook includes a number of exercises (with answers) that should take less than 4 hours to complete. Developed by Mark Bakker for undergraduate engineering students at the Delft University of Technology.
Understanding evolutionary strategies and covariance matrix adaptation, from the Advanced Evolutionary Computation: Theory and Practice course by Luis Martí.
Code Katas in Python, a collection of algorithmic and data structure exercises covering search and sorting algorithms, stacks, queues, linked lists, graphs, backtracking and greedy problems.
Jupyter notebook activities for Part IA of the computing course (Michaelmas Term) in the Engineering Tripos at University of Cambridge, by Garth Wells.
Introduction to Python for Computational Science and Engineering (Hans Fangohr): Textbook for beginners, broken into one Jupyter Notebook per chapter. Can be executed and interacted with online using Binder.

Statistics, Machine Learning, and Data Science

Predicting PewDiePie’s daily subscribers using Linear Regression, a notebook which explains the implementation of Linear Regression from scratch, by Tanu Nanda Prabhu, author and editor at Towards data science.
Top Python Libraries Used In Data Science, this notebook explain the important library used in data science, by Tanu Nanda Prabhu, author and editor at Towards data science.
Web scraping using Python with BeautifulSoup and Requests libraries, a notebook which explains scraping the data from the internet from scratch, by Tanu Nanda Prabhu, author and editor at Towards data science.
Exploratory data analysis in Python, a notebook which explains the steps to perform Exploratory data Analysis in python from the scratch, by Tanu Nanda Prabhu, author and editor at Towards data science.
An introductory notebook on uncertainty quantification and sensitivity analysis developed for the Workshop On Uncertainty Quantification And Sensitivity Analysis For Cardiovascular Modeling by Leif Rune Hellevik, Vinzenz Eck and Jacob T. Sturdy.
Python Data Science Handbook Supplemental Materials, a collection of notebooks by Jake VanderPlas to accompany the book.
Data Cleaning using Python with Pandas Library, a Date Science notebook which clearly explains Data Cleaning using Python with Pandas Library at a beginner level, by Tanu Nanda Prabhu.
Social Network Analysis: From Graph Theory to Applications with Python. A tutorial on network creation, analysis, information flow and influence maximization with Networkx by Dima Goldenberg.
“ISP”: Introduction to Statistics with Python, a collection of notebooks accompanying the book of the same name, by Thomas Haslwanter.
Notebooks for the exercises in Andrew Ng’s online ML course, Spark and TensorFlow, as well as extra material on other tools from the scipy stack, by John Wittenauer.
AM207: Monte Carlo Methods, Stochastic Optimization: a complete course by Verena Kaynig-Fittkau and Pavlos Protopapas from Harvard, with all lecture materials and homework sets as notebooks.
An introduction to Bayesian inference, this is just chapter 1 in an ongoing book titled Probabilistic Programming and Bayesian Methods for Hackers Using Python and PyMC, by Cameron Davidson-Pilon.
Doing Bayesian Data Analysis: Python/PyMC3 code for a selection of models and figures from the book ‘Doing Bayesian Data Analysis: A Tutorial with R, JAGS, and Stan’, Second Edition, by John Kruschke (2015).
Learn Data Science, an entire self-directed course by Nitin Borwankar.
IPython Cookbook by Cyrille Rossant, a comprehensive guide to Python for Data Science. The code of the 100 recipes is available on the GitHub repository.
An introduction to machine learning with Python and scikit-learn (repo and overview) by Hannes Schulz and Andreas Mueller.
A progressive collection notebooks of the Machine Learning course by the University of Turin (with exercises).
Clustering and Regression, part of the UC Berkeley 2014 Introduction to Data Science course taught by Michael Franklin.
Neural Networks, part of a collection on machine learning by Aaron Masino.
An introduction to Pandas, part of an 11-lesson tutorial on Pandas, by Hernán Rojas.
Data Science and Big Data with Python by Steve Phelps.
Big Data for beginners A collections of notebooks on Hadoop, MapReduce, Spark.
The Statsmodels Project has two excellent collections of examples: in their official documentation and extra ones in their wiki. Too many there to directly duplicate here, but they provide great learning materials on statistical modeling with Python.
Machine Learning with the Shogun Toolbox. This is a complete service that includes a ready-to-run IPython instance with a collection of notebooks illustrating the use of the Shogun Toolbox. Just log in and start running the examples.
Python for Data Analysis, an introductory collection from the CU Boulder Research Computing Group.
The Kaggle bulldozers competition example, one of a set on tutorials on exploratory data analysis with the copper toolkit by Daniel Rodríguez/
Understanding model reliability, part of a complete course on statistics and data analysis for psychologists by Michael Waskom.
Graphical Representations of Linear Models, an illustration of the Seaborn statistical visualization library, that also includes Visualizing distributions of data and Representing variability in timeseries plots. By Michael Waskom.
Desperately Seeking Silver, one of the homework sets for Harvard’s CS 109 Data Science course.
The classic ‘An Introduction to Statistical Learning with Applications in R’ by James, Witten, Hastie, Tibshirani (2013), has not one but two collections of notebooks to accompany the book with Python (instead of the book’s default R examples). One by Jordi Warmenhoven and one by Matt Caudill.
Python Notebooks for StatLearning Exercises, Python implementations of the R labs for the StatLearning: Statistical Learning online course from Stanford University taught by Profs Trevor Hastie and Rob Tibshirani.
Applied Predictive Modeling with Python, Python implementations of the examples (originally written in R) from a famous introductory book, Applied Predictive Modeling, by Max Kuhn and Kjell Johnson.
A collection of four courses in foundations of data science, algorithms and databases from multiple faculty at Columbia University’s Lede Program.
SciPy and OpenCV as an interactive computing environment for computer vision by Thiago Santos, a tutorial presented at SIBGRAPI 2014.
Kalman and Bayesian Filters in Python, by Roger Labbe.
Adaboost for digit classification, by Shashwat Shukla. A complete implementation of Adaboost in Python, with code for digit recognition.
An example machine learning notebook, by Randal. S. Olson, part of a collection in Data Analysis and Machine Learning.
Pandas .head() to .tail(), an in-depth tutorial on Pandas by Tom Augspurger.
Apache SINGA tutorial. A Python tutorial for deep learning with SINGA.
Data Science Notebooks, a frequently updated collection of notebooks on statistical inference, data analysis, visualization and machine learning, by Donne Martin.
ETL with Python, a tutorial for ETL (Extract, Transfer and Load) using python petl package, loading to MySQL and working with csv files by Dima Goldenberg.
the-elements-of-statistical-learning, a collection of notebooks implementing the algorithms, reproducing the graphics found in the book “The Elements of Statistical Learning” by Trevor Hastie, Robert Tibshirani and Jerome Friedman and summary of the textbook.
Classification on raw time series in TensorFlow with a LSTM RNN, by Guillaume Chevalier.
Signal forecasting with a Sequence-to-Sequence (seq2seq) Recurrent Neural Network (RNN) model in TensorFlow, by Guillaume Chevalier.
A Coding Kata using Notebooks in Google Colab: Achieve Clean Machine Learning From Dirty Code.
Interactive Machine Learning Experiments - collection of notebooks that use convolutional neural networks (CNNs), recurrent neural networks (RNNs) and multilayer perceptrons (MLPs) to solve basic machine learning tasks like objects detection and classification, sequence-to-sequence predictions etc.

Mathematics, Physics, Chemistry, Biology

A single-atom laser model. This is one of a complete set of lectures on quantum mechanics and quantum optics using QuTiP by J.R. Johansson.
2-d rigid-body transformations. This is part of Scientific Computing in Biomechanics and Motor Control, a complete collection of notebooks by Marcos Duarte.
Astrophysical simulations and analysis with yt: a collection of example notebooks on using various codes that yt interfaces with: Enzo, Gadget, RAMSES, PKDGrav and Gasoline. Note: the yt site currently throws an SSL warning, they seem to have an outdated or self-signed certificate.
Working with Reactions, part of a set of tutorials on cheminformatics and machine learning with the rdkit project, by Greg Landrum.
CFD Python: 12 steps to Navier-Stokes. A complete set of lectures on Computational Fluid Dynamics, from 1-d linear waves to full 2-d Navier-Stokes, by Lorena Barba.
Pytherm - Applied Thermodynamics. Lectures on applied thermodynamics using Python and the SciPy ecosystem, by ATOMS.
AeroPython: Aerodynamics-Hydrodynamics with Python, a complete course taught at George Washington University by Lorena Barba.
Practical Numerical Methods with Python, a collection of learning modules (each consisting of several IPython Notebooks) for a course in numerical differential equations taught at George Washington University by Lorena Barba. Also offered as a “massive, open online course” (MOOC) on the GW SEAS Open edX platform.
Get Data Off the Ground with Python by Lorena Barba: Learn to interact with Python and handle data with Python; assumes no coding experience and creates a foundation in programming applied to technical contexts. With an accompanying online course.
Take Off with Stats in Python by Lorena Barba: Hands-on data analysis using a computational approach and real-life applications. With an accompanying online course.
Tour the dynamics of change and motion by Lorena Barba: Tour of the dynamics of change and motion using computational thinking with Python. With an accompanying online course.
pyuvvis: tools for explorative spectroscopy, spectroscopy library built for integration ipython notebooks, matplotlib and pandas.
HyperPython: a practical introduction to the solution of hyperbolic conservation laws, a course by David Ketcheson.
An Introduction to Applied Bioinformatics: Interactive lessons in bioinformatics, by Greg Caporaso.
Colour science computations with colour, a Python package implementing a comprehensive number of colour theory transformations and algorithms supported by a dedicated collection of IPython Notebooks. More colour science related IPython Notebooks are available on colour-science.org.
The notebooks from the Book Bioinformatics with Python Cookbook, covering several fields like Next-Generation Sequencing, Population Genetics, Phylogenetics, Genomics, Proteomics and Geo-referenced information.
Learning Population Genetics in an RNA world is an interactive notebook that explains basic population genetics tools and techniques by building an in silico evolutionary model of RNA molecules.
An open RNA-Seq data analysis pipeline tutorial with an example of reprocessing data from a recent Zika virus study. This notebook fully reproduces the research published in this paper. The notebook uses mostly python but includes some bash and R as well and is relevant for researchers in bioinformatics and public health.
Lung Cancer Post-Translational Modification and Gene Expression Regulation. This Python notebook uses the Jupyter-widget Clustergrammer-Widget to visualize hierarchical clustering of gene expression and post-translational modification data from 37 lung cancer cell lines as an interactive heatmap. The notebook is part of the research project from this paper.
Materials Science in Python using pymatgen. A series of python notebooks using the pymatgen package and materials project API for materials science.
Applied Elasticity: A collection of notebooks used to determine solutions to some classical 2D elasticity problems. These were mostly live coded during class hours by Jeevanjyoti Chakraborty as part of the course “Applied Elasticity” in the Mechanical Engineering Department of the Indian Institute of Technology Kharagpur.

Earth Science and Geo-Spatial data

EarthPy, a collection of IPython notebooks with a focus on Earth Sciences, from whale tracks to the flow of the Amazon.
Python for Geosciences, a tutorial series aimed at the Earth Sciences community, by Nikolay Koldunov.
Find graffiti close to NY subway entrances, one of a rich collection of notebooks on large-scale data analysis, by Roy Hyunjin Han.
Logistic models of well switching in Bangladesh, part of the “Will it Python” blog series (repo) on Machine Learning and data analysis in Python. By Carl Vogel.
Estimated likelihood of observing a large earthquake on a continental low‐angle normal fault and implications for low‐angle normal fault activity, an executable version of a paper by Richard Styron and Eric Hetland published in Geophysical Research Letters, on earthquake probabilities.
python4oceanographers, a blog demonstrating analyses in physical oceanography from resource-demanding numerical computations with functions in compiled languages to specialized tidal analysis to visualization of various geo data using fancy things like interactive maps.
Machinalis has a public repo with material support for geospatial-data processing related blog posts. It includes notebooks about Object Based Image Analysis and irrigation circles detection.
seismo-live is a collection of live Jupyter notebooks for seismology. It includes a fairly large number of notebooks on how to solve the acoustic and elastic wave equation with various different numerical methods. Additionally it contains notebooks with an extensive introduction to data handling and signal processing in seismology, and notebooks tackling ambient seismic noise, rotational and glacial seismology, and more.
Geo-Python is an introduction to programming in Python for Bachelors and Masters students in geo-fields (geology, geophysics, geography) taught by members of the Department of Geosciences and Geography at University of Helsinki, Finland. Course lessons and exercises are based on Jupyter notebooks and open for use by any interested person.

Linguistics and Text Mining

Workshop on text analysis by Neal Caren.
Detecting Algorithmically Generated Domains, part of the Data Hacking collection on security-oriented data analysis with IPython & friends.
Mining the Social Web (3rd Edition). A complete collection of notebooks accompanying Matthew Russell and Mikhail Klassen’s book by O’Reilly.

Engineering Education

Introduction to Chemical Engineering Analysis by Jeff Kantor. A collection of IPython notebooks illustrating topics in introductory chemical engineering analysis, including stoichiometry, generation-consumption analysis, mass and energy balances.
Sensors and Actuators by Andres Marrugo. A collection of Jupyter notebooks in the form of lecture notes and engineering calculations for the course IMTR 1713 Sensors and Actuators taught at the Universidad Tecnológica de Bolívar.

Scientific computing and data analysis with the SciPy Stack

General topics in scientific computing

Algorithms in IPython notebooks, by Sebastian Raschka
Comparing the performance of Python compilers - Cython vs. Numba vs. Parakeet, by Sebastian Raschka
A Crash Course in Python for Scientists, by Sandia’s Rick Muller.
A gentle introduction to scientific programming in Python, biased towards biologists, by Mickey Atwal, Cold Spring Harbor Laboratory.
Python for Data Science, a self-contained mini-course with exercises, by Joe McCarthy.
First few lectures of the UW/Coursera course on Data Analysis. (Repo) by Chris Fonnesbeck.
CythonGSL: a Cython interface for the GNU Scientific Library (GSL) (Project repo, by Thomas Wiecki.
Introduction to numerical computing with numpy by Steve Phelps.
Using Numba to speed up numerical codes. And another Numba example: self-organizing maps.
Numpy performance tricks, and blog post, by Cyrille Rossant.
IPython Parallel Push/Execute/Pull Demo by Justin Riley.
Understanding the design of the R “formula” objects by Matthew Brett.
Comparing different approaches to evolutionary simulations. Also available here to better visualization. The notebook was converted to a HTML presentation using an old nbconvert with the first developing implementation of reveal converter. By Yoav Ram.
The Traveling Salesperson Problem by Peter Norvig.
A git tutorial targeted at scientists by Fernando Perez.
Running MATLAB in an IPython Notebook, using pymatbridge.
Interactive Curve-Fitting The lmfit package provides a widget-based interface to the curve-fitting algorithms in SciPy.
A visual guide to the Python Spark API for distributed computing by Jeff Thompson
A tutorial on Map-Reduce programming with Apache Spark and Python by Steve Phelps.
CodeCombat gridmancer solver by Arn-O. This notebook explains how to improve a recursive tree search with an heuristic function and to find the minimum solution to the gridmancer.

Survival Analysis, an illustration of the lifelines library, by Cam Davidson Pilon.
A reconstruction of Nate Silver’s 538 model for the 2012 US Presidential Election, by Skipper Seabold (complete repo).
Data about the Sandy Hook massacre in Newtown, Conneticut, which accompanies a more detailed blog post on the subject. Here are the notebook and accompanying data. By Brian Keegan.
More on gun violence analysis with Wikipedia data.
An analysis of the Gaza-Israel 2012 crisis.
Ranking NFL Teams. The full repo also includes an explanatory slideshow. By Sean Taylor.
Automated processing of news media and generation of associated imagery.
An analysis of national school standardized test data in Colombia using Pandas (in Spanish). By Javier Moreno.
Getting started with GDELT, by David Masad. GDELT is a dataset containing more than 200-million geolocated events with global coverage for 1979 to the present. Another GDELT example from David, that nicely integrates mapping visualizations.
Titanic passengers, coal mining disasters, and vessel speed changes, by Christopher Fonnesbeck
A geographic analysis of Indonesian conflicts in 2012 with GDELT, by herrfz.
Bioinformatic Approaches to the Computation of Poetic Meter, by A. Sean Pue, C. Titus Brown and Tracy Teal.
Analyzing the Vélib dataset from Paris, by Cyrille Rossant (Vélib is Paris’ bicycle-sharing program).
Using Python to see how the Times writes about men and women, by Neal Caren.
Exploring graph properties of the Twitter stream with twython and NetworkX, by F. Perez (complete gist repo with utilities here.)
Kaggle Competition: Titanic Machine Learning from Disaster. By Andrew Conti.
How clean are San Francisco’s restaurants?, a data science tutorial that accompanies a blog post from Zipfian Academy.
NYT gender wage gap and US crime by state.
Predicting usage of the subway system in NYC, a final project for the Udacity Intro to Data Science Course, by Asim Ihsan.
An exploratory statistical analysis of the 2014 World Cup Final, by Ricardo Tavares. Part of a notebook collection on football (aka soccer) analysis.
San Francisco’s Drug Geography, a GIS analysis of public crime data in SF, by Lance Martin.
Geographic Data Science is an entire course by Dani Arribas-Bel to learn to access, munge, and analyse spatial data on social phenomena.
Analysis and visualization of a public OKCupid profile dataset using Python and Pandas by Alessandro Giusti includes many colorful data visualizations.

Psychology and Neuroscience

Cue Combination with Neural Populations by Will Adler. Intuition and simulation for the theory (Ma et al., 2006) that through probabilistic population codes, neurons can perform optimal cue combination with simple linear operations. Demonstrates that variance in cortical activity, rather than impairing sensory systems, is an adaptive mechanism to encode uncertainty in sensory measurements.
Modeling psychophysical data with non-linear functions by Ariel Rokem.
Visualizing mathematical models of brain cell connections. The effect of convolution of different receptive field functions and natural images is examined.
Python for Vision Research. A three-day crash course for vision researchers in programming with Python, building experiments with PsychoPy and psychopy_ext, learning the fMRI multi-voxel pattern analysis with PyMVPA, and understading image processing in Python.
Loading and visualizing fMRI data, part of the Functional connectivity with NiLearn course by Gaël Varoquaux.

Machine Learning, Statistics and Probability

A tutorial introduction to machine learning with sklearn, an IPython-based slide deck by Andreas Mueller.
Introduction to Machine Learning in Python with scikit-learn by Cyrille Rossant, a free recipe from the IPython Cookbook, a comprehensive guide to Python for Data Science.
An introduction to Predictive Modeling in Python, by Olivier Grisel.
Face Recognition on a subset of the Labeled Faces in the Wild dataset, by Olivier Grisel.
An Introduction to Bayesian Methods for Multilevel Modeling, by Chris Fonnesbeck.
Introduction to Bayesian Networks by Kui Tang
Bayesian data analysis with PyMC3 by Thomas Wiecki.
A collection of examples for solving pattern classification problems, by Sebastian Raschka.
Introduction to Linear Regression using Python by Kevin Markham
Machine learning in Python, a series based on Andrew Ng’s Coursera class on machine learning. Part of a larger collection of data science notebooks by John Wittenauer.
Probability, Paradox, and the Reasonable Person Principle, by Peter Norvig.
How Likely Would You Give A Five-Star Review on Yelp? – Getting Your Hands Dirty with scikit-learn, by Xun Tang. Complimentary slides.
Geodemographic Segmentation Model, by Filipa Rodrigues

Physics, Chemistry and Biology

Writing A Genome Assembler with blasr and (I)Python, by [Jason Chin](Jason Chin).
Multibody dynamics and control with Python and the notebook file by Jason K. Moore.
Manipulation and display of chemical structures, by Greg Landrum, using rdkit.
The sound of Hydrogen, visualizing and listening to the quantum-mechanical spectrum of Hydrogen. By Matthias Bussonnier.
Particle physics at the Large Hadron Collider (LHC): using ROOT in an LHCb masterclass: Notebook 1 and Notebook 2 notebooks by Alexander Mazurov and Andrey Ustyuzhanin at CERN.
A Reaction-Diffusion Equation Solver in Python with Numpy, a demonstration of how IPython notebooks can be used to discuss both the theory and implementation of numerical algorithms on one page, by Georg Walther.
Comparing different approaches to evolutionary simulations. Also available here to better visualization. The notebook was converted to a HTML presentation using an old nbconvert with the first developing implementation of reveal converter. By Yoav Ram.

Economics and Finance

Replication of the highly-contentious analysis of economic growth by Reinhart and Rogoff, by Vincent Arel-Bundock, full repo here. This is based on the widely-publicized critique of the original analysis done by Herndon, Ash, and Pollin.
fecon235 for Financial Economics series of notebooks which examines time-series data for economics and finance. Easy API to freely access data from the Federal Reserve, SEC, CFTC, stock and futures exchanges. Thus research from older notebooks can be replicated, and updated using the most current data. For example, this notebook forecasts likely Fed policy for setting the Fed Funds rate, but market sentiment across major asset classes is observable from the CFTC Commitment of Traders Report. Major economics indicators are renormalized: for example, various measures of inflation, optionally with the forward-looking break-even rates derived from U.S. Treasury bonds. Other notebooks examine international markets: especially, gold and foreign exchange.
Fixed Income: A Structured Bond- Interactive scenarios , Sequential repayment of a bond using interactive widgets and Python in Jupyter, by Mats Gustavsson.

Earth science and geo-spatial data

Exploring seafloor habitats: geographic analysis using IPython Notebook with GRASS & R. This embeds a slideshow and a Web Spinning Globe (Cesium) in the notebook. By Massimo Di Stefano.
Geo-Spatial Data with IPython. Tutorial by Kelsey Jordahl from SciPy2013.

Data visualization and plotting

Plotting pitfalls: common problems when plotting large datasets, and how to avoid them. By James A. Bednar.
US Census data and NYC Taxi data visualized using datashader.
A Notebook with an interactive Hans Rosling Gapminder bubble chart from Plotly.
Data and visualization integration via web based resources. Using NetCDF, Matplotlib, IPython Parallel and ffmpeg to generate video animation from time series of gridded data. By Massimo Di Stefano.
21 Interactive, D3 Plots from matplotlib, ggplot for Python, prettyplotlib, Stack Overflow, and seaborn.
Visualizing complex-valued functions with Matplotlib and Mayavi, by Emilia Petrisor.
bqplot is a d3-based interactive visualization library built entirely on top of that ipywidgets infrastructure. Checkout the pythonic recreation of Hans Rosling’s Wealth of Nations.
A D3 Viewer for Matplotlib Visualizations, different from above by not depending on Plot.ly account.
Bokeh is an interactive web visualization library for Python (and other languages). It provides d3-like novel graphics, over large datasets, all without requiring any knowledge of Javascript. It also has a Matplotlib compatibility layer.
HoloViews lets you construct visualizations very concisely in the notebook.
Winner of the 2014 E. Tufte Slope Graphs contest, by Pascal Schetelat. The original contest info on Tufte’s site.
matta, d3.js-based visualizations in the IPython Notebook, by Eduardo Graells-Garrido.
Clustergrammer Interactive Heatmap and DataFrame Viewer This Python notebook shows a simple example of how to visualize a matrix file and Pandas DataFrame as an interactive heatmap (built using D3.js) using the Jupyter Widget Clustergrammer (see paper).
The Jupyter Widget Ecosystem - SciPy 2019 Tutorial on ipywidgets - a collection of 40 notebooks.

Mathematics

Linear algebra with Cython. A tutorial that styles the notebook differently to show that you can produce high-quality typography online with the Notebook. By Carl Vogel.
Exploring how smooth-looking functions can have very surprising derivatives even at low orders, combining SymPy and matplotlib. By Javier Moreno.
A Collection of Applied Mathematics and Machine Learning Tutorials (in Turkish) and its English Translation By Burak Bayramli.
Function minimization with iminuit, an introductory companion to their hard core tutorial. By the iminuit project.
The Discrete Cosine Transform, a brief explanation and illustration of the math behind the DCT and its role in the JPEG image format, by Jim Mahoney.
Chebfun in Python, a demo of PyChebfun, by Olivier Verdier. PyChebfun is a pure-python implementation of the celebrated Chebfun package by Battles and Trefethen.
The Matrix Exponential, an introduction to the matrix exponential, its applications, and a list of available software in Python and MATLAB. By Sam Relton.
Fractals, complex numbers, and your imagination, by Caleb Fangmeier.
A SymPy tutorial, by Andrey Grozin.
Introduction to Mathematics with Python, a collection of notebooks aimed at Mathematicians with no/little Python knowledge. Notebooks can be selected to serve as resources for a workshop. By Vince Knight.

Signal and Sound Processing

Simulation of Delta Sigma modulators in Python with deltasigma, Python port of of Richard Schreier’s excellent MATLAB Delta Sigma Toolbox, by Giuseppe Venturini. Several demonstrative notebooks on the package README.
PyOracle: Automatic analysis of musical structure, by Greg Surges.
A Gallery of SciPy’s Window Functions for quick visual inspection and comparison by Jaidev Deshpande
Poisson Image Editing | Seamless Cloning by Dhruv Ilesh Shah is a notebook that achieves Seamless Image Cloning by employing the Poisson Solver in the iterative form.
Blind Source Separation | Cocktail Party Problem by Dhruv Ilesh Shah & Shashwat Shukla is a notebook that achieves blind source separation, on audio signals in an attempt to approach the Cocktail Party Prblem. The problem has been tackled in two different methods - the FOBI and fastICA.

Natural Language Processing

Python Programming for the Humanities by Folgert Karsdorp & Maarten van Gompel.
News Categorization using Multinomial Naive Bayes by Andres Soto Villaverde.
Using random cross-validation for news categorization by Andres Soto Villaverde.
Named Entity Recognition in French biomedical text by Andrés Soto Villaverde
Named Entity Recognition in French biomedical text (Part 2) by Andrés Soto Villaverde

Pandas for data analysis

Note that in the ‘collections’ section above there are also pandas-related links, such as the one for an 11-lesson tutorial.

Python Pandas DataFrame Basics, this notebook explains the basic concepts of a pandas data frame from scratch for beginners with examples, by Tanu Nanda Prabhu.
A 10-minute whirlwind tour of pandas, this is the notebook accompanying a video presentation by Wes McKinney, author of Pandas and the Python for Data Analysis book.
Manipulating the data with Pandas using Python, this notebook explains various operations and methods of Pandas library from the scratch with the help of an example, by Tanu Nanda Prabhu.
Time-series analysis with Pandas.
Financial data analysis with Pandas.
Clustering of smartphone sensor data for human activity detection using pandas and scipy, part of Coursera data analysis course, done in Python (repo).
Log analysis with Pandas, part of a group presented at PyConCa 2012 by Taavi Burns.
Analyzing and visualizing sun spot data with Pandas, by Josh Hemann. An enlightening discussion of how naive plotting choices subtly influence our interpretation of data.
Advanced analysis of Apache logs, by Nikolay Koldunov.
Statistical Data Analysis in Python, by Christopher Fonnesbeck, SciPy 2013. Companion videos 1, 2, 3, 4

General Python Programming

How_to_get_started_coding_in_Python?, this notebook explains how to become a good python programmer, by Tanu Nanda Prabhu, author and editor at Towards data science
Python Strings from Scratch !!!, this notebook explains Python Strings from basic to advance level, by Tanu Nanda Prabhu
Python Tuples from Scratch !!!, this notebook explains Python Tuples from basic to advance level, by Tanu Nanda Prabhu
Python Dictionary from Scratch !!!, this notebook explains Python Dictionary from basic to advance level, by Tanu Nanda Prabhu
Python Lists from Scratch !!!, this notebook explains Python Lists from basic to advance level with the help of an example, by Tanu Nanda Prabhu.
Learning to code with Python, part of an introduction to Python from the Waterloo Python users group.
Introduction to Python for Data Scientists by Steve Phelps (part of a larger collection on Data Science and Big Data).
Python Descriptors Demystified, an in-depth discussion of the descriptor protocol in Python, by Chris Beaumont.
A collection of not so obvious Python stuff you should know!, by Sebastian Raschka.
Key differences between Python 2.7.x and Python 3.x, by Sebastian Raschka.
A beginner’s guide to Python’s namespaces, scope resolution, and the LEGB rule, by Sebastian Raschka.
Sorting CSV files using the Python csv module, by Sebastian Raschka.
Python 3 OOP series by Leonardo Giordani: Part 1: Objects and types, Part 2: Classes and members, Part 3: Delegation - composition and inheritance, Part 4: Polymorphism, Part 5: Metaclasses, Part 6: Abstract Base Classes
How to Aggregate Subscriber’s Interest using the 3 methods: (1) Python Dictionary, (2) Apache PySpark - GroupBy Transformation, and (3) Apache PySpark - ReduceBy Transformation by Abbas Taher.

Notebooks in languages other than Python

These are notebooks that use [one of the IPython kernels for other languages](IPython kernels for other languages):

Julia

The IPython protocols to communicate between kernels and clients are language agnostic, and other programming language communities have started to build support for this protocol in their language. The Julia team has created IJulia, and these are some Julia notebooks:

Fractals 3 ways, by Jeff Bezanson.
The Design Impact of Multiple Dispatch, a detailed explanation of Julia’s multiple dispatch design, by Stefan Karpinski.
A tutorial on making interactive graphs with Plotly and Julia.
Numerical tours in Julia
Functional Geometry by Shashi Gowda
JuliaOpt notebooks, a collection of optimization-related notebooks.
Coursework using IJulia notebooks:
- Métodos Numéricos Avanzados (2015-2), Luis Benet and David P. Sanders
- Métodos Monte Carlo, David Sanders
- Linear Partial Differential Equations: Analysis and Numerics, Steven G. Johnson
- Julia tutorial for Computational Molecular Biology, Younhun Kim and Matthew Reyna
Other collections of IJulia notebooks:
- Jiahao Chen
- Christoph Ortner
- Crossing Language Barriers with Julia, Scipy, and IPython, presented at EuroSciPy ‘14 by Steven G. Johnson.

Haskell

There exists a Haskell kernel for IPython in the IHaskell project.

IHaskell Demo Notebook
Homophone reduction, a solution to a cute problem involving treating English letters as generators of a large group.
Gradient descent typeclass, a look at how arbitrary gradient descent algorithms can be represented with a typeclass.

OCaml

iocaml is an OCaml kernel for IPython

Ruby

Similar to the Julia kernel there exists also a Ruby kernel for IPython.

The interactive plotting library Nyaplot has some case studies using IRuby:

Perl

An example showcasing full use of the display protocol with the IPerl kernel.

F#

F# for Jupyter Notebooks

C#

Xamarin Workbooks Create a rich C# workbook for Android, iOS, Mac, WPF, or Console, and get instant live results as you learn these APIs.

Javascript

Two IJavascript notebooks that demonstrate how to use D3 to do computations and send a SVG back and play with a virtual DOM

Miscellaneous topics about doing various things with the Notebook itself

Blogging With IPython in Blogger, also available in blog post form, full repo here. By Fernando Perez.
Blogging With IPython in Octopress, by Jake van der Plas and available as a blog post. Other notebooks by Jake contain many more great examples of doing interesting work with the scientific Python stack.
Blogging With IPython in Nikola, also available in blog post form by Damián Avila.
Custom CSS control of the notebook, this is part of a blog repo by Matthias Bussonnier.
IPython display hookery: tools to help display visual output from various sources, a gist by @deeplook.
Importing IPython Notebooks as Modules by Min RK.
Getting Started With Jupyter Notebooks for Teaching and Learning by Tony Hirst at OpenLearn
Toward Data Science blogs:
- Bringing the best out of Jupyter Notebooks for Data Science
- Boosting Your Jupyter Notebook Productivity

Reproducible academic publications

This section contains academic papers that have been published in the peer-reviewed literature or pre-print sites such as the ArXiv that include one or more notebooks that enable (even if only partially) readers to reproduce the results of the publication. If you include a publication here, please link to the journal article as well as providing the nbviewer notebook link (and any other relevant resources associated with the paper).

Revealing ferroelectric switching character using deep recurrent neural networks. Github page where code is located. Jupyter Paper. Raw Data
Discovery of Gravitational Waves by the LIGO collaboration. That page, from the LIGO Open Science Center, contains multiple notebooks for various datasets corresponding to different events; this binder lets you run the code right away. More details on the GW150914 event as well as the original main Physical Review Letters paper, “Observation of Gravitational Waves from a Binary Black Hole Merger”.
Characterizing Strain Variation in Engineered E. coli Using a Multi-Omics-Based Workflow, by Brunk et al.
Predicting Coronal Mass Ejections Using Machine Learning Methods by Monica Bobra and Stathis Ilonidis (Astrophysical Journal, 2016). An IPython notebook, which reproduces all the results, has been permanently deposited in the Stanford Digital Repository.
The Paper of the Future by Alyssa Goodman et al. (Authorea Preprint, 2017). This article explains and shows with demonstrations how scholarly “papers” can morph into long-lasting rich records of scientific discourse, enriched with deep data and code linkages, interactive figures, audio, video, and commenting. It includes an interactive d3.js visualization and has an astronomical data figure with an IPYthon Notebook “behind” it.
Reply to ‘Influence of cosmic ray variability on the monsoon rainfall and temperature’: a false-positive in the field of solar-terrestrial research by Benjamin Laken, 2015. Reviewed article will appear in JASTP. The IPython notebook reproduces the full analysis and figures exactly as they appear in the article, and is available on Github: link via figshare.
An open RNA-Seq data analysis pipeline tutorial with an example of reprocessing data from a recent Zika virus study, by Zichen Wang and Avi Ma’ayan. (F1000Research 2016, 5:1574). An IPython notebook was used to perform the proposed RNA-Seq pipeline using public gene expression data of human cells after Zika virus infection. The computational pipeline is also version controlled and Dockerized available here.
The probability of improvement in Fisher’s geometric model: a probabilistic approach, by Yoav Ram and Lilach Hadany. (Theoretical Population Biology, 2014). An IPython notebook, allowing figure reproduction, was deposited as a supplementry file.
Stress-induced mutagenesis and complex adaptation, by Yoav Ram and Lilach Hadany (Proceedings B, 2014). An IPython notebook, allowing figures reproduction, was deposited as a supplementry file.
Automatic segmentation of odor maps in the mouse olfactory bulb using regularized non-negative matrix factorization, by J. Soelter et al. (Neuroimage 2014, Open Access). The notebook allows to reproduce most figures from the paper and provides a deeper look at the data. The full code repository is also available.
Multi-tiered genomic analysis of head and neck cancer ties TP53 mutation to 3p loss, by A. Gross et al. (Nature Genetics 2014). The full collection of notebooks to replicate the results.
Dog and human inflammatory bowel disease rely on overlapping yet distinct dysbiosis networks, by Vázquez-Baeza et al. (Nature microbiology 2016). The full collection of notebooks to replicate the results.
powerlaw: a Python package for analysis of heavy-tailed distributions, by J. Alstott et al.. Notebook of examples in manuscript, ArXiv link and project repository.
Collaborative cloud-enabled tools allow rapid, reproducible biological insights, by B. Ragan-Kelley et al.. The main notebook, the full collection of related notebooks and the companion site with the Amazon AMI information for reproducing the full paper.
A Reference-Free Algorithm for Computational Normalization of Shotgun Sequencing Data, by C.T. Brown et al.. Full notebook, ArXiv link and project repository.
The kinematics of the Local Group in a cosmological context by J.E. Forero-Romero et al.. The Full notebook and also all the data in a github repo.
Warming Ocean Threatens Sea Life, an article in Scientific American backed by a notebook for its main plot. By Roberto de Almeida from MarinExplore.
Extrapolating Weak Selection in Evolutionary Games, by Wu, García, Hauert and Traulsen. PLOS Comp Bio paper and Figshare link.
Using neural networks to estimate redshift distributions. An application to CFHTLenS by Christopher Bonnett paper(submitted to MNRAS)
Mechanisms for stable, robust, and adaptive development of orientation maps in the primary visual cortex by Jean-Luc R. Stevens, Judith S. Law, Jan Antolik, and James A. Bednar. Journal of Neuroscience, 33:15747-15766, 2013. [Notebook1] (https://ioam.github.io/topographica/_static/gcal_notebook.html), Notebook2.
Accelerated Randomized Benchmarking, by Christopher Granade, Christopher Ferrie and D. G. Cory. New Journal of Physics 17 013042 (2015), arXiv, GitHub repo.
Dynamics and associations of microbial community types across the human body, by Tao Ding & Patrick D. Schloss. Notebook replicating results.
Variations in submarine channel sinuosity as a function of latitude and slope, by Sylvester, Z., Pirmez, C., Cantelli, A., & Jobe, Z. R.
Frontoparietal representations of task context support the flexible control of goal directed cognition, by M.L. Waskom, D. Kumaran, A.M. Gordon, J. Rissman, & A.D. Wagner. Github repository | Main notebook
pyparty: Intuitive Particle Processing in Python, Adam Hughes Notebook to Generate the Published Figures | Also, check out the pyparty tutorial notebooks.
Indication of family-specific DNA methylation patterns in developing oysters, Claire E. Olson, Steven B. Roberts doi: https://dx.doi.org/10.1101/012831. Notebook to generate results in the paper.
Parallel Prefix Polymorphism Permits Parallelization, Presentation & Proof, Jiahao Chen and Alan Edelman, HPTCDL'14. Website and notebook
Transcriptome Sequencing Reveals Potential Mechanism of Cryptic 3’ Splice Site Selection in SF3B1-mutated Cancers by Christopher DeBoever et al. There are several notebooks to replicate results and make figures.
A Workflow for Characterizing Nanoparticle Monolayers for Biosensors: Machine Learning on Real and Artificial SEM Images, Adam Hughes, Zhaowen Liu, Maryam Raftari, Mark. E Reeves. Notebooks are linked in Table 1 in the text.
AtomPy: An Open Atomic Data Curation Environment for Astrophysical Applications, by C. Mendoza, J. Boswell, D. Ajoku, M. Bautista.
Visualizing 4-Dimensional Asteroids, in Scientific American (by Jake VanderPlas)
Challenges and opportunities in understanding microbial communities with metagenome assembly, accompanied by IPython Notebook tutorial, by Adina Howe and Patrick Chain.
Structure of a shear-line polar low (2016) by Sergeev, D. E., Renfrew, I. A., Spengler, T. and Dorling, S. R. Q.J.R. Meteorol. Soc. doi:10.1002/qj.2911. Accompanied by Notebooks to generate the published figures.
Detecting High-Order Epistasis in Nonlinear Genotype-Phenotype Maps by Zachary R. Sailer and Michael J. Harms published in Genetics, March 2017 . All figures can be reproduced by the set of notebooks in this Github repo.
Summary Analysis of the 2017 GitHub Open Source Survey by Stuart Geiger. Preprint in SocArXiv, June 2017. doi:10.17605/OSF.IO/ENRQ5. Paper is derived from a notebook converted to LaTeX with nbconvert. Notebook and materials at: OSF, GitHub, nbviewer
The weirdest SDSS galaxies: results from an outlier detection algorithm, by D. Baron and D. Poznanski. Notebooks to replicate.
Clustergrammer, a web-based heatmap visualization and analysis tool for high-dimensional biological data, by Nicolas Fernandez et al. Notebooks: Fig. 3, Fig. 4, Fig. 5
Sociology: An investigation of Social Class Inequalities in General Cognitive Ability in Two British Birth Cohorts. Preprint in SocArXiv, December 2017. doi: 10.17605/OSF.IO/SZXDM. Notebook and materials at: OSF, GitHub, nbviewer.
An on-chip architecture for self-homodyned nonclassical light, quant-ph ArXiV preprint, Nov 2016, by Fischer et al. A supporting notebook for all calculations included in the ArXiV submission.
A nested sampling code for targeted searches for continuous gravitational waves from pulsars, gr-qc ArXiV preprint, May 2017, by Pitkin et al. Complete repo with supporting notebooks and sources on GitHub.
HyperTools: A Python toolbox for visualizing and manipulating high-dimensional data, stat.OT ArXiV preprint by Heusser et al. A repo with companion notebooks is available, that links to the library itself, HyperTools.
Nonsinusoidal beta oscillations reflect cortical pathophysiology in Parkinson’s disease, in Journal of Neuroscience by Cole et al. A repo with companion notebooks with all necessary data is available to reproduce all figures.
Cycle-by-cycle analysis of neural oscillations, in bioRxiv by Cole & Voytek. A repo with companion notebooks with all necessary data is available to reproduce all figures. This repo also links to the related useful library, neurodsp, which contains notebooks of tutorials.
pyKNEEr: An image analysis workflow for open and reproducible research on femoral knee cartilage, a preprint by S. Bonaretti et al. Jupyter notebooks are used as a graphical user interface for medical image processing and analysis. The paper is interactive, with links to data, software, and documentation throughout the text. Every figure caption contains links to fully reproduce graphs.

Data-driven journalism

The Need for Openness in Data Journalism, by Brian Keegan.
St. Louis County Segregation Analysis , analysis for the article The Ferguson Area Is Even More Segregated Than You Probably Guessed by Jeremy Singer-Vine.
Size of thesis and dissertations in Quebec, by Jean-Hugues Roy (in French).

Whimsical notebooks

XKCD-styled plots created with Matplotlib. Here is the blog post version with discussion. By Jake van der Plas.
Van Gogh’s Starry Night with ipythonblocks, part of Matt Davis’ ipythonblocks. This is a teaching tool for use with the IPython notebook that provides visual elements to understand programming concepts.
Conway’s Game of Life. Interesting use of convolution operation to calculate the next state of game board, instead of obvious find neighbors and filter the board for next state.
pynguins. Using jupyter notebook, python, and numpy to solve Board Game “Penguins on Ice”.
“People plots”, stick figures generated with matplotlib.
Reveal converter mini-tutorial, also available in blog post form. Do you want to make static html/css slideshow straight from the IPython notebook? OK, now you can do it with the reveal converter (nbconvert). Demo by Damián Avila.
Personal IPython Weight Notebook. Plot your loss of weight with prognosis and motivation features.
Streaming Double Pendulum Simulation in IPython NB.
Porque Charles Xavier debe cambiar a Cerebro por Python, a study in data and gender in the Marvel comics universe, by Mai Giménez and Angela Rivera.
Functional Geometry: a deconstruction of the MC Escher woodcut Square Limit, an IJulia notebook by Shashi Gowda.
Solving physical puzzles with a Jupyter Noteboook.

Videos of IPython being used in the wild

Of course the first thing you might try is searching for videos about IPython (1900 or so by last count on Youtube) but there are demonstrations of other applications using the power of IPython but are not mentioned is the descriptions. Below are a few such:

Video on how to learn Python featuring IPython as the platform of choice for learning!
This video shows IPython being used in the scikit-learn project
He does not show IPython in use but his IPython sticker is clear for the entire video: Planning and Tending the Garden: The Future of Early Childhood Python Education
Wes McKinney’s speech on Python and data analysis features IPython as does his book Python for Data Analysis
This video shows Plotly and IPython in use at a Montreal Python meetup.

Accessing and programing a IBM quantum computer via notebooks

Github notebook example (scroll down) illustrating how to use Qiskit and access the IBMQ quantum computers.

Software Architecture

Let’s see How to Grow Neat Software Architecture out of Jupyter Notebooks, by Guillaume Chevalier.

46 - NCBI Papers with Code

NCBI Papers with Code

List of awesome NCBI Papers with Code Supplement.

CNN

Dual CNN for Relation Extraction with Knowledge-Based Attention and Word Embeddings Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6664687/ Code: https://github.com/mrlijun2017/Dual-CNN-RE
CNN-BLPred: a Convolutional neural network based predictor for β-Lactamases (BL) and their classes Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751796/ Code: https://github.com/whiteclarence/CNN-BLPred
Design of deep convolutional networks for prediction of image rapid serial visual presentation events Paper: https://www.ncbi.nlm.nih.gov/pubmed/29060296 Code: https://github.com/ZijingMao/ROICNN
A simple convolutional neural network for prediction of enhancer-promoter interactions with DNA sequence data Paper: https://www.ncbi.nlm.nih.gov/pubmed/30649185 Code: https://github.com/zzUMN/Combine-CNN-Enhancer-and-Promoters
A novel attention-based hybrid CNN-RNN architecture for sEMG-based gesture recognition Paper:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6207326/ Code: https://github.com/biopatrec/biopatrec
GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text Paper: https://www.ncbi.nlm.nih.gov/pubmed/29272325 Code: https://github.com/valdersoul/GRAM-CNN
Simple tricks of convolutional neural network architectures improve DNA-protein binding prediction Paper: https://www.ncbi.nlm.nih.gov/pubmed/30351403 Code: https://github.com/zhanglabtools/DNADataAugmentation
EnzyNet: enzyme classification using 3D convolutional neural networks on spatial representation Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5937476/ Code: https://github.com/shervinea/enzynet
Multi-timescale drowsiness characterization based on a video of a driver’s face Paper: https://www.telecom.ulg.ac.be/mts-drowsiness/ https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6165048/ Code: https://github.com/QMassoz/mts-drowsiness
CLoDSA: a tool for augmentation in classification, localization, detection, semantic segmentation and instance segmentation tasks Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6567576/ Code: https://github.com/joheras/CLoDSA
Deep learning with convolutional neural networks for EEG decoding and visualization Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5655781/ Code: https://github.com/robintibor/braindecode/ Code: https://github.com/TNTLFreiburg/braindecode

Rice/Paddy Classification

Classifying Oryza sativa accessions into Indica and Japonica using logistic regression model with phenotypic data Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6842562/ Code: https://github.com/bongsongkim/logit.regression.rice
SNNRice6mA: A Deep Learning Method for Predicting DNA N6-Methyladenine Sites in Rice Genome Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6797597/ Code: https://github.com/yuht4/SNNRice6mA
Automatic estimation of heading date of paddy rice using deep learning Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6626381/ Code: https://github.com/svdesai/heading-date-estimation
Distillation of crop models to learn plant physiology theories using machine learning Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6541271/ Code: https://github.com/ky0on/simriw
Evaluating remote sensing datasets and machine learning algorithms for mapping plantations and successional forests in Phnom Kulen National Park of Cambodia Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6814064/ Code: https://github.com/Jojo666/PKNP-Data
PlantCV v2: Image analysis software for high-throughput plant phenotyping Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5713628/ Code: https://github.com/danforthcenter/plantcv-v2-paper
Crop Yield Prediction Using Deep Neural Networks Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6540942/ Code: https://github.com/saeedkhaki92/Yield-Prediction-DNN
Using Deep Learning for Image-Based Plant Disease Detection Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5032846/ Code: https://github.com/salathegroup/plantvillage_deeplearning_paper_analysis
Deep Plant Phenomics: A Deep Learning Platform for Complex Plant Phenotyping Tasks Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5500639/ Code: https://github.com/p2irc/deepplantphenomics
DeepWeeds: A Multiclass Weed Species Image Dataset for Deep Learning Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6375952/ Code: https://github.com/AlexOlsen/DeepWeeds

Research

Research

1 - Reinforcement Learning

Reinforcement Learning

Reinfocement Learning Course

Algorithm

2 - PhD How To

PhD How To

3 - Digital Twin News

Digital Twin News

Digital Twin Policy

Digital Twin News

Digital Twin Tools

Digital Twin DIY

Digital Twin on Github

Digital Twin Research Group

4 - Generative Machine Learning GAN

Generative Machine Learning GAN

GAN

Generative Model Course

GAN Course

GAN Project/Paper

Style GAN

Paper

GAN Image Superresolution

GAN

Style-GAN

Research

Research

Music Generation

Video Generation

Colorization

Machine Learning: Image Generator

GAN

Generation

Lightboard

5 - Awesome Research and Academic Writing Assistant Tools

Awesome Research and Academic Writing Assistant Tools

Chrome Extension and Script

Multitools

Arxiv

PubMed and Google Scholar

Code

Jupyter Notebook

Citation Manager

Paper Finder

MS Word Add-ons

Citation Manager

Grammar

Research

Reference Tools

Article/Paper Summarization

Article/Paper Graph and Relationship

Article/Paper Reading

Article/Paper Exploration

Article/Paper Reading Tools

Web Annotation

Research Open Data

List of Researcher and Academia Tools

Research/Academia Forums

PDF-related Paper/Article Tools

Testing

Generic Paper Search

Paper Graph Search

Paper Graph Writing Assistant

6 - Awesome Science Blogs and Forums

Awesome Science Blogs and Forums

AI-related Blogs

7 - Paper and Article Search Tools

Paper and Article Search Tools

Article Search Flowchart

Article/Paper Search Tools

Thesis Search

Paper Search Engine (Sci-hub)

Article

8 - Awesome Conferences Tools

Awesome Conferences Tools

AI-related Conferences

AI Journals

9 - Machine Learning Teaching