First we have to import the libraries needed for this project.
Here we are using fastai, which is built on top of PyTorch.

Note: a live demo app, hosted with the help of Binder, is available as an example website: Indian ethnicity classifier.

Importing fastai

from fastbook import *
from fastai.vision.widgets import *

Using the Bing API to download images

To download images with Bing Image Search, sign up at Microsoft Azure for a free account. You will be given a key, which you can copy and enter in a cell as follows (replacing 'XXX' with your key and executing it):

key = os.environ.get('AZURE_SEARCH_KEY', 'XXX')

We can use the function below by providing the key, the search term, and the maximum number of images to download; it calls the Bing Image Search API endpoint and returns the results:

def search_images_bing(key, term, max_images: int = 100, **kwargs):
    # Query the Bing Image Search v7 API and return the result records as a fastcore L list.
    params = {'q': term, 'count': max_images}
    headers = {"Ocp-Apim-Subscription-Key": key}
    search_url = "https://api.bing.microsoft.com/v7.0/images/search"
    response = requests.get(search_url, headers=headers, params=params)
    response.raise_for_status()
    search_results = response.json()
    return L(search_results['value'])
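
Before downloading hundreds of images per group, it is worth a quick sanity check that the key and endpoint work; a minimal sketch (it assumes a valid AZURE_SEARCH_KEY and uses an arbitrary search term):

results = search_images_bing(key, 'dravidian people', max_images=5)  # small test query
print(len(results))                        # number of results returned
print(results.attrgot('contentUrl')[0])    # URL of the first image found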
    

Almost all Indians belong to one of these three broad ethnic groups, so these will be our categories:

ethnicGroups = 'indo-aryan','dravidian','indian mongloid'
path = Path('ethnicGroups')

Now we use fastai's download_images function to download the images into the path provided:

if not path.exists():
    path.mkdir()
for o in ethnicGroups:
    dest = (path/o)
    dest.mkdir(exist_ok=True)
    results = search_images_bing(key, f'{o} people')
    download_images(dest, urls=results.attrgot('contentUrl'))
 Download of https://atlanblackhouse.files.wordpress.com/2016/07/nazis-greek-olympics-indo-aryan-people.png has failed after 5 retries
 Fix the download manually:
$ mkdir -p ethnicGroups/indo-aryan
$ cd ethnicGroups/indo-aryan
$ wget -c https://atlanblackhouse.files.wordpress.com/2016/07/nazis-greek-olympics-indo-aryan-people.png
$ tar xf nazis-greek-olympics-indo-aryan-people.png
 And re-run your code once the download is successful

 Download of https://blackhistory938.files.wordpress.com/2017/12/the-other_home-of-subcultures-and-style-documentary_india-masquerade-gods-yannick-cormier_14.jpg has failed after 5 retries
 Fix the download manually:
$ mkdir -p ethnicGroups/dravidian
$ cd ethnicGroups/dravidian
$ wget -c https://blackhistory938.files.wordpress.com/2017/12/the-other_home-of-subcultures-and-style-documentary_india-masquerade-gods-yannick-cormier_14.jpg
$ tar xf the-other_home-of-subcultures-and-style-documentary_india-masquerade-gods-yannick-cormier_14.jpg
 And re-run your code once the download is successful
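
A few URLs may fail like in the log above; they can simply be skipped, and the suggested `tar xf` step does not apply to single image files. If a particular image is really wanted, one option is fastai's download_url helper; a minimal sketch (the URL and filename here are placeholders):

download_url('https://example.com/some_image.jpg',   # hypothetical stubborn URL
             path/'indo-aryan'/'manual_00000.jpg')   # save it into the right class folder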

Now we check the downloaded images:

fns = get_image_files(path)
fns
(#424) [Path('ethnicGroups/dravidian/00000000.jpg'),Path('ethnicGroups/dravidian/00000039.png'),Path('ethnicGroups/dravidian/00000001.jpg'),Path('ethnicGroups/dravidian/00000002.JPG'),Path('ethnicGroups/dravidian/00000002.jpg'),Path('ethnicGroups/dravidian/00000003.jpg'),Path('ethnicGroups/dravidian/00000004.jpg'),Path('ethnicGroups/dravidian/00000005.jpg'),Path('ethnicGroups/dravidian/00000006.jpg'),Path('ethnicGroups/dravidian/00000062.PNG')...]

Now we have to check for corrupt images and remove them:

failed = verify_images(fns)
failed
(#3) [Path('ethnicGroups/dravidian/00000007.jpg'),Path('ethnicGroups/indian mongloid/00000071.jpg'),Path('ethnicGroups/indo-aryan/00000080.png')]
failed.map(Path.unlink);

Now we have to create the DataLoaders.

DataLoaders: A fastai class that stores multiple DataLoader objects you pass to it, normally a train and a valid, although it's possible to have as many as you like. The first two are made available as properties.[^1]

[^1]: Definition from Jeremy Howard's book *Deep Learning for Coders*.

groups = DataBlock(
    blocks=(ImageBlock, CategoryBlock),               # inputs are images, targets are categories
    get_items=get_image_files,                        # collect every image file under the path
    splitter=RandomSplitter(valid_pct=0.2, seed=45),  # hold out 20% of the data for validation
    get_y=parent_label,                               # label each image with its parent folder name
    item_tfms=Resize(128))                            # resize every image to 128x128
dls = groups.dataloaders(path)
dls.valid.show_batch(max_n=4, nrows=1)
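
It can also help to peek at what `dls` actually contains; a minimal sketch (the batch size and shapes depend on fastai's defaults and the Resize(128) transform):

xb, yb = dls.train.one_batch()   # one mini-batch of images and labels
print(xb.shape, yb.shape)        # e.g. torch.Size([64, 3, 128, 128]) and torch.Size([64])
print(dls.vocab)                 # the category names inferred from the folder labels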

As the show_batch output shows, some of the images are not correct, so we will have to clean our image dataset. If you can curate the image dataset manually, the results will be much better.

We use data augmentation for better accuracy.
Data augmentation refers to creating random variations of our input data, such that they appear different, but do not actually change the meaning of the data. Examples of common data augmentation techniques for images are rotation, flipping, perspective warping, brightness changes and contrast changes.[^1]

groups = groups.new(item_tfms=Resize(128), batch_tfms=aug_transforms(mult=2))
dls = groups.dataloaders(path)
dls.train.show_batch(max_n=8, nrows=2, unique=True)
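
aug_transforms builds the standard set of batch augmentations (flips, rotation, zoom, warp, lighting changes), and mult=2 doubles the magnitude of the random parameters. To see which transform objects it actually produces, a quick inspection (just a sketch):

tfms = aug_transforms(mult=2)            # the same transforms used above
for t in tfms: print(type(t).__name__)   # list the transform classes fastai composed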

Now we use larger images for better results, with RandomResizedCrop picking a different random crop (of at least half the image) every epoch:

groups = groups.new(
    item_tfms=RandomResizedCrop(224, min_scale=0.5),
    batch_tfms=aug_transforms())
dls = groups.dataloaders(path)
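
To see what RandomResizedCrop is doing, we can show the same image several times with different random crops, just as we did for the augmentations above:

dls.train.show_batch(max_n=4, nrows=1, unique=True)   # same image, different random crops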

Now we use transfer learning with the resnet18 architecture.
With fastai this takes only a couple of lines of code.
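
For context, the fine_tune(4) call below is roughly equivalent to the following steps (a simplified sketch of fastai's default behaviour, shown as comments only):

# learn.freeze()           # phase 1: train only the newly added head
# learn.fit_one_cycle(1)   # for one epoch
# learn.unfreeze()         # phase 2: train the whole pretrained network
# learn.fit_one_cycle(4)   # for four more epochs, with lower learning rates for early layers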

learn = cnn_learner(dls, resnet18, metrics=error_rate)
learn.fine_tune(4)
| epoch | train_loss | valid_loss | error_rate | time  |
|-------|------------|------------|------------|-------|
| 0     | 1.708008   | 0.922546   | 0.357143   | 00:05 |

| epoch | train_loss | valid_loss | error_rate | time  |
|-------|------------|------------|------------|-------|
| 0     | 1.080864   | 0.907514   | 0.392857   | 00:06 |
| 1     | 0.966108   | 0.896233   | 0.333333   | 00:06 |
| 2     | 0.899231   | 0.951731   | 0.333333   | 00:06 |
| 3     | 0.833207   | 0.921495   | 0.321429   | 00:06 |

The accuracy is not great (the error rate is around 32%), but we will clean our data and get better results.

interp = ClassificationInterpretation.from_learner(learn)
interp.plot_confusion_matrix()
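
Besides the confusion matrix, ClassificationInterpretation can list which pairs of classes are confused most often:

interp.most_confused(min_val=2)   # (actual, predicted, count) pairs with at least 2 mistakes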

These images are some examples of the model's worst predictions (highest losses).

interp.plot_top_losses(5, nrows=1)

With the help of fastai's ImageClassifierCleaner we will clean our dataset.

cleaner = ImageClassifierCleaner(learn)
cleaner   # interactive widget: mark images to delete or re-label, per category and split
for idx in cleaner.delete(): cleaner.fns[idx].unlink()                          # delete images marked for removal
for idx,cat in cleaner.change(): shutil.move(str(cleaner.fns[idx]), path/cat)   # move re-labelled images to the right folder

Now we will retrain our model on the cleaned dataset.

groups = groups.new(
    item_tfms=RandomResizedCrop(224, min_scale=0.5),
    batch_tfms=aug_transforms())
dls = groups.dataloaders(path)

As the results below show, our accuracy has improved considerably and the error rate has dropped to roughly 24%.

learn = cnn_learner(dls, resnet18, metrics=error_rate)
learn.fine_tune(4)
| epoch | train_loss | valid_loss | error_rate | time  |
|-------|------------|------------|------------|-------|
| 0     | 1.801849   | 2.149620   | 0.526316   | 00:04 |

| epoch | train_loss | valid_loss | error_rate | time  |
|-------|------------|------------|------------|-------|
| 0     | 1.326655   | 1.366351   | 0.473684   | 00:05 |
| 1     | 1.124967   | 0.936145   | 0.302632   | 00:05 |
| 2     | 0.983991   | 0.811113   | 0.236842   | 00:05 |
| 3     | 0.864988   | 0.783508   | 0.236842   | 00:05 |

Now we export our model so it can be used for inference:

learn.export()
path = Path()
path.ls(file_exts='.pkl')
(#1) [Path('export.pkl')]
learn_inf = load_learner(path/'export.pkl')
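
The exported learner keeps the class vocabulary and can predict directly on a single image file; a minimal sketch (the filename is just one of the downloaded images listed earlier, so adjust it if that file was removed during cleaning):

print(learn_inf.dls.vocab)        # the three group labels, in the order probs indexes into
pred, pred_idx, probs = learn_inf.predict('ethnicGroups/dravidian/00000000.jpg')
print(pred, f'{probs[pred_idx]:.4f}')   # predicted label and its probability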

We will create a GUI for a small application that uses this model inside the notebook, with the help of IPython widgets (ipywidgets) and Voilà.

btn_upload = widgets.FileUpload()
btn_upload
img = PILImage.create(btn_upload.data[-1])
out_pl = widgets.Output()
out_pl.clear_output()
with out_pl: display(img.to_thumb(128,128))
out_pl
pred,pred_idx,probs = learn_inf.predict(img)
lbl_pred = widgets.Label()
lbl_pred.value = f'Prediction: {pred}; Probability: {probs[pred_idx]:.04f}'
lbl_pred
btn_run = widgets.Button(description='Classify')
btn_run
def on_click_classify(change):
    img = PILImage.create(btn_upload.data[-1])
    out_pl.clear_output()
    with out_pl: display(img.to_thumb(128,128))
    pred,pred_idx,probs = learn_inf.predict(img)
    lbl_pred.value = f'Prediction: {pred}; Probability: {100* probs[pred_idx]:.02f}%'

btn_run.on_click(on_click_classify)

With the help of IPython widgets we have made a simple GUI: clicking the upload button opens a dialog to choose an image, and pressing Classify displays the predicted group and its probability.

VBox([widgets.Label('Select image to check ethnicity!'), 
      btn_upload, btn_run, out_pl, lbl_pred])
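
To serve this notebook as a standalone web app, we can use Voilà, which runs the notebook and shows only the widgets and markdown while hiding the code cells. The cells below follow the usual fastai course setup (a sketch; your environment may differ):

!pip install voila
!jupyter serverextension enable --sys-prefix voila

After that, replacing the word "notebooks" with "voila" in the notebook's browser URL opens the same notebook as an app, and a public link can be published for free with Binder, as mentioned at the top.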