Tag: computer-vision

2 Which AI tools can be used for food recognition? 2018-11-20T15:46:37.783

2 Is it possible to train a model to recognize trash? 2018-11-29T02:25:18.550

2 Appropriateness of 3D Convolutional Neural Network for segmentation of medical image data 2018-12-21T18:37:27.863

2 Approach to classify a photo and extract text from it 2019-01-26T03:14:09.527

2 Find object location (x, y) in an image 2019-02-18T01:02:16.977

2 Presence of object (highly occluded vehicle) in a scene 2019-02-26T10:23:02.940

2 Calculating tangent vector of curve s(P,$\alpha$) at given point $\alpha$ = 0 2019-02-27T13:30:04.210

2 Why does a fully connected layer only accept a fixed input size? 2019-03-07T07:50:52.097

2 How could we estimate the square footage of a room from an image? 2019-03-12T09:57:37.507

2 Are there any other rotated object detection datasets? 2019-05-01T03:53:26.863

2 Estimate distance between points in perspective image 2019-05-14T06:16:39.420

2 Object size identification and maximum number of classes with convolutional neural networks 2019-05-29T01:53:08.997

2 Which API can I use for tracking the position of animal in one or more images? 2019-05-30T10:52:10.080

2 How does ARKit's Facial Tracking work? 2019-06-14T08:22:42.710

2 What is "dense" in DensePose? 2019-07-09T07:17:33.867

2 Extracting Descriptors and feature points for 3d mesh 2019-08-01T10:54:38.090

2 Can Microsoft's cognitive service find similar person in a set of images without using the face service? 2019-08-03T15:51:41.137

2 How can I detect fast and slow motion in videos? 2019-08-13T17:35:25.323

2 What is the difference between 2d vs 3d convolutions? 2019-08-13T18:34:45.210

2 Why do we get a three-dimensional output after a convolutional layer? 2019-08-16T05:47:03.300

2 How to use machine learning to create combine of opposite images side by side 2019-08-19T14:35:29.083

2 How to generate the original image from feature set? 2019-08-22T19:50:45.020

2 How do I generate structured light for the 3D bin picking system? 2019-08-24T17:36:24.823

2 How to implement fisherface algorithm and how much time will it take? 2019-08-29T16:59:49.980

2 What should load_mask() return if an image doesn't have any objects? (Mask RCNN) 2019-08-29T20:24:33.163

2 Confidence Maps and Non-Linearity 2019-09-08T05:59:26.920

2 What technology do people use to create bots for games like LOL or Runescape? 2019-11-11T02:17:22.167

2 Is there a dataset for the detection of bomb explosions? 2019-11-11T22:47:11.313

2 If an image contains two distinct objects, should I create a copy of this image with distinct labels for each copy? 2019-11-15T12:37:52.180

2 How much training data is required for a GAN? 2019-11-28T08:10:47.857

2 Are there ensemble methods for regression? 2019-12-04T19:34:49.327

2 Ghost camera or video overlays for example in sports 2019-12-14T21:11:12.080

2 Which AI methods are most appropriate for login face recognition? 2019-12-17T09:07:06.150

2 SLAM versus "STAM" in vision 2019-12-19T23:59:01.223

2 Can an image recognition model be used for human pose estimation? 2019-12-27T06:42:13.417

2 What is the reasoning behind the number of filters in the convolution layer? 2020-01-01T21:01:39.787

2 Semantic Segmentation For Multiple Objects When Trained On Single Object 2020-01-12T22:11:30.710

2 When training a CNN, what are the hyperparameters to tune first? 2020-01-15T09:04:46.550

2 How feasible is it to perform pose estimation on a Raspberry Pi 4 using a Pi-Cam? 2020-01-24T11:45:04.393

2 What are the current tools and techniques for image segmentation in order of pragmatism? 2020-02-06T09:55:03.257

2 Does bag-of-words method improve the classification accuracy? 2020-02-07T10:52:41.470

2 Does a fully convolutional network share the same translation invariance properties we get from networks that use max-pooling? 2020-02-21T00:18:55.033

2 Suitable algorithms for classifying terrain condition (asphalt, dirt etc) for motor vehicles 2020-03-03T13:52:41.163

2 Is there an efficient way of determining the layers with the best performance as feature extractors in GoogleNet? 2020-03-13T16:54:18.927

2 Why are conics important in computer vision? 2020-03-25T14:11:51.167

2 Can a fully convolutional network always return an image of the same size as the original? 2020-03-28T12:59:27.360

2 Creating Dataset for Image Classification 2020-04-13T12:31:11.140

2 Could machine learning be used to measure the distance between two objects from a picture or live camera? 2020-04-14T12:56:50.397

2 How do you find the homography matrix given 4 points in both images? 2020-05-10T10:32:04.957

2 How does the region proposal method work in Fast R-CNN? 2020-05-10T12:01:15.010

2 Merge two different CNN models into one 2020-05-11T13:08:39.250

2 How is visual attention mechanism different from a two branch convolutional neural network? 2020-05-16T20:34:07.090

2 Detect object in video and augment another video on top of it 2020-05-23T23:20:21.940

2 What are the main algorithms used in computer vision? 2020-06-17T15:12:56.500

2 Why are RNNs used in some computer vision problems? 2020-07-06T11:27:49.380

2 What is a heatmap in the CornerNet paper? 2020-07-10T22:03:36.810

2 What is meant by "arranging the final features of CNN in a grid" and how to do it? 2020-07-24T09:54:10.420

2 Why can we perform graph convolution using the standard 2d convolution with $1 \times \Gamma$ kernels? 2020-07-24T17:54:52.897

2 How to take the optimal batch_size for training a model? 2020-08-19T06:38:39.940

2 How is the data labelled in order to train a region proposal network? 2020-08-29T21:56:28.467

1 Why do action recognition algorithms perform better on the UCF101 dataset than on the HMDB51 dataset? 2017-02-09T20:27:23.960

1 Can I limit the possible choices for a computer vision framework to recognize? 2017-04-30T15:10:22.757

1 What does it mean to categorize a feature as low-, mid-, or high-level? 2017-06-25T01:58:31.467

1 Could a multi-camera SLAM system that is accurate at low driving speeds be equally accurate at high driving speeds? 2017-10-26T21:55:45.817

1 Searching an AI to recognize and locate persons in a factory 2017-12-18T19:38:08.683

1 Data extraction from medical reports 2018-01-26T14:47:12.340

1 Commercial API Q: is there an api for converting vision tags into a caption? 2018-03-22T11:06:56.577

1 Extracting one class from a pretrained Convolutional Neural Network 2018-03-28T09:36:33.223

1 How to combine heterogeneous image features extracted with different algorithms for similar image retrieval? 2018-05-19T18:20:55.797

1 Given a query image Q and two other images X and Y, how to determine which one is most similar to Q? 2018-05-20T19:10:33.773

1 Normalization for well known data sets like coco-text and total text data set 2018-06-28T05:53:23.973

1 How to label training data for YOLO 2018-07-11T18:09:39.210

1 Difference in trained models between GCP's Google Vision and Firebase's ML kit? 2018-07-24T15:01:27.323

1 How to label “other” while labeling image for object detection/classification? 2018-07-30T12:27:53.110

1 How do I segment each part of a DICOM image? 2018-10-04T06:25:15.123

1 Good papers for implementing as project of computer vision course 2018-10-05T15:25:10.763

1 Input annotation quality check for large-scale image data 2018-10-19T07:59:55.483

1 How to approach this handwritten digit recognition? 2018-12-16T08:06:37.283

1 Image Segmentation Prediction with cropping 256x256 grids is very slow 2018-12-20T15:14:50.537

1 I need to predict ball position from set of Images 2018-12-26T12:44:54.080

1 TensorFlow: Inception V3 Transfer Learning Parameter Tuning 2019-01-24T02:07:08.700

1 Why does the model converge too fast when the learning rate is already very small (1e-05)? 2019-02-20T10:42:03.103

1 Siamese Network for unknown object 2019-03-11T01:14:01.363

1 Live video object detection with pose estimation 2019-03-27T03:11:54.003

1 Image-Specific Class Saliency Visualisation 2019-03-27T19:42:47.843

1 Detecting abnormalities in x-rays while taking into account demographics of a patient -automated 2019-03-28T23:55:54.163

1 Transposed convolution as upsampling in DCGAN 2019-05-06T08:08:29.923

1 Can we use Autoencoders for unsupervised CNN feature learning? 2019-05-10T17:58:28.847

1 Autoencoder for MobileNetV2 2019-06-09T18:01:09.007

1 YOLO architecture 2019-06-20T09:17:41.957

1 Understanding average precision (AP) in measuring object detector performance 2019-06-22T07:15:38.327

1 YOLO Architecture - kmeans clustering 2019-07-01T12:37:52.300

1 Applying a 1D convolution for 4D input 2019-07-04T08:05:38.950

1 How do I identify the number and type of objects in the same picture? 2019-07-27T08:24:08.187

1 How well can a CNN distinguish an object from its class? 2019-09-12T15:46:59.000

1 Segmentation of a static object in a video 2019-11-05T10:34:53.090

1 What is the current state of the art in animal facial recognition? 2019-11-05T11:29:41.587

1 How to measure object size from the disparity map using CNN? 2019-11-14T09:21:42.047

1 Can Grad CAM feature maps be used for Training? 2019-11-28T13:12:13.470