cvpr cvpr2013 cvpr2013-22 knowledge-graph by maker-knowledge-mining

22 cvpr-2013-A Non-parametric Framework for Document Bleed-through Removal

Source: pdf

Author: Róisín Rowley-Brooke, François Pitié, Anil Kokaram

Abstract: This paper presents recent work on a new framework for non-blind document bleed-through removal. The framework includes image preprocessing to remove local intensity variations, pixel region classification based on a segmentation of the joint recto-verso intensity histogram and connected component analysis on the subsequent image labelling. Finally restoration of the degraded regions is performed using exemplar-based image inpainting. The proposed method is evaluated visually and numerically on a freely available database of 25 scanned manuscript image pairs with ground truth, and is shown to outperform recent non-blind bleed-through removal techniques.

Reference: text

Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Abstract This paper presents recent work on a new framework for non-blind document bleed-through removal. [sent-2, score-0.214]

2 The framework includes image preprocessing to remove local intensity variations, pixel region classification based on a segmentation of the joint recto-verso intensity histogram and connected component analysis on the subsequent image labelling. [sent-3, score-0.537]

3 Finally restoration of the degraded regions is performed using exemplar-based image inpainting. [sent-4, score-0.314]

4 The proposed method is evaluated visually and numerically on a freely available database of 25 scanned manuscript image pairs with ground truth, and is shown to outperform recent non-blind bleed-through removal techniques. [sent-5, score-0.198]

5 Introduction Ink bleed-through degradation poses one of the most difficult problems in document restoration. [sent-7, score-0.249]

6 It occurs when ink has seeped through from one side of the page and interferes with text on the other side. [sent-8, score-0.171]

7 Physical restoration of degraded documents is an invasive, expensive, and time con- suming process that may affect the integrity of the original. [sent-10, score-0.326]

8 It is therefore preferable to perform document restoration on a digital copy, where any number of changes may be made whilst leaving the original document intact. [sent-11, score-0.586]

9 Previous approaches to bleed-through removal struggle with severe bleed-through, where the intensity ranges of bleed-through and foreground regions overlap significantly. [sent-12, score-0.491]

10 Furthermore in previous non-blind approaches [10, 8, 12], though intensity and spatial information from both recto and verso sides of the page are used to locate bleed-through regions, processing is performed separately on each side. [sent-13, score-1.077]

11 The aim of this paper is to present a fully automated, nonparametric approach to non-blind bleed-through removal that can deal with a wider degree of degradation than other approaches, whilst producing results which preserve the characteristics of the original document. [sent-14, score-0.236]

12 ie processing is performed on recto and verso images separately to enforce uniform global intensity characteristics. [sent-18, score-0.934]

13 Secondly a two stage classification is performed on both sides of the document simultaneously to locate regions of bleed-through degradation. [sent-19, score-0.424]

14 Thirdly clean background plate images are created using texture synthesis, and finally restored recto and verso images are obtained by blending the original degraded images and the clean background plates in regions classified as bleed-through. [sent-20, score-1.443]

15 Visual and numerical comparisons between the proposed method and three recent non-blind removal methods, using the database and methodology proposed in [13] are made in Section 4, and finally the conclusions are presented in Section 5. [sent-28, score-0.185]

16 Previous Work Approaches to bleed-through reduction generally fall into one of two groups; blind or non-blind, depending on whether they operate on one or both sides of the document. [sent-30, score-0.139]

17 The image intensity is the main source of information used, with spatial information included in some approaches. [sent-31, score-0.15]

18 This assumption does not hold for severe cases where the bleed-through intensity can be equivalent to or darker than the foreground text, and so intensity information alone is not enough to remove bleed-through successfully. [sent-33, score-0.436]

19 Non-blind methods make use of intensity information from both sides of the page, however the sides must first be registered so that they are aligned and of the same reso222999555422 lution. [sent-35, score-0.351]

20 Some non-blind methods use comparative intensity information from both sides to improve the performance of well known binarisation algorithms. [sent-36, score-0.311]

21 [6] are improved for bleedthrough interference by adding in secondary threshold levels in [1], and the Sauvola and Pietikainen adaptive binarisation algorithm [14], improved by fuzzy classification, is used in [2]. [sent-38, score-0.18]

22 The ICA method is extended to doublesided documents in [18], using the recto and verso images as the sources for a blind-source separation. [sent-39, score-0.881]

23 A model based approach is used by Moghaddam and Cheriet in [9], where a function of the difference in intensities between the two sides is used to indicate bleed-through regions. [sent-40, score-0.204]

24 The same authors incorporate this diffusion model into a unified framework [10], using variational models for both blind and non-blind bleed-through removal with spatial smoothness enforced in the wavelet domain. [sent-42, score-0.303]

25 in [7] and [8] proposed a user assisted method that classifies each pixel based on the ratio of intensities between the two sides, with spatial smoothness is enforced in a dual-layer MRF framework. [sent-44, score-0.289]

26 The data cost energy is defined from a small set of user input training data, in the form of coloured strokes drawn by the user in foreground, background, and bleed-through regions on both sides. [sent-45, score-0.144]

27 More recently Rowley-Brooke and Kokaram [12] proposed to represent the degradation via linear mixing models combined with foreground text masks, and to estimate restored image intensities explicitly, thus preserving the background texture of the document. [sent-46, score-0.567]

28 An example of an image with local intensity variations before (top), and after (bottom) detrending. [sent-49, score-0.123]

29 As highlighted in Section 1, the method proposed here seeks to emulate the non-parametric approach of [8], in that no assumptions about the document properties need to be made, whilst maintaining the restoration goal of [12], that is to preserve the intrinsic characteristics of the document. [sent-50, score-0.4]

30 Preprocessing Registration of the recto and verso images is an essential preprocessing step for non-blind bleed-through reduction as it ensures that bleed-through pixels are aligned with their originating text pixels from the opposite side. [sent-54, score-0.918]

31 For the purposes of this paper, it is assumed that the input recto and verso images are already registered - those in the database used for testing were registered manually. [sent-55, score-0.901]

32 Prior to classification it is necessary to compensate for any variations in the intensity profile over the document image, for example due to page binding or water stains. [sent-56, score-0.408]

33 These effects can interfere with bleed-through restoration methods that rely on global intensity properties. [sent-57, score-0.228]

34 Since many document imaging projects perform little or no image enhancement it can not be assumed that the resultant images have uniform global intensity properties. [sent-58, score-0.337]

35 Therefore the recto and verso images are adjusted separately by applying local intensity offsets such that the peaks of the lo- × cal intensity histograms, corresponding to mean local background intensities, are aligned. [sent-60, score-1.161]

36 This is performed by examining intensity histograms of overlapping blocks in the original image and storing the corresponding peak intensities. [sent-61, score-0.176]

37 Classification The proposed method aims to create a joint labelling of recto and verso images, from a set of four ‘pair’ labels: background on both sides, bgbg, recto foreground and verso bleed-through, fgbl, recto bleed-through and verso foreground, blfg, or foreground on both sides, fgfg. [sent-66, score-2.928]

38 Thus the recto and verso images 푟, 푣 are treated as a joint image 푝, and each pixel pair 푟(푖, 푗) , 푣(푖, 푗) is treated as a single pixel 푝(푖, 푗) with intensity pair x in the range [0, 255] , where 0 corresponds to white, and 255 to black. [sent-67, score-0.96]

39 The motivation for considering pair rather than single intensities is to reduce the instances of overlap between labels. [sent-68, score-0.131]

40 There are therefore two stages to classification, firstly ajoint histogram of intensity pairs is segmented into four regions 222999555533 Figure 2. [sent-70, score-0.242]

41 This histogram labelling is then used as a map to obtain an initial image labelling. [sent-73, score-0.194]

42 Secondly a set of rules governing connected label components in the image labelling is applied to produce the final labels for the rectoverso image 푝. [sent-74, score-0.449]

43 2, it is clear from the large peak in the points with lighter intensity that the largest proportion of pixels in 푝 will correspond to regions where both recto and verso are background (bgbg). [sent-79, score-1.134]

44 So the labelling is formulated as a MRF framework with a spatial smoothness prior based in the recto-verso image domain rather than the joint histogram domain. [sent-83, score-0.277]

45 Document background regions generally have a lower range of intensities than foreground, so to prevent over classification of points as bgbg, 푈x(푙x) is defined as the mahalanobis distance between point x and the centre of the label cluster corresponding to 푙x. [sent-88, score-0.306]

46 e 풩x = {y∣y =푝(푖′, 푗′) , x =푝(푖, 푗) , (푖′푗′) ∈ 풩푖,푗 } (2) So each in{stance of an intensity pair x is located} in the recto-verso image 푝, then the corresponding points in the joint histogram of the 4-connect neighbours in 푝 of these instances are added to the neighbourhood of x. [sent-97, score-0.21]

47 Binary Terms: The pairwise energy 푉 (푙x, 푙y) represents the cost of neighbouring points in the histogram being assigned labels 푙x and 푙y respectively. [sent-98, score-0.142]

48 Smoothness Weight: A smoothness weight is applied to 푉 (푙x, 푙y) to balance the influence of the binary and unary energies, and depends on the range of intensities in the recto-verso image. [sent-100, score-0.163]

49 When the range of intensities is small, there is a greater overlap between labels, and so there is less information available from the recto-verso intensities. [sent-101, score-0.131]

50 2 Image Segmentation Following colour segmentation, the image labelling is initialised by using the histogram labelling as a look up table for pixels in the recto-verso image 푝. [sent-111, score-0.327]

51 A subset of pixels will inevitably be misclassified due to the overlapping nature of the histogram label boundaries, however as the pairwise energy used in the histogram segmentation is derived from neighbourhoods in the image domain, spatial smoothness 222999555644 Figure 3. [sent-112, score-0.327]

52 Left to right: recto extract, verso extract, image labelling before rules applied, and after rules applied. [sent-114, score-1.1]

53 Row 1: Misclassified bgbg components (dark blue) are corrected. [sent-115, score-0.335]

54 Row 2: fgfg components (pink) are replaced with fgbl (green). [sent-116, score-0.538]

55 Row 3: fgbl components (green) connected to blfg (light blue), but not fgfg (pink) are replaced with blfg. [sent-117, score-0.822]

56 Row 4: A blfg component is connected to fgfg, but not bgbg so is replaced with fgfg. [sent-118, score-0.642]

57 Therefore a full per-pixel analysis is not performed on the image labelling, and instead connected components of each label are examined, and rules governing permitted neighbouring components iteratively applied to correct misclassifications until convergence. [sent-120, score-0.408]

58 bgbg: This label covers the greatest proportion of the image, and so connected components will mostly be larger than the average character size. [sent-122, score-0.29]

59 Smaller components correspond either to valid within character spaces, such as in ‘a’ and ‘o’, or to misclassifications. [sent-123, score-0.137]

60 To avoid relabelling valid within character spaces, only the connected components that are less than 10% of the average character component size are analysed. [sent-124, score-0.311]

61 Presumed to be mislabelled these components are relabelled with the label corresponding to the largest proportion of their neighbours. [sent-125, score-0.294]

62 fgfg: Conversely, this label covers the smallest proportion of the image, and as very dark bleed-through can often be mislabelled as fgfg, no assumptions can be made about the size of components and all are examined. [sent-126, score-0.248]

63 The outer edges of components with this label must contain both fgbl and blfg labels, as overlapping text regions will originate from text alone on both sides. [sent-127, score-0.799]

64 If this is not the case the component is relabelled fgbl or blfg according to which is present in the outer edge, or as bgbg if neither. [sent-128, score-0.894]

65 fgbl: For this label, again only components less than 10% of the average character size are examined. [sent-129, score-0.137]

66 The outer edges of these components must contain either fgfg and bgbg, or bgbg only. [sent-130, score-0.623]

67 If the outer edge of a component contains fgfg, but not bgbg also, then the component is relabelled as fgfg. [sent-131, score-0.509]

68 If the outer edge of a component contains the label blfg ,but Figure 4. [sent-132, score-0.379]

69 Top row left: degraded recto with feint ruled lines, right: corresponding verso. [sent-134, score-0.624]

70 Second row left: image labelling (dark blue=texture source), right: visible artefacts in the recto background plate. [sent-135, score-0.657]

71 Bottom left: labelling with 10% of source gradients removed (yellow), right: the improved background plate. [sent-136, score-0.257]

72 blfg: Components labelled blfg are processed in exactly the same way as fgbl, with the two labels interchanged. [sent-138, score-0.305]

73 Restoration The aim of this method is to preserve as much of the document as possible; the background texture is preserved to ensure that the experience of studying the document image remains close to that of studying the physical docu- ment. [sent-142, score-0.551]

74 The restored recto and verso images ˆ 푟(푥, 푦) , ˆ 푣(푥, 푦) are obtained by replacing identified bleed-through regions, where 푙푖 = fgbl for ˆ 푟(푥, 푦), and 푙푖 = blfg for ˆ 푣(푥, 푦), with background texture from clean background images 푟푏(푥, 푦) , 푣푏(푥, 푦). [sent-143, score-1.56]

75 The images 푟푏(푥, 푦) , 푣푏(푥, 푦) for recto and verso sides are generated using regions labelled as bgbg as the texture source, and inpainting all other label regions. [sent-148, score-1.411]

76 Problems may be encountered with this approach in regions where feint foreground information might not have been identified during classification, with the result that foreground patterns are replicated in the background images. [sent-149, score-0.439]

77 To mitigate this the gradients of the regions labelled as bgbg are examined and the highest 10% of gradients removed from the inpainting source (see Fig 4). [sent-150, score-0.516]

78 2 Blending Using a per-pixel replacement of bleed-through pixels with corresponding clean background pixels creates restored im222999555755 Figure 5. [sent-153, score-0.226]

79 An example of blending the background image with the degraded image in bleed-through boundary regions. [sent-154, score-0.279]

80 Results & Discussion The proposed method was tested on the database of 25 manuscript recto-verso image pairs with manually created binary foreground text images, presented in [13]. [sent-160, score-0.268]

81 The results are evaluated first subjectively via a visual comparison, and then objectively, via a numerical comparison, against three recent non-blind bleed-through removal methods: (i) The dual-layer MRF approach with user trained likelihood proposed by Huang et al. [sent-161, score-0.202]

82 The user assisted method (H) copes well with dark bleed-through when it is isolated, but tends to remove foreground text in overlapping fgfg regions, reducing legibility. [sent-168, score-0.602]

83 The Wavelet method (M) preserves the foreground information, but does not cope well with dark bleed-through regions, leaving visible artefacts. [sent-169, score-0.203]

84 The linear model based approach (R) also preserves foreground information well in most cases, but again does not cope well with dark-bleed through regions. [sent-170, score-0.159]

85 The proposed method removes most of the bleed-through in all the examples whilst preserving the foreground well. [sent-171, score-0.187]

86 However this is achieved as the cost of foreground information in fgfg regions and so this method performs worst in terms of FgError. [sent-195, score-0.417]

87 The Wavelet based method (M) [10] preserves the foreground well, however does not cope well with severe bleedthrough so has a high average BgError and is ranked fourth for this metric. [sent-196, score-0.28]

88 This is due to the fact that the mixing parameters in the model are assigned a very high smoothness such that at each successive estimation iteration the bleed-through removed regions increase in size and regions misclassified in the initial stages are gradually blended into the background. [sent-198, score-0.312]

89 In each example from top to bottom: Degraded recto and verso images, results from the user assisted method (H) [8], results from the Wavelet method (M) [10], results from the linear- based method (R) [12], results from the proposed method. [sent-201, score-0.904]

90 disadvantage of such a high smoothness is that valid foreground characters connected to bleed-through regions may also be increasingly blended into the background (as can be seen in the top left example ofFig. [sent-202, score-0.411]

91 The pairwise comparison results (Table 3) and RP metric rankings highlight that the proposed method outperforms the other three in terms of foreground preservation, and overall error. [sent-204, score-0.159]

92 The preprocessing stage removes intensity trends in the input images. [sent-207, score-0.202]

93 The classification stage has the advantage over other methods that both recto and verso images are processed simultaneously, first by performing a joint histogram segmentation, then by applying rules to label connected components in the corresponding image segmentation. [sent-208, score-1.202]

94 The restoration is performed using exemplar based image inpainting to preserve the character of the original document image. [sent-209, score-0.469]

95 03054 HMRProposed Probability of foreground error Figure 7. [sent-214, score-0.134]

96 Enhanced bleedthrough correction for early music documents with recto-verso registration. [sent-223, score-0.208]

97 Restoring ink bleed-through degraded document images using a recursive unsupervised classification technique. [sent-253, score-0.461]

98 Color space transformations for analysis and enhancement of ancient degraded manuscripts. [sent-363, score-0.191]

99 Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique. [sent-383, score-0.138]

100 Document ink bleed-through removal with two hidden markov random fields and a single observation field. [sent-392, score-0.19]

similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('recto', 0.428), ('verso', 0.383), ('bgbg', 0.27), ('blfg', 0.225), ('fgfg', 0.225), ('document', 0.214), ('fgbl', 0.203), ('degraded', 0.151), ('foreground', 0.134), ('labelling', 0.133), ('intensity', 0.123), ('removal', 0.12), ('restored', 0.116), ('intensities', 0.106), ('restoration', 0.105), ('sides', 0.098), ('bgerror', 0.09), ('binarisation', 0.09), ('bleedthrough', 0.09), ('fgerror', 0.09), ('relabelled', 0.09), ('rules', 0.078), ('moghaddam', 0.074), ('character', 0.072), ('documents', 0.07), ('ink', 0.07), ('background', 0.068), ('binarised', 0.068), ('gatos', 0.068), ('kokaram', 0.068), ('components', 0.065), ('outer', 0.063), ('histogram', 0.061), ('blending', 0.06), ('connected', 0.059), ('regions', 0.058), ('smoothness', 0.057), ('text', 0.056), ('whilst', 0.053), ('manuscript', 0.052), ('wavelet', 0.052), ('preprocessing', 0.051), ('inpainting', 0.05), ('assisted', 0.05), ('misclassified', 0.05), ('labelled', 0.049), ('label', 0.048), ('music', 0.048), ('proportion', 0.046), ('cheriet', 0.045), ('feint', 0.045), ('mislabelled', 0.045), ('piti', 0.045), ('sauvola', 0.045), ('tonazzini', 0.045), ('replaced', 0.045), ('page', 0.045), ('dark', 0.044), ('component', 0.043), ('user', 0.043), ('lncs', 0.042), ('clean', 0.042), ('blind', 0.041), ('ancient', 0.04), ('hysteresis', 0.04), ('objectively', 0.04), ('numerical', 0.039), ('peaks', 0.036), ('editors', 0.035), ('blended', 0.035), ('governing', 0.035), ('degradation', 0.035), ('rp', 0.033), ('misclassifications', 0.033), ('examined', 0.033), ('enforced', 0.033), ('ica', 0.032), ('mrf', 0.032), ('registered', 0.032), ('severe', 0.031), ('labels', 0.031), ('pink', 0.031), ('removed', 0.029), ('peak', 0.028), ('artefacts', 0.028), ('stage', 0.028), ('preserve', 0.028), ('texture', 0.027), ('source', 0.027), ('classification', 0.026), ('joint', 0.026), ('database', 0.026), ('overlapping', 0.025), ('overlap', 0.025), ('pairwise', 0.025), ('secondly', 0.025), ('remove', 0.025), ('neighbouring', 0.025), ('mixing', 0.025), ('preserves', 0.025)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.9999994 22 cvpr-2013-A Non-parametric Framework for Document Bleed-through Removal

Author: Róisín Rowley-Brooke, François Pitié, Anil Kokaram

2 0.087575451 207 cvpr-2013-Human Pose Estimation Using a Joint Pixel-wise and Part-wise Formulation

Author: Ľ

Abstract: Our goal is to detect humans and estimate their 2D pose in single images. In particular, handling cases of partial visibility where some limbs may be occluded or one person is partially occluding another. Two standard, but disparate, approaches have developed in the field: the first is the part based approach for layout type problems, involving optimising an articulated pictorial structure; the second is the pixel based approach for image labelling involving optimising a random field graph defined on the image. Our novel contribution is a formulation for pose estimation which combines these two models in a principled way in one optimisation problem and thereby inherits the advantages of both of them. Inference on this joint model finds the set of instances of persons in an image, the location of their joints, and a pixel-wise body part labelling. We achieve near or state of the art results on standard human pose data sets, and demonstrate the correct estimation for cases of self-occlusion, person overlap and image truncation.

3 0.08628758 216 cvpr-2013-Improving Image Matting Using Comprehensive Sampling Sets

Author: Ehsan Shahrian, Deepu Rajan, Brian Price, Scott Cohen

Abstract: In this paper, we present a new image matting algorithm that achieves state-of-the-art performance on a benchmark dataset of images. This is achieved by solving two major problems encountered by current sampling based algorithms. The first is that the range in which the foreground and background are sampled is often limited to such an extent that the true foreground and background colors are not present. Here, we describe a method by which a more comprehensive and representative set of samples is collected so as not to miss out on the true samples. This is accomplished by expanding the sampling range for pixels farther from the foreground or background boundary and ensuring that samples from each color distribution are included. The second problem is the overlap in color distributions of foreground and background regions. This causes sampling based methods to fail to pick the correct samples for foreground and background. Our design of an objective function forces those foreground and background samples to be picked that are generated from well-separated distributions. Comparison on the dataset at and evaluation by www.alphamatting.com shows that the proposed method ranks first in terms of error measures used in the website.

4 0.085846499 309 cvpr-2013-Nonparametric Scene Parsing with Adaptive Feature Relevance and Semantic Context

Author: Gautam Singh, Jana Kosecka

Abstract: This paper presents a nonparametric approach to semantic parsing using small patches and simple gradient, color and location features. We learn the relevance of individual feature channels at test time using a locally adaptive distance metric. To further improve the accuracy of the nonparametric approach, we examine the importance of the retrieval set used to compute the nearest neighbours using a novel semantic descriptor to retrieve better candidates. The approach is validated by experiments on several datasets used for semantic parsing demonstrating the superiority of the method compared to the state of art approaches.

5 0.082379036 346 cvpr-2013-Real-Time No-Reference Image Quality Assessment Based on Filter Learning

Author: Peng Ye, Jayant Kumar, Le Kang, David Doermann

Abstract: This paper addresses the problem of general-purpose No-Reference Image Quality Assessment (NR-IQA) with the goal ofdeveloping a real-time, cross-domain model that can predict the quality of distorted images without prior knowledge of non-distorted reference images and types of distortions present in these images. The contributions of our work are two-fold: first, the proposed method is highly efficient. NR-IQA measures are often used in real-time imaging or communication systems, therefore it is important to have a fast NR-IQA algorithm that can be used in these real-time applications. Second, the proposed method has the potential to be used in multiple image domains. Previous work on NR-IQA focus primarily on predicting quality of natural scene image with respect to human perception, yet, in other image domains, the final receiver of a digital image may not be a human. The proposed method consists of the following components: (1) a local feature extractor; (2) a global feature extractor and (3) a regression model. While previous approaches usually treat local feature extraction and regres- sion model training independently, we propose a supervised method based on back-projection, which links the two steps by learning a compact set of filters which can be applied to local image patches to obtain discriminative local features. Using a small set of filters, the proposed method is extremely fast. We have tested this method on various natural scene and document image datasets and obtained stateof-the-art results.

6 0.081065319 382 cvpr-2013-Scene Text Recognition Using Part-Based Tree-Structured Character Detection

7 0.079195403 148 cvpr-2013-Ensemble Video Object Cut in Highly Dynamic Scenes

8 0.072890982 284 cvpr-2013-Mesh Based Semantic Modelling for Indoor and Outdoor Scenes

9 0.067542002 450 cvpr-2013-Unsupervised Joint Object Discovery and Segmentation in Internet Images

10 0.055355661 180 cvpr-2013-Fully-Connected CRFs with Non-Parametric Pairwise Potential

11 0.054491546 55 cvpr-2013-Background Modeling Based on Bidirectional Analysis

12 0.054188319 375 cvpr-2013-Saliency Detection via Graph-Based Manifold Ranking

13 0.053808376 457 cvpr-2013-Visual Tracking via Locality Sensitive Histograms

14 0.052830338 222 cvpr-2013-Incorporating User Interaction and Topological Constraints within Contour Completion via Discrete Calculus

15 0.052790038 295 cvpr-2013-Multi-image Blind Deblurring Using a Coupled Adaptive Sparse Prior

16 0.050327826 131 cvpr-2013-Discriminative Non-blind Deblurring

17 0.046670884 327 cvpr-2013-Pattern-Driven Colorization of 3D Surfaces

18 0.046498265 453 cvpr-2013-Video Editing with Temporal, Spatial and Appearance Consistency

19 0.045491491 165 cvpr-2013-Fast Energy Minimization Using Learned State Filters

20 0.044850141 297 cvpr-2013-Multi-resolution Shape Analysis via Non-Euclidean Wavelets: Applications to Mesh Segmentation and Surface Alignment Problems

similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.122), (1, 0.024), (2, 0.017), (3, 0.026), (4, 0.032), (5, 0.02), (6, 0.004), (7, 0.015), (8, -0.02), (9, -0.005), (10, 0.037), (11, -0.015), (12, -0.011), (13, 0.01), (14, 0.002), (15, -0.01), (16, 0.018), (17, -0.034), (18, 0.057), (19, -0.015), (20, -0.006), (21, 0.062), (22, -0.032), (23, -0.045), (24, -0.009), (25, -0.065), (26, 0.11), (27, 0.031), (28, -0.05), (29, 0.001), (30, 0.022), (31, 0.018), (32, -0.008), (33, -0.032), (34, -0.041), (35, -0.008), (36, -0.032), (37, 0.001), (38, -0.073), (39, 0.017), (40, -0.046), (41, 0.039), (42, -0.005), (43, -0.025), (44, -0.014), (45, -0.037), (46, -0.02), (47, 0.015), (48, 0.058), (49, -0.021)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.92027062 22 cvpr-2013-A Non-parametric Framework for Document Bleed-through Removal

Author: Róisín Rowley-Brooke, François Pitié, Anil Kokaram

2 0.70795816 55 cvpr-2013-Background Modeling Based on Bidirectional Analysis

Author: Atsushi Shimada, Hajime Nagahara, Rin-ichiro Taniguchi

Abstract: Background modeling and subtraction is an essential task in video surveillance applications. Most traditional studies use information observed in past frames to create and update a background model. To adapt to background changes, the backgroundmodel has been enhancedby introducing various forms of information including spatial consistency and temporal tendency. In this paper, we propose a new framework that leverages information from a future period. Our proposed approach realizes a low-cost and highly accurate background model. The proposed framework is called bidirectional background modeling, and performs background subtraction based on bidirectional analysis; i.e., analysis from past to present and analysis from future to present. Although a result will be output with some delay because information is takenfrom a futureperiod, our proposed approach improves the accuracy by about 30% if only a 33-millisecond of delay is acceptable. Furthermore, the memory cost can be reduced by about 65% relative to typical background modeling.

3 0.66099912 216 cvpr-2013-Improving Image Matting Using Comprehensive Sampling Sets

Author: Ehsan Shahrian, Deepu Rajan, Brian Price, Scott Cohen

4 0.64189625 148 cvpr-2013-Ensemble Video Object Cut in Highly Dynamic Scenes

Author: Xiaobo Ren, Tony X. Han, Zhihai He

Abstract: We consider video object cut as an ensemble of framelevel background-foreground object classifiers which fuses information across frames and refine their segmentation results in a collaborative and iterative manner. Our approach addresses the challenging issues of modeling of background with dynamic textures and segmentation of foreground objects from cluttered scenes. We construct patch-level bagof-words background models to effectively capture the background motion and texture dynamics. We propose a foreground salience graph (FSG) to characterize the similarity of an image patch to the bag-of-words background models in the temporal domain and to neighboring image patches in the spatial domain. We incorporate this similarity information into a graph-cut energy minimization framework for foreground object segmentation. The background-foreground classification results at neighboring frames are fused together to construct a foreground probability map to update the graph weights. The resulting object shapes at neighboring frames are also used as constraints to guide the energy minimization process during graph cut. Our extensive experimental results and performance comparisons over a diverse set of challenging videos with dynamic scenes, including the new Change Detection Challenge Dataset, demonstrate that the proposed ensemble video object cut method outperforms various state-ofthe-art algorithms.

5 0.64055312 235 cvpr-2013-Jointly Aligning and Segmenting Multiple Web Photo Streams for the Inference of Collective Photo Storylines

Author: Gunhee Kim, Eric P. Xing

Abstract: With an explosion of popularity of online photo sharing, we can trivially collect a huge number of photo streams for any interesting topics such as scuba diving as an outdoor recreational activity class. Obviously, the retrieved photo streams are neither aligned nor calibrated since they are taken in different temporal, spatial, and personal perspectives. However, at the same time, they are likely to share common storylines that consist of sequences of events and activities frequently recurred within the topic. In this paper, as a first technical step to detect such collective storylines, we propose an approach to jointly aligning and segmenting uncalibrated multiple photo streams. The alignment task discovers the matched images between different photo streams, and the image segmentation task parses each image into multiple meaningful regions to facilitate the image understanding. We close a loop between the two tasks so that solving one task helps enhance the performance of the other in a mutually rewarding way. To this end, we design a scalable message-passing based optimization framework to jointly achieve both tasks for the whole input image set at once. With evaluation on the new Flickr dataset of 15 outdoor activities that consist of 1.5 millions of images of 13 thousands of photo streams, our empirical results show that the proposed algorithms are more successful than other candidate methods for both tasks.

6 0.62168324 263 cvpr-2013-Learning the Change for Automatic Image Cropping

7 0.60128599 211 cvpr-2013-Image Matting with Local and Nonlocal Smooth Priors

8 0.5983575 352 cvpr-2013-Recovering Stereo Pairs from Anaglyphs

9 0.59437352 190 cvpr-2013-Graph-Based Optimization with Tubularity Markov Tree for 3D Vessel Segmentation

10 0.59245032 327 cvpr-2013-Pattern-Driven Colorization of 3D Surfaces

11 0.56904668 281 cvpr-2013-Measures and Meta-Measures for the Supervised Evaluation of Image Segmentation

12 0.55296224 437 cvpr-2013-Towards Fast and Accurate Segmentation

13 0.55239826 453 cvpr-2013-Video Editing with Temporal, Spatial and Appearance Consistency

14 0.55002052 450 cvpr-2013-Unsupervised Joint Object Discovery and Segmentation in Internet Images

15 0.53365678 382 cvpr-2013-Scene Text Recognition Using Part-Based Tree-Structured Character Detection

16 0.53225577 177 cvpr-2013-FrameBreak: Dramatic Image Extrapolation by Guided Shift-Maps

17 0.52612984 180 cvpr-2013-Fully-Connected CRFs with Non-Parametric Pairwise Potential

18 0.52406681 391 cvpr-2013-Sensing and Recognizing Surface Textures Using a GelSight Sensor

19 0.52071619 332 cvpr-2013-Pixel-Level Hand Detection in Ego-centric Videos

20 0.51947004 171 cvpr-2013-Fast Trust Region for Segmentation

similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(10, 0.121), (16, 0.046), (26, 0.025), (28, 0.015), (33, 0.246), (67, 0.064), (69, 0.035), (87, 0.057), (98, 0.294)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.77843881 22 cvpr-2013-A Non-parametric Framework for Document Bleed-through Removal

Author: Róisín Rowley-Brooke, François Pitié, Anil Kokaram

2 0.77388138 372 cvpr-2013-SLAM++: Simultaneous Localisation and Mapping at the Level of Objects

Author: Renato F. Salas-Moreno, Richard A. Newcombe, Hauke Strasdat, Paul H.J. Kelly, Andrew J. Davison

Abstract: We present the major advantages of a new ‘object oriented’ 3D SLAM paradigm, which takes full advantage in the loop of prior knowledge that many scenes consist of repeated, domain-specific objects and structures. As a hand-held depth camera browses a cluttered scene, realtime 3D object recognition and tracking provides 6DoF camera-object constraints which feed into an explicit graph of objects, continually refined by efficient pose-graph optimisation. This offers the descriptive and predictive power of SLAM systems which perform dense surface reconstruction, but with a huge representation compression. The object graph enables predictions for accurate ICP-based camera to model tracking at each live frame, and efficient active search for new objects in currently undescribed image regions. We demonstrate real-time incremental SLAM in large, cluttered environments, including loop closure, relocalisation and the detection of moved objects, and of course the generation of an object level scene description with the potential to enable interaction.

3 0.71142441 187 cvpr-2013-Geometric Context from Videos

Author: S. Hussain Raza, Matthias Grundmann, Irfan Essa

Abstract: We present a novel algorithm for estimating the broad 3D geometric structure of outdoor video scenes. Leveraging spatio-temporal video segmentation, we decompose a dynamic scene captured by a video into geometric classes, based on predictions made by region-classifiers that are trained on appearance and motion features. By examining the homogeneity of the prediction, we combine predictions across multiple segmentation hierarchy levels alleviating the need to determine the granularity a priori. We built a novel, extensive dataset on geometric context of video to evaluate our method, consisting of over 100 groundtruth annotated outdoor videos with over 20,000 frames. To further scale beyond this dataset, we propose a semisupervised learning framework to expand the pool of labeled data with high confidence predictions obtained from unlabeled data. Our system produces an accurate prediction of geometric context of video achieving 96% accuracy across main geometric classes.

4 0.70006371 225 cvpr-2013-Integrating Grammar and Segmentation for Human Pose Estimation

Author: Brandon Rothrock, Seyoung Park, Song-Chun Zhu

Abstract: In this paper we present a compositional and-or graph grammar model for human pose estimation. Our model has three distinguishing features: (i) large appearance differences between people are handled compositionally by allowingparts or collections ofparts to be substituted with alternative variants, (ii) each variant is a sub-model that can define its own articulated geometry and context-sensitive compatibility with neighboring part variants, and (iii) background region segmentation is incorporated into the part appearance models to better estimate the contrast of a part region from its surroundings, and improve resilience to background clutter. The resulting integrated framework is trained discriminatively in a max-margin framework using an efficient and exact inference algorithm. We present experimental evaluation of our model on two popular datasets, and show performance improvements over the state-of-art on both benchmarks.

5 0.69788098 408 cvpr-2013-Spatiotemporal Deformable Part Models for Action Detection

Author: Yicong Tian, Rahul Sukthankar, Mubarak Shah

Abstract: Deformable part models have achieved impressive performance for object detection, even on difficult image datasets. This paper explores the generalization of deformable part models from 2D images to 3D spatiotemporal volumes to better study their effectiveness for action detection in video. Actions are treated as spatiotemporal patterns and a deformable part model is generated for each action from a collection of examples. For each action model, the most discriminative 3D subvolumes are automatically selected as parts and the spatiotemporal relations between their locations are learned. By focusing on the most distinctive parts of each action, our models adapt to intra-class variation and show robustness to clutter. Extensive experiments on several video datasets demonstrate the strength of spatiotemporal DPMs for classifying and localizing actions.

6 0.69746035 104 cvpr-2013-Deep Convolutional Network Cascade for Facial Point Detection

7 0.69723952 248 cvpr-2013-Learning Collections of Part Models for Object Recognition

8 0.69646299 325 cvpr-2013-Part Discovery from Partial Correspondence

9 0.69584072 414 cvpr-2013-Structure Preserving Object Tracking

10 0.69523937 446 cvpr-2013-Understanding Indoor Scenes Using 3D Geometric Phrases

11 0.69513386 98 cvpr-2013-Cross-View Action Recognition via a Continuous Virtual Path

12 0.6949898 14 cvpr-2013-A Joint Model for 2D and 3D Pose Estimation from a Single Image

13 0.69467413 221 cvpr-2013-Incorporating Structural Alternatives and Sharing into Hierarchy for Multiclass Object Recognition and Detection

14 0.6940912 419 cvpr-2013-Subspace Interpolation via Dictionary Learning for Unsupervised Domain Adaptation

15 0.69393826 143 cvpr-2013-Efficient Large-Scale Structured Learning

16 0.69384998 206 cvpr-2013-Human Pose Estimation Using Body Parts Dependent Joint Regressors

17 0.69366509 245 cvpr-2013-Layer Depth Denoising and Completion for Structured-Light RGB-D Cameras

18 0.69347173 387 cvpr-2013-Semi-supervised Domain Adaptation with Instance Constraints

19 0.6933586 74 cvpr-2013-CLAM: Coupled Localization and Mapping with Efficient Outlier Handling

20 0.69314897 360 cvpr-2013-Robust Estimation of Nonrigid Transformation for Point Set Registration