Data filtering in spreadsheets is a common problem faced by millions of end-users. The task of data filtering requires a computational model that can separate intended positive and negative string instances. We present a system, FIDEX, that can efficiently learn desired data filtering expressions from a small set of positive and negative string examples.
There are two key ideas of our approach. First, we design an expressive DSL to represent disjunctive filter expressions needed for several real-world data filtering tasks. Second, we develop an efficient synthesis algorithm for incrementally learning consistent filter expressions in the DSL from very few positive and negative examples. A DAG-based data structure is used to succinctly represent a large number of filter expressions, and two corresponding operators are defined for algorithmically handling positive and negative examples, namely, the intersection and subtraction operators. FIDEX is able to learn data filters for 452 out of 460 real-world data filtering tasks in real time (0.22s), using only 2.2 positive string instances and 2.7 negative string instances on average.
Wed 2 NovDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
13:30 - 15:10 | |||
13:30 25mTalk | Deriving Divide-and-Conquer Dynamic Programming Algorithms using Solver-Aided Transformations OOPSLA Shachar Itzhaky MIT CSAIL, Rohit Singh MIT, Rezaul Chowdhury Stony Brook University, Kuat Yessenov MIT, Yongquan Lu MIT, Charles E. Leiserson MIT, Armando Solar-Lezama MIT CSAIL DOI Pre-print Media Attached | ||
13:55 25mTalk | Speeding Up Machine-Code Synthesis OOPSLA Venkatesh Srinivasan University of Wisconsin - Madison, Tushar Sharma University of Wisconsin - Madison, USA, Thomas Reps University of Wisconsin - Madison and Grammatech Inc. DOI Pre-print Media Attached | ||
14:20 25mTalk | Automated Reasoning for Web Page Layout OOPSLA DOI Media Attached | ||
14:45 25mTalk | FIDEX: Filtering Spreadsheet Data using Examples OOPSLA DOI Media Attached |