Professional Web Applications Themes

a tricky duplicates problem - FileMaker

Dear Listeners: I encountered a duplicates problem where I couldn't find a solution. From a certain electronic banking system downloads of account data are possible, up to 1 year back. They contain basic data like date, amount +/-, acount numbers from/to, name of opposite party, sometimes a message. There is *no* unique identifier per record ;=((((((((( The period of the download can be chosen freely which allows overlaps of up to almost a year. I made a test with my own historical electronic banking data from an off-line system (with modem connection), being 13070 contiguous records over 11 years. I ...

  1. #1

    Default a tricky duplicates problem

    Dear Listeners:

    I encountered a duplicates problem where I couldn't find a solution.

    From a certain electronic banking system downloads of account data are
    possible, up to 1 year back. They contain basic data like date, amount +/-,
    acount numbers from/to, name of opposite party, sometimes a message. There
    is *no* unique identifier per record ;=(((((((((

    The period of the download can be chosen freely which allows overlaps of up
    to almost a year. I made a test with my own historical electronic banking
    data from an off-line system (with modem connection), being 13070 contiguous
    records over 11 years. I found out (by making a check for duplicates in
    Textpad) that there are 2 records that are 100% duplicates in all fields.
    This is perfectly possible, one can pay/recieve the same sum the same day
    from the same account to the same account and the same remarks, if any.

    So is there a safe way to do so to speak a "visual" overlap? I can only
    compare this to a light box for photo slides where you would put 2 paper
    lists with banking data over each other. The lists have a certain period of
    overlap. By moving the top sheet up or down you find that e.g. the last 50
    records of sheet 1 and the first 20 records of sheet 2 overlap 100%. By
    doing this, the chance is probably 100%, too, that one has found the perfect
    *net dataset*. But can this performed within Filemaker? I hope I made myself
    clear.

    --


    Met vriendelijke groet / Mit freundlichen Grüßen / With kind regards
    Christoph Bouthillier
    post (-a-t-) no-no-spam tekstotaal dot com
    Please remove "no-no-spam" when e-mailing me directly - Thanks
    ------


    Christoph Guest

  2. #2

    Default Re: a tricky duplicates problem

    Christoph -- I think there's a way to create a near-perfect unique
    identifier, though I'm not good enough to suggest exactly how.

    1. Calculate an identifier field by creating a reasonably long alphameric
    string based on arithmetic and boolean operations performed on the basic
    data.

    2. This could theorticallly (but not probably) have duplicates, as in the
    example you cite. So... calculate a second identifier field by using some
    data from the preceding and/or following record.

    3. Use both fields to script a comparison and match search.

    Just a thought that came to mind as I read your post.

    Simon
    ================================================== ====================

    In article <c0ou8i$26c$cistron.nl>, "Christoph Bouthillier"
    <com> wrote:
     

    --
    Simon Lewenstein
    JPT Graphics, Irving, Texas
    Simon Guest

  3. #3

    Default Re: a tricky duplicates problem

    Dear Simon,

    Ah, that's a good idea, it mimics indeed the "paper-shifting" over a lit
    box! I'll give it a try - Thanks a lot!

    --


    Met vriendelijke groet / Mit freundlichen Grüßen / With kind regards
    Christoph Bouthillier
    post (-a-t-) no-no-spam tekstotaal dot com
    Please remove "no-no-spam" when e-mailing me directly - Thanks
    ------
    "Simon Lewenstein" <com> schreef in bericht
    news:0.1.7... [/ref]
    +/-, [/ref]
    There [/ref]
    up [/ref]
    banking [/ref]
    contiguous [/ref]
    fields. [/ref]
    day [/ref]
    of [/ref]
    50 [/ref]
    perfect [/ref]
    myself 
    >
    > --
    > Simon Lewenstein
    > JPT Graphics, Irving, Texas[/ref]


    Christoph Guest

Similar Threads

  1. A tricky query problem
    By quiero mas in forum Coldfusion Database Access
    Replies: 2
    Last Post: November 12th, 07:17 AM
  2. Tricky problem with WebControl vs. Panel
    By Ovidiu Birladeanu via .NET 247 in forum ASP.NET Building Controls
    Replies: 1
    Last Post: September 20th, 10:02 AM
  3. Tricky problem with classes!
    By Lars Plessmann in forum PHP Development
    Replies: 13
    Last Post: May 15th, 12:08 PM
  4. tricky disable button problem
    By sid vishus in forum Macromedia Flash Actionscript
    Replies: 1
    Last Post: March 4th, 07:58 PM

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139