OWB Mapping, how to remove similar or duplicate data?

Dear all,

I intend to create a mapping to move data from my data source. I have this kind of data

Type of change in Date Employee_ID fpin fpout workhour

29/08/2014 1234, 1 trip duty 08.00 17.00 8

1234                              29/08/2014          1                                                            07.45          17.30          8.45

This table has no primary key and the constraint. On the ETL process, the target is not a given with double Employee_ID, Date and shift as given above. When there is a data load only the data that has a value in the Type column. Therefore, only the first data loaded.

I already use the deduplicator component to remove duplicate data, but it does not work in this case.

No idea how to filter such data in OWB?

Best regards

Akhmad H Gumas

Hello

There are several options here:

(1) use a grouping of the aggregation on Employee_ID operator (perhaps to the Date as well, since this seems to be your goal.)

All other attributes could be using the min, max, first, last aggregate functions.

But this way, it is difficult to get all the data in the first row (where the Type is filled)

(2) use analytical functions to qualify each record in front of a filter for the first line only.

This can be done using the sequence of operators like this:

source_table-> Employee_ID expression-> filter

Hereby, by using the following expression:

INPUTGRP1 (employe_id, Type)

OUTPUTGRP1 (DUPROWSEQ: row_number() over (partition of INGRP1.) Order of employe_id by INGRP1. TYPE DESC)

The filter will be hav efollowing expression:

INOUTGRP1. DUPROWSEQ = 1

As an example of following seqq generated code:

create table ttt (Employee_ID number, Date_w date, number of SHIFT, Type_w varchar2 (20), number, number, number of workhour fpout fpin);

insert into values of ttt ("1234, sysdate, 1, ' travel duty ', 8.0, 17.0, 8);
insert into values of ttt (1234, sysdate, 1 ", 7.45, 17.30, 8.45);

commit;

Select * from
(
Select row_number() over (partition by Employee_ID arrested by Type_w desc) DupRowSeq, ttt a a.*
)
where DupRowSeq = 1;

Hope this will help you more.

Bertram greetings

Tags: Business Intelligence

Similar Questions

  • How to remove an extension of data store (while keeping the files)?

    Hello world

    I am looking for a way to move/delete an extension of my single data store. My data store consists of 2 extends from ~ 150 GB each. The data on this subject would be inserted in the Go ~ 150 remaining.

    Is this possible? If yes how? ESXi automatically rearrange the data to avoid losing data while dropping an extension?

    Thank you

    Robert

    It is not possible to remove a measure and keep the original data store.  You must move your data, delete the data store and then re-create.

  • How to remove a store of data with force

    Last week we did a migration of old data spinning again spinning. After in the opposite direction of all the virtual machines to the new file server, we made all virtual machines online since the new file server. Now the thing is, I can't delete the existing data store (old spin). When I tried to delete the VC data store, it throws an error:

    error.JPG

    Y at - it a command/tip to remove the store of data with force?

    Storage.JPG

    Thank you

    Ganesh

    See this thread:

    http://communities.VMware.com/thread/89271

    André

  • How to remove a display duplicate in the resolution of the screen?

    I had a Windows 7 laptop to an external display connected to it.  My laptop has a resolution of 1920 x 1080, while my external monitor has a resolution of 1920 x 1200.  Initially, I had my external monitor hooked through my docking station, but I've since decided to plug in my external monitor to the HDMI port on my laptop.  When I did this, in the screen resolution Control Panel, I now have 3 screens!  The 3rd posting is currently DISABLED.  But when I decide to have two active screens, I can't do this each with their native resolution with 3rd current display - even if this were not true, I still want to have this '3rd view' deleted.

    Therefore, I have 2 questions:

    1. how 'remove' the 3rd display twice the screen resolution?  I tried to remove the double 'generic PNP' driver from the Device Manager Control Panel.  It did not work.  I then tried to turn it off - it does not work either.

    2 is it possible that when I unplug the HDMI cable from my laptop (like when I want to take my laptop somewhere) that the native computer screen is automatically enabled?  I'd rather NOT have my laptop screen to be active at all when I use my monitor (because they are of different resolutions), but that it has become active when I disconnect the HDMI cable, so if I forgot to transfer the laptop of my external monitor display when I take the laptop with me, I don't have a very expensive brick with an idle screen.

    Right-click for a resolution on the first page, select the monitor you want to remove, place "multiple" click on disable display-> press apply-> select "Multiple display" drop down again and now you will be presented with "delete this view"-> apply.

  • How to remove a line duplicate a table

    Hello

    I have two lines in a table of data same as duplicate. I want to remove a line from that.

    When I try, it was remove the two lines of the same data.

    Thank you

    Remove the table table_name
    where (rowid, column_name)
    not in
    (select min (rowid), column_name column_name PGE group)

    ex:

    Delete from emp
    Where (ROWID, empno)
    not in
    (select min (rowid), empno from emp by empno group)

  • The list of functions: how to remove similar values/strings in a list

    Hello

    I'm stuck with the following: I created several lists and they won't stay to show similar values or strings.
    e.g. my list resembles
    93,97,101,104,105,107,110,111,113,93,111
    each of the 93 and 111 values occur twice in this list. What function (or something?) deletes these two values, even if the list looks like
    93,97,101,104,105,107,110,111,113?

    I would like to do as well with a list containing the names. e.g.
    Tom Jones, kathy winter, daryll hana, steven jobs, hillary Hill, joshua fall, hillary Hill
    where hill hillary occurs twice. should look like to my list
    kathy winter, daryll hana, steven jobs, jones Tom, hillary Hill, joshua graves

    Thank you so much for a short solution!

    One trick is to use the fact that Coldfusion will automatically replace the keys in a struct, avoiding duplicates.

  • How to remove music files duplicate Windows Media Player library

    I would like to know if there is an easy way to remove a copy of the music from media player like mine, the seams to have reproduced all the

    I would like to know if there is an easy way to remove a copy of the music from media player like mine, the seams to have reproduced all the

    Easy, you tried to save/load from a USB key.  Stop of Z (or other) boot USB key > computer and doubles will disappear!

  • How to remove Firefox from Google data for me to master password CHG

    I have had my master password. Support instructions are to remove these data from Google.

    If you have forgotten your master password, you can reset it but to reset your password, it will remove all of your saved usernames and passwords.
    Follow the instructions in this article to reset your master password if you forgot

  • How to remove some of the data in the time capsule

    Can I just remove data backup in the time capsule by a certain date?

    No, it's almost impossible to mess with the TM backup. If you make them likely to have usable backups are minimal. You can try to limit the size of the sparsebundle... but you can't go back.

    I'm not sure of what you're trying to achieve... perhaps spell... but if your TC disk is full, my recommendation is to archive existing backups on USB connected to the TC... This option is available through the disc tab airport utility...

    Then erase the TC... also on the disk tab.

    And start new backups for all of your computer/s.

    Once you have a few months worth of the USB can be erased as the chances for need of an old backup file is very likely. Up to you... you can keep it forever if you want... or at least until the car dies.

  • How to remove the slash "/" article date when a form? Thank you!

    I tried to remove the forward slash ' / ' when you fill out a form, but I can't! Please help thanks!

    Go to tools - edit form. Then right click on this field and select Properties. Under the Format tab define the format of the None field.

  • How to remove an individual of data in a column

    Hi all
    I created a table like this script below


    CREATE TABLE WORK_ALLOC)
    (3) IDENTIFICATION NUMBER,
    FORMNAME VARCHAR2 (8).
    FORM_TYPE VARCHAR2 (2) NOT NULL);

    I insert data in the table above.
    Then I add a column called "CNT" Using ALTER TABLE statement.
    Can I update the table and enter a value in the first row

    UPDATE work_alloc
    NTC SET = '100'
    WHERE rownum = '1'


    Here are the data in my table

    CNT ID FORMNAME FO

    100 1 txactdef F
    2 txalerts F
    3 txbehacc EN

    Now, I need to delete the value '100' in the column of the CNT.
    Y at - it any possibity to remove the value. If Yes please answer.

    UPDATE work_alloc
    SET CNT = null

  • How to remove header, time and date of the Subvi "export waveforms to the spreadsheet file?

    I use 'Export waveforms for spreadsheet File.vi' in order to export the labview data into a file.

    However, the default format is the following:

    waveform [0]
    T0 13/11/2009 14:54:34
    Delta t 0.001000

    time Y [0]
    2009-11-13 14:54:34 - 2.441406E - 3
    2009-11-13 14:54:34 - 2.441406E - 3
    2009-11-13 14:54:34 0.000000E 0

    Yet I am interested in only the actual data without header or stamp date and time, for example:

    -2.441406E - 3

    -2.441406E - 3

    0.000000E + 0

    Could someone help me please with the adaptation of the Subvi to my needs?

    Transposes set to true.

  • How to remove the partition of data (D) on the Toshiba Satellite

    Hi, I have a Toshiba C660, which is part of the series of satellites. I got my laptop yesterday and while I was looking at the capacity of the hard drive that I have seen that there are 2 partitions. A partition is my C drive and the other is my D drive, drive D is called 'data' and I double click on the folder, there is a folder named "HDD Recovery." Is it not possible to delete this partition, because I don't think that this partition is necessary and I'd like to expand my C drive using the D drive space.

    Thank you

    Partition D, your description, would appear to be a recovery partition. If so, it is provided so that the computer can be recovered/restored to the State it's in when the computer was delivered to you.

    If you have other means in place to make this (backups of the entire system to storsge external, for example), the partition could be deleted and the space used for other purposes. If you do not have such a plan for backup in place, you can delete it anyway - it's your computer. However, it would be unwise, in my opinion - but it's not my computer.

    Happy computing and good luck.

    Tom Ferguson

  • dreaded default zero in calculated field - how to remove so that the data entered

    I'm not experienced with scripting in LiveCycle Designer ES2.

    That's what my PDF form looks like when it is opened by the user:

    Form.PNG

    But, I don't want the default $ 0.00 because some users can print the form and manually enter all areas. I want it to be empty, until the data is added in one of the hourly rate or the Total number of hours of the contract.

    Capture.PNG

    It comes to the design of form with tab linking objects (and as Name() use data binding to):

    TtlHrs

    HrlyRate

    VacRate (this is a protected field to set the value by default. 04; as 4% on the form when it is opened)

    It is the simple calculation in FormCalc for the TOTAL field:

    TtlHrs * HrlyRate + (TtlHrs * HrlyRate * VacRate) which gives the default value of 0 in the form.

    I want to write a script that will delete the 0 until data is entered in one of the first two fields. I tried using the code suggested in other posts, but it does not work - just keep getting syntax errors.

    If you reply, please indicate whether your script is for the event to Java or FormCalc claculate.

    Your calculation in formcalc should be something like:

    If (TtlHrs.isNull == 0 and HrlyRate.isNull == 0) then

    $ = TtlHrs * HrlyRate + (TtlHrs * HrlyRate * VacRate)

    else $ = «»

    endif

  • Options / Applications I have many double entries. How to remove or delete a double entry?

    Options / Applications I have much double entries of Content Type with the same applications. How to remove or delete duplicate entries, please?

    Hover ads by Type of content and a ToolTip appears, you will see slight differences in the MIME type for this may look like identical entries.

Maybe you are looking for