java - Spark data processing with grouping


I need to group a number of CSV lines by a specific column and do some processing on each group.

    JavaRDD<String> lines = sc.textFile("somefile.csv");
    JavaPairRDD<String, String> pairRDD = lines.mapToPair(new SomeParser());
    List<String> keys = pairRDD.keys().distinct().collect();
    for (String key : keys) {
        List<String> rows = pairRDD.lookup(key);
        noOfVisits = rows.size();
        country = COMMA.split(rows.get(0))[6];
        accessDuration = getAccessDuration(rows, timeFormat);
        Map<String, Integer> counts = getCounts(rows);
        whitepapers = counts.get("whitepaper");
        tutorials = counts.get("tutorial");
        workshops = counts.get("workshop");
        casestudies = counts.get("casestudy");
        productPages = counts.get("productpage");
    }

    private static long dateParser(String dateString) throws ParseException {
        SimpleDateFormat format = new SimpleDateFormat("MMM dd yyyy HH:mma");
        Date date = format.parse(dateString);
        return date.getTime();
    }

getAccessDuration calls dateParser for each row and then takes the minimum and maximum timestamps to compute the access duration for the group. There is some other string matching as well.

pairRDD.lookup is extremely slow. Is there a better way to do this with Spark?

I think you can use that column as the key and do a groupByKey. There is no mention of the operation on those lines; if it is an operation that combines those lines in some way, you might even be able to use reduceByKey.

Something like this:

    import org.apache.spark.SparkContext._  // brings in the built-in pair RDD functions
    val pairs = lines.map(parser _)
    val grouped = pairs.groupByKey  // here grouped is of the form: (key, Iterable[String])
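From here, the per-group work from the question can be done on the grouped values directly, without any lookup. A minimal sketch of that step, where parseTimestamp and countPageTypes are assumed stand-ins for the question's dateParser and getCounts helpers (they are not from the original post):

    // Per-group processing on top of groupByKey (sketch; helper names assumed).
    val stats = grouped.mapValues { rows =>
      val rowList = rows.toList
      val noOfVisits = rowList.size
      val country = rowList.head.split(",")(6)   // country is column 6, as in the question
      val times = rowList.map(parseTimestamp)    // epoch millis per row
      val accessDuration = times.max - times.min // duration = max - min timestamp
      (noOfVisits, country, accessDuration, countPageTypes(rowList))
    }

Note that groupByKey still shuffles every row to build the groups; the aggregateByKey approach below avoids that.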

*edit: After looking at the process, I think it would be more effective to map each line to the data it contributes, and then use aggregateByKey to reduce it all to a total. aggregateByKey takes two functions and a zero value:

    def aggregateByKey[U: ClassTag](zeroValue: U)(seqOp: (U, V) => U, combOp: (U, U) => U): RDD[(K, U)]

The first function (seqOp) is a partition aggregator and will run efficiently through each local partition, creating locally aggregated partials per partition. The second function (combOp) takes those partials and combines them together into the final total.
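To make the two roles concrete, here is a minimal, self-contained example (not from the original answer) that computes a per-key count and sum in one pass:

    val data = sc.parallelize(Seq(("a", 3), ("a", 5), ("b", 2)))
    val countAndSum = data.aggregateByKey((0, 0))(
      (acc, v) => (acc._1 + 1, acc._2 + v),        // seqOp: fold one value into a local partial
      (p1, p2) => (p1._1 + p2._1, p1._2 + p2._2)   // combOp: merge partials across partitions
    )
    // => ("a", (2, 8)), ("b", (1, 2))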

Something like this:

    val lines = sc.textFile("somefile.csv")
    // parse returns a key and a decomposed Record of the values we track:
    // (key, Record("country", timestamp, "whitepaper", ...))
    val records = lines.map(parse(_))
    val totals = records.aggregateByKey((0, Set.empty[String], Long.MaxValue, Long.MinValue, Map.empty[String, Int]))(
      { case ((count, countrySet, minTime, maxTime, counterMap), record) =>
          (count + 1, countrySet + record.country, math.min(minTime, record.timestamp), math.max(maxTime, record.timestamp), ...)
      },
      (cumm1, cumm2) => ???  // combine the two partials field by field
    )
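Filling in what the answer leaves open, a complete sketch could look like the following. The Record case class, the column layout in parse, and the counter merging are assumptions for illustration, not part of the original answer:

    // Complete sketch: Record shape and column layout are assumed for illustration.
    case class Record(country: String, timestamp: Long, pageType: String)

    // Hypothetical parser: key in column 0, timestamp in column 1 (format taken
    // from the question), page type in column 2, country in column 6.
    def parse(line: String): (String, Record) = {
      val f = line.split(",")
      val fmt = new java.text.SimpleDateFormat("MMM dd yyyy HH:mma")
      (f(0), Record(f(6), fmt.parse(f(1)).getTime, f(2)))
    }

    type Acc = (Int, Set[String], Long, Long, Map[String, Int])
    val zero: Acc = (0, Set.empty[String], Long.MaxValue, Long.MinValue, Map.empty[String, Int])

    val totals = sc.textFile("somefile.csv").map(parse).aggregateByKey(zero)(
      // seqOp: fold one record into the partition-local partial
      { case ((count, countries, minT, maxT, counters), r) =>
          (count + 1,
           countries + r.country,
           math.min(minT, r.timestamp),
           math.max(maxT, r.timestamp),
           counters + (r.pageType -> (counters.getOrElse(r.pageType, 0) + 1)))
      },
      // combOp: merge two partials field by field
      { case ((c1, s1, min1, max1, m1), (c2, s2, min2, max2, m2)) =>
          (c1 + c2, s1 ++ s2, math.min(min1, min2), math.max(max1, max2),
           (m1.keySet ++ m2.keySet).map(k => k -> (m1.getOrElse(k, 0) + m2.getOrElse(k, 0))).toMap)
      }
    )
    // Per key: (noOfVisits, countries, minTimestamp, maxTimestamp, pageTypeCounts);
    // accessDuration = maxTimestamp - minTimestamp.

Only the small per-partition partials cross the network here, which is why this scales so much better than calling lookup once per key.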

