Replies: 36 comments 8 replies
-
You can use a workaround: However, this workaround is only valid, as long as the timestamps are all distinct. Otherwise the grouping would actually aggregate more than one value... |
Beta Was this translation helpful? Give feedback.
-
Good workaround. I still would like to see this as a feature. I.e. aggregations should be optional, since they may be done during the query. |
Beta Was this translation helpful? Give feedback.
-
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. For admin, please label this issue |
Beta Was this translation helpful? Give feedback.
-
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. For admin, please label this issue |
Beta Was this translation helpful? Give feedback.
-
I'd like to see this feature, too. having data in two columns and then plotting the first column against the second column is the base case, in my opinion. it's very confusing to not have this as an option for a first-time user (such as myself). |
Beta Was this translation helpful? Give feedback.
-
This shouldn't be too difficult to implement. I'll take a stab at adding this in the coming weeks. |
Beta Was this translation helpful? Give feedback.
-
@villebro I would love to see this feature happen, let's make it post 1.0 item! added to roadmap inbox :) https://github.com/apache-superset/superset-roadmap/projects/1 |
Beta Was this translation helpful? Give feedback.
-
Kinda baffled that this wouldn't have been the very first behaviour implemented, with aggregates coming after |
Beta Was this translation helpful? Give feedback.
-
Superset's explorer is used to explore multidimensional datasets, and semantically metrics in Superset are strictly defined aggregate expressions. That is the case for metric definitions in most BI tools. The dimensions / metric mental model is widely accepted and generally easy to reason about. Doing a sum of a single row is valid, personally fail to see why people see this as a problem. If you happen to add other columns/dimensions to your dataset, things will still work. There are complex implications here but if you'd like to say "I don't want Superset to not aggregate this" through the UI and happen to have duplicates in your dataset for whatever reason, or simply by check that box by mistake, there's a whole lot of implications, like dealing with high volume data and/or duplicates. In the current model, Superset has guaranties around the granularity of the queries it generates, if that's not the case, Superset has to trust that the user is right, or assert that the grain of the query is the one expected. Handling these exceptions and communicating them to the user "hey looks like you have dups an you should use an aggregate function" seem overall harder and less intuitive than the original proposition: "metrics are aggregate expressions". |
Beta Was this translation helpful? Give feedback.
-
I think that we need to see that there are two worlds. The thing is, that these worlds happen to converge at some companies. I agree on your concerns regarding the potentially huge amount or returned data, @mistercrunch. However, I think that the LIMITs that Superset applies anyway will minimize the damage here. We just need to ensure that the user is well aware of the applied limit. |
Beta Was this translation helpful? Give feedback.
-
@manikanta-dornala nicely summarised |
Beta Was this translation helpful? Give feedback.
-
Bump x100 (is that allowed?) I have simple time series data that corresponds to sensor data. time (seconds) = [0, 1, 2, 3] I want to plot Time against Data, but I can't do it without aggregating values. Can anyone explain to me why this is not supported? It's the only thing hindering me from using Superset.
|
Beta Was this translation helpful? Give feedback.
-
@rusackas, I know that tagging is immoral, but I hope that you rather appreciate to be made aware of this pain point which users have reported for years now. I have been in contact with many different users of different backgrounds and at different companies, and this is the No. 1 (now that the generic x-axis is implemented). Is there any awareness and/or roadmap for un-aggregated plotting? I wonder if Preset users don't also request this feature. |
Beta Was this translation helpful? Give feedback.
-
@srinify I know your team is working on this, would you be able to shed some light on what you're building? Which reminds me I will deliver those action points I promised in our last chat. |
Beta Was this translation helpful? Give feedback.
-
For the rest of the people I'd like to remind you that this is an Open Source Community, and everyone is happily encouraged to step up and help develop this feature if it's blocking them. |
Beta Was this translation helpful? Give feedback.
-
Hey @villebro , I think others might be willing to work on this if the Superset team acknowledged that this is desirable. @mistercrunch made the case that this feature doesn't belong in Superset at all and that Superset is only meant for the subset of a subset of BI data visualization which only allows aggregated metrics. Given that the Superset team has verbally invalidated the needs for non-aggregated visualization in this tool, nobody will want to spend their valuable time forking and maintaining their own version Superset when there are other tools that already do the job that they can use. |
Beta Was this translation helpful? Give feedback.
-
@jpedrick thanks for sharing some additional background. I think others can also chime in here, but I think it might be a good idea to reiterate some core pillars about how Apache projects operate. Firstly, no one person makes decisions for the project - rather, the project as a collective makes decisions. While it's true that people who have an active role in the codebase can be seen as having more influence over the project than, say, a person who hasn't contributed commits, this is not strictly so - anyone has the power to propose new features, and if if there is enough momentum and community support to back them up, they will most certainly be seriously considered. I can't count the times I've reversed my own position on something after getting pushback from non-committers. And IMO, this is one of the best aspects of OSS. I completely agree that having a non-aggregating scatterplot absolutely makes sense, and just this week we met (virtually) with @srinify and team to discuss this very topic. Please check the Slack thread here that kickstarted the effort: https://apache-superset.slack.com/archives/C0170U650CQ/p1679361957397339 . If there are others who feel strongly about this feature, e.g. have ideas about how a non-aggregated scatterplot should work or would even be open to contribute to the feature, please do speak up! If there's broad interest in this I'm happy to setup a dedicated Slack channel for this to make it easier to coordinate collaboration, and it would definitely help to start a dev@ discussion about it to make sure it gets broad visibility. At the end of the day this project lives and dies by its community, and if the community feels we're not serving their interests, then we're not doing a good job. For that reason I apologize for not having been more active in this discussion, and hope we can turn a new page on this discussion and make sure Superset keeps evolving in a direction that the community agrees with. |
Beta Was this translation helpful? Give feedback.
-
Any update on this? |
Beta Was this translation helpful? Give feedback.
-
I encountered the same need today, bump. |
Beta Was this translation helpful? Give feedback.
-
I installed Superset and the first simple thing I needed I couldn't see: a simple graphic with x vs y with 2 columns of the table. |
Beta Was this translation helpful? Give feedback.
-
I'd like to see a gauge of the latest value in my dataset whatever the time in the datesource may be. |
Beta Was this translation helpful? Give feedback.
-
+1 need this improvement. |
Beta Was this translation helpful? Give feedback.
-
Hi everyone, This issue/discussion is 5 years old, and is not a bug but rather a feature request. I'm closing it for those reasons, mainly. There's also the consideration that if someone does want to implement this change, it's a fundamental departure from how Superset currently operates, and should be proposed as a SIP that considers all the implications for things like how unaggregated data would be visualized by all plugins (i.e. what happens with a line chart when you have many Y values on a single X value), what error/edge cases that night impose and how to solve them, what performance issues we might face (e.g. more data over the wire) scaling/performance issues, etc. We're open to proposals here, but this issue is not a bug, does not have an implementation plan, and is full of bump comments that do not contribute to its success. Again, I'm not closing this to sweep it under the rug, but in this issue/discussion's current state, it's effectively unactionable. if anyone seriously wants to contribute towards this, we're happy to continue the discussion on the dev list, on a rebooted ideas thread here on GitHub Discussions, on slack, at Town Hall, or any other appropriate venue. This thread doesn't seem to be going in a constructive direction at this point, but I'm more than happy to reopen it if anyone disagrees. |
Beta Was this translation helpful? Give feedback.
-
We have a query that returns timeseries data with two columns:
I would like to simply graph the data as a line chart. However, when selecting the line chart, it requires that I choose a metric for the data, e.g. sum, average, etc
Since my data are already in the desired form, how can I just tell the chart to use the data as-is?
Note: I can also select the "sum" metric, since I am not performing any additional aggregation, but this seems a bit "kludgy".
Beta Was this translation helpful? Give feedback.
All reactions